THE MOLECULAR CHALLENGE


Sir Ethylene, to scientists fair prey, 

(Who dig and delve and peek and push and pry, 

And prove their findings with equations sly) 

Smoothed out his ruffled orbitals, to say: 

“I stand in symmetry. Mine is a way 

Of mystery and magic. Ancient, I 

Am also deemed immortal. Should I die, 

Pi would be in the sky, and Judgement Day 

Would be upon us. For all things must fail, 

That hold our universe together, when 

Bonds such as bind me fail, and fall asunder. 

Hence, stand I firm against the endless hail 

Of scientific blows. I yield not.” Men 

And their computers stand and stare and wonder. 
W.G. LOWE 
Preface to the Third Edition


We have attempted to improve and update this text while retaining the features that 
make it unique, namely, an emphasis on physical understanding, and the ability to 
estimate, evaluate, and predict results without blind reliance on computers, while still 
maintaining rigorous connection to the mathematical basis for quantum chemistry. We 
have inserted into most chapters examples that allow important points to be emphasized, 
clarified, or extended. This has enabled us to keep intact most of the conceptual 
development familiar to past users. In addition, many of the chapters now include 
multiple choice questions that students are invited to solve in their heads. This is not 
because we think that instructors will be using such questions. Rather it is because we 
find that such questions permit us to highlight some of the definitions or conclusions 
that students often find most confusing far more quickly and effectively than we can 
by using traditional problems. Of course, we have also sought to update material 
on computational methods, since these are changing rapidly as the field of quantum 
chemistry matures. 

This book is written for courses taught at the first-year graduate/senior undergraduate 
levels, which accounts for its implicit assumption that many readers will be relatively 
unfamiliar with much of the mathematics and physics underlying the subject. Our 
experience over the years has supported this assumption; many chemistry majors are 
exposed to the requisite mathematics and physics, yet arrive at our courses with poor 
understanding or recall of those subjects. That makes this course an opportunity for 
such students to experience the satisfaction of finally seeing how mathematics, physics, 
and chemistry are intertwined in quantum chemistry. It is for this reason that treatments 
of the simple and extended Hückel methods continue to appear, even though these are no 
longer the methods of choice for serious computations. These topics nevertheless form 
the basis for the way most non-theoretical chemists understand chemical processes, 
just as we tend to think about gas behavior as “ideal, with corrections.”
Preface to the Second Edition


The success of the first edition has warranted a second. The changes I have made reflect 
my perception that the book has mostly been used as a teaching text in introductory 
courses. Accordingly, I have removed some of the material in appendixes on mathemat- 
ical details of solving matrix equations on a computer. Also I have removed computer 
listings for programs, since these are now commonly available through commercial 
channels. I have added a new chapter on MO theory of periodic systems—a subject 
of rapidly growing importance in theoretical chemistry and materials science and one 
for which chemists still have difficulty finding appropriate textbook treatments. I have 
augmented discussion in various chapters to give improved coverage of time-dependent 
phenomena and atomic term symbols and have provided better connection to scatter- 
ing as well as to spectroscopy of molecular rotation and vibration. The discussion 
on degenerate-level perturbation theory is clearer, reflecting my own improved under- 
standing since writing the first edition. There is also a new section on operator methods 
for treating angular momentum. Some teachers are strong adherents of this approach, 
while others prefer an approach that avoids the formalism of operator techniques. To 
permit both teaching methods, I have placed this material in an appendix. Because this 
edition is more overtly a text than a monograph, I have not attempted to replace older 
literature references with newer ones, except in cases where there was pedagogical 
benefit. 

A strength of this book has been its emphasis on physical argument and analogy (as 
opposed to pure mathematical development). I continue to be a strong proponent of 
the view that true understanding comes with being able to “see” a situation so clearly 
that one can solve problems in one's head. There are significantly more end-of-chapter 
problems, a number of them of the “by inspection” type. There are also more questions
inviting students to explain their answers. I believe that thinking about such questions, 
and then reading explanations from the answer section, significantly enhances learning. 

It is the fashion today to focus on state-of-the-art methods for just about everything.
The impact of this on education has, I feel, been disastrous. Simpler examples are often 
needed to develop the insight that enables understanding the complexities of the latest 
techniques, but too often these are abandoned in the rush to get to the "cutting edge." 
For this reason I continue to include a substantial treatment of simple Hückel theory. 
It permits students to recognize the connections between MOs and their energies and 
bonding properties, and it allows me to present examples and problems that have max- 
imum transparency in later chapters on perturbation theory, group theory, qualitative 
MO theory, and periodic systems. I find simple Hückel theory to be educationally 
indispensable. 


Much of the new material in this edition results from new insights I have developed 
in connection with research projects with graduate students. The work of all four of 
my students since the appearance of the first edition is represented, and I am delighted 
to thank Sherif Kafafi, John LaFemina, Maribel Soto, and Deb Camper for all I have 
learned from them. Special thanks are due to Professor Terry Carlton, of Oberlin 
College, who made many suggestions and corrections that have been adopted in the 
new edition. 

Doubtless, there are new errors. I would be grateful to learn of them so that future 
printings of this edition can be made error-free. Students or teachers with comments, 
questions, or corrections are more than welcome to contact me, either by mail at the 
Department of Chemistry, 152 Davey Lab, The Pennsylvania State University, Univer- 
sity Park, PA 16802, or by e-mail directed to JL3 at PSUVM.PSU.EDU. 




Preface to the First Edition 


My aim in this book is to present a reasonably rigorous treatment of molecular orbital 
theory, embracing subjects that are of practical interest to organic and inorganic as well 
as physical chemists. My approach here has been to rely on physical intuition as much 
as possible, first solving a number of specific problems in order to develop sufficient 
insight and familiarity to make the formal treatment of Chapter 6 more palatable. My 
own experience suggests that most chemists find this route the most natural. 

I have assumed that the reader has at some time learned calculus and elementary 
physics, but I have not assumed that this material is fresh in his or her mind. Other 
mathematics is developed as it is needed. The book could be used as a text for under- 
graduate or graduate students in a half or full year course. The level of rigor of the book 
is somewhat adjustable. For example, Chapters 3 and 4, on the harmonic oscillator and 
hydrogen atom, can be truncated if one wishes to know the nature of the solutions, but 
not the mathematical details of how they are produced. 

I have made use of appendixes for certain of the more complicated derivations or 
proofs. This is done in order to avoid having the development of major ideas in the 
text interrupted or obscured. Certain of the appendixes will interest only the more 
theoretically inclined student. Also, because I anticipate that some readers may wish 
to skip certain chapters or parts of chapters, I have occasionally repeated information 
so that a given chapter will be less dependent on its predecessors. This may seem 
inelegant at times, but most students will more readily forgive repetition of something 
they already know than an overly terse presentation. 

I have avoided early usage of bra-ket notation. I believe that simultaneous intro- 
duction of new concepts and unfamiliar notation is poor pedagogy. Bra-ket notation is 
used only after the ideas have had a chance to jell.

Problem solving is extremely important in acquiring an understanding of quantum 
chemistry. I have included a fair number of problems with hints for a few of them in 
Appendix 14 and answers for almost all of them in Appendix 15.¹

It is inevitable that one be selective in choosing topics for a book such as this. This 
book emphasizes ground state MO theory of molecules more than do most introductory 
texts, with rather less emphasis on spectroscopy than is usual. Angular momentum 
is treated at a fairly elementary level at various appropriate places in the text, but 
it is never given a full-blown formal development using operator commutation relations.
Time-dependent phenomena are not included. Thus, scattering theory is absent,
although selection rules and the transition dipole are discussed in the chapter on
time-independent perturbation theory. Valence-bond theory is completely absent. If I have
succeeded in my effort to provide a clear and meaningful treatment of topics relevant to
modern molecular orbital theory, it should not be difficult for an instructor to provide
for excursions into related topics not covered in the text.

¹ In this Second Edition, these Appendices are numbered Appendix 12 and 13.

Over the years, many colleagues have been kind enough to read sections of the 
evolving manuscript and provide corrections and advice. I especially thank L. P. Gold 
and O. H. Crawford, who cheerfully bore the brunt of this task. 

Finally, I would like to thank my father, Wesley G. Lowe, for allowing me to include 
his sonnet, “The Molecular Challenge.” 


Chapter 1 


Classical Waves 
and the Time-Independent 
Schrödinger Wave Equation


1-1 Introduction


The application of quantum-mechanical principles to chemical problems has revolu- 
tionized the field of chemistry. Today our understanding of chemical bonding, spectral 
phenomena, molecular reactivities, and various other fundamental chemical problems 
rests heavily on our knowledge of the detailed behavior of electrons in atoms and 
molecules. In this book we shall describe in detail some of the basic principles, 
methods, and results of quantum chemistry that lead to our understanding of electron 
behavior. 

In the first few chapters we shall discuss some simple, but important, particle systems.
This will allow us to introduce many basic concepts and definitions in a fairly physical
way. Thus, some background will be prepared for the more formal general development
of Chapter 6. In this first chapter, we review briefly some of the concepts of classical
physics as well as some early indications that classical physics is not sufficient to explain
all phenomena. (Those readers who are already familiar with the physics of classical 
waves and with early atomic physics may prefer to jump ahead to Section 1-7.) 


1-2 Waves
1-2.A Traveling Waves 


A very simple example of a traveling wave is provided by cracking a whip. A pulse of 
energy is imparted to the whipcord by a single oscillation of the handle. This results 
in a wave which travels down the cord, transferring the energy to the popper at the end 
of the whip. In Fig. 1-1, an idealization of the process is sketched. The shape of the 
disturbance in the whip is called the wave profile and is usually symbolized $\psi(x)$. The
wave profile for the traveling wave in Fig. 1-1 shows where the energy is located at a 
given instant. It also contains the information needed to tell how much energy is being 
transmitted, because the height and shape of the wave reflect the vigor with which the 
handle was oscillated. 
Figure 1-1 » Cracking the whip. As time passes, the disturbance moves from left to right along
the extended whip cord. Each segment of the cord oscillates up and down as the disturbance passes 
by, ultimately returning to its equilibrium position. 


The feature common to all traveling waves in classical physics is that energy is trans- 
mitted through a medium. The medium itself undergoes no permanent displacement; 
it merely undergoes local oscillations as the disturbance passes through. 

One of the most important kinds of wave in physics is the harmonic wave, for which 
the wave profile is a sinusoidal function. A harmonic wave, at a particular instant in time, 
is sketched in Fig. 1-2. The maximum displacement of the wave from the rest position 
is the amplitude of the wave, and the wavelength $\lambda$ is the distance required to enclose
one complete oscillation. Such a wave would result from a harmonic¹ oscillation at
one end of a taut string. Analogous waves would be produced on the surface of a quiet 
pool by a vibrating bob, or in air by a vibrating tuning fork. 

At the instant depicted in Fig. 1-2, the profile is described by the function 


$$\psi(x) = A\sin(2\pi x/\lambda) \tag{1-1}$$


($\psi = 0$ when $x = 0$, and the argument of the sine function goes from 0 to $2\pi$, encompassing
one complete oscillation as $x$ goes from 0 to $\lambda$.) Let us suppose that the situation
in Fig. 1-2 pertains at the time $t = 0$, and let the velocity of the disturbance through the
medium be $c$. Then, after time $t$, the distance traveled is $ct$, the profile is shifted to the
right by $ct$, and is now given by


$$\Psi(x,t) = A\sin[(2\pi/\lambda)(x - ct)] \tag{1-2}$$


Figure 1-2 » A harmonic wave at a particular instant in time. $A$ is the amplitude and $\lambda$ is the
wavelength.


¹ A harmonic oscillation is one whose equation of motion has a sine or cosine dependence on time.
A capital $\Psi$ is used to distinguish the time-dependent function (1-2) from the
time-independent function (1-1).

The frequency $\nu$ of a wave is the number of individual repeating wave units passing
a point per unit time. For our harmonic wave, this is the distance traveled in unit time,
$c$, divided by the length of a wave unit, $\lambda$. Hence,


$$\nu = c/\lambda \tag{1-3}$$

Note that the wave described by the formula

$$\Psi'(x,t) = A\sin[(2\pi/\lambda)(x - ct) + \epsilon] \tag{1-4}$$


is similar to $\Psi$ of Eq. (1-2) except for being displaced. If we compare the two waves
at the same instant in time, we find $\Psi'$ to be shifted to the left of $\Psi$ by $\epsilon\lambda/2\pi$. If
$\epsilon = \pi, 3\pi, \ldots$, then $\Psi'$ is shifted by $\lambda/2, 3\lambda/2, \ldots$ and the two functions are said to be
exactly out of phase. If $\epsilon = 2\pi, 4\pi, \ldots$, the shift is by $\lambda, 2\lambda, \ldots$, and the two waves
are exactly in phase. $\epsilon$ is the phase factor for $\Psi'$ relative to $\Psi$. Alternatively, we can
compare the two waves at the same point in $x$, in which case the phase factor causes
the two waves to be displaced from each other in time.
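
The relations of this section can be checked numerically. The following is a minimal sketch, not part of the original text: the function name traveling_wave and the numerical values of the amplitude, wavelength, speed, and phase are arbitrary choices made only for illustration. It evaluates Eqs. (1-2) and (1-4), computes the frequency from Eq. (1-3), and confirms that a phase factor $\epsilon$ displaces the profile to the left by $\epsilon\lambda/2\pi$.

```python
import math

def traveling_wave(x, t, amplitude=1.0, wavelength=2.0, speed=3.0, phase=0.0):
    """Eq. (1-4): A sin[(2 pi/lambda)(x - c t) + epsilon]; phase=0 gives Eq. (1-2)."""
    return amplitude * math.sin((2 * math.pi / wavelength) * (x - speed * t) + phase)

wavelength, speed = 2.0, 3.0      # illustrative values, arbitrary units
frequency = speed / wavelength    # Eq. (1-3): nu = c / lambda

# A phase epsilon shifts the profile to the left by epsilon * lambda / (2 pi),
# so the shifted wave at x equals the unshifted wave sampled at x + shift.
epsilon = math.pi / 3
shift = epsilon * wavelength / (2 * math.pi)
x, t = 0.7, 0.25
shifted = traveling_wave(x, t, phase=epsilon)
reference = traveling_wave(x + shift, t)

print("frequency:", frequency)
print("shift:", shift)
print("profiles agree:", abs(shifted - reference) < 1e-12)
```

Setting phase=math.pi in the same function gives a wave that is the negative of the unshifted one at every point, which is the "exactly out of phase" case described above.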


1-2.B Standing Waves 


In problems of physical interest, the medium is usually subject to constraints. For 
example, a string will have ends, and these may be clamped, as in a violin, so that 
they cannot oscillate when the disturbance reaches them. Under such circumstances, 
the energy pulse is unable to progress further. It cannot be absorbed by the clamping 
mechanism if it is perfectly rigid, and it has no choice but to travel back along the string 
in the opposite direction. The reflected wave is now moving into the face of the primary 
wave, and the motion of the string is in response to the demands placed on it by the two 
simultaneous waves: 


$$\Psi(x,t) = \Psi_{\text{primary}}(x,t) + \Psi_{\text{reflected}}(x,t) \tag{1-5}$$


When the primary and reflected waves have the same amplitude and speed, we can 
write 


$$\Psi(x,t) = A\sin[(2\pi/\lambda)(x - ct)] + A\sin[(2\pi/\lambda)(x + ct)] = 2A\sin(2\pi x/\lambda)\cos(2\pi ct/\lambda) \tag{1-6}$$


This formula describes a standing wave, a wave that does not appear to travel through
the medium, but appears to vibrate "in place." The first part of the function depends
only on the $x$ variable. Wherever the sine function vanishes, $\Psi$ will vanish, regardless
of the value of $t$. This means that there are places where the medium does not ever
vibrate. Such places are called nodes. Between the nodes, $\sin(2\pi x/\lambda)$ is nonzero. As
time passes, the cosine function oscillates between plus and minus unity. This means
that $\Psi$ oscillates between plus and minus the value of $2A\sin(2\pi x/\lambda)$. We say that the
$x$-dependent part of the function gives the maximum displacement of the standing wave,
and the $t$-dependent part governs the motion of the medium back and forth between
these extremes of maximum displacement. A standing wave with a central node is
shown in Fig. 1-3.
Figure 1-3 » A standing wave in a string clamped at $x = 0$ and $x = L$. The wavelength $\lambda$ is equal
to $L$.
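
As a rough numerical illustration of Eq. (1-6), the sketch below compares the sum of the primary and reflected traveling waves with the product form, and checks that the displacement stays zero at the nodes at all sampled times. The function names and the values of the amplitude, wavelength, and speed are assumptions chosen only for the example; the wavelength is set equal to a string length of 2 so that the nodes match the situation in Fig. 1-3.

```python
import math

A, wavelength, speed = 1.0, 2.0, 3.0   # illustrative values, arbitrary units
k = 2 * math.pi / wavelength

def superposition(x, t):
    """Left-hand side of Eq. (1-6): primary plus reflected traveling wave."""
    return A * math.sin(k * (x - speed * t)) + A * math.sin(k * (x + speed * t))

def standing_wave(x, t):
    """Right-hand side of Eq. (1-6): 2A sin(2 pi x/lambda) cos(2 pi c t/lambda)."""
    return 2 * A * math.sin(k * x) * math.cos(k * speed * t)

# The two forms agree at arbitrary sample points (x, t).
samples = [(0.3, 0.0), (0.9, 0.4), (1.7, 1.1)]
max_diff = max(abs(superposition(x, t) - standing_wave(x, t)) for x, t in samples)

# Nodes sit where sin(2 pi x/lambda) = 0, that is, at multiples of lambda/2.
nodes = [n * wavelength / 2 for n in range(3)]
node_displacements = [standing_wave(x, t) for x in nodes for t in (0.0, 0.13, 0.5)]

print("largest difference between the two forms:", max_diff)
print("node positions:", nodes)
print("largest displacement at a node:", max(abs(d) for d in node_displacements))
```

With these values the nodes fall at the two clamped ends and at the center of the string.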


Equation (1-6) is often written as

$$\Psi(x,t) = \psi(x)\cos(\omega t) \tag{1-7}$$

where

$$\omega = 2\pi c/\lambda \tag{1-8}$$


The profile $\psi(x)$ is often called the amplitude function and $\omega$ is the frequency factor.

Let us consider how the energy is stored in the vibrating string depicted in Fig. 1-3. 
The string segments at the central node and at the clamped endpoints of the string 
do not move. Hence, their kinetic energies are zero at all times. Furthermore, since 
they are never displaced from their equilibrium positions, their potential energies are 
likewise always zero. Therefore, the total energy stored at these segments is always 
zero as long as the string continues to vibrate in the mode shown. The maximum kinetic 
and potential energies are associated with those segments located at the wave peaks 
and valleys (called the antinodes) because these segments have the greatest average 
velocity and displacement from the equilibrium position. A more detailed mathematical 
treatment would show that the total energy of any string segment is proportional to 
$\psi(x)^2$ (Problem 1-7).


1-3 The Classical Wave Equation


It is one thing to draw a picture of a wave and describe its properties, and quite another
to predict what sort of wave will result from disturbing a particular system. To make 
such predictions, we must consider the physical laws that the medium must obey. One 
condition is that the medium must obey Newton's laws of motion. For example, any 
segment of string of mass m subjected to a force F must undergo an acceleration of F/m 
in accord with Newton's second law. In this regard, wave motion is perfectly consistent 
with ordinary particle motion. Another condition, however, peculiar to waves, is that 
each segment of the medium is "attached" to the neighboring segments so that, as 
it is displaced, it drags along its neighbor, which in turn drags along its neighbor,
etc. This provides the mechanism whereby the disturbance is propagated along the
medium.²

Figure 1-4 » A segment of string under tension $T$. The forces at each end of the segment are
decomposed into forces perpendicular and parallel to $x$.

Let us consider a string under a tensile force $T$. When the string is displaced from
its equilibrium position, this tension is responsible for exerting a restoring force. For
example, observe the string segment associated with the region $x$ to $x + dx$ in Fig. 1-4.
Note that the tension exerted at either end of this segment can be decomposed into
components parallel and perpendicular to the $x$ axis. The parallel component tends to
stretch the string (which, however, we assume to be unstretchable); the perpendicular
component acts to accelerate the segment toward or away from the rest position. At
the right end of the segment, the perpendicular component $F$ divided by the horizontal
component gives the slope of $T$. However, for small deviations of the string from
equilibrium (that is, for small angle $\alpha$) the horizontal component is nearly equal in
length to the vector $T$. This means that it is a good approximation to write


$$\text{slope of vector } T = F/T \quad \text{at } x + dx \tag{1-9}$$

But the slope is also given by the derivative of $\Psi$, and so we can write

$$F_{x+dx} = T(\partial\Psi/\partial x)_{x+dx} \tag{1-10}$$


At the other end of the segment the tensile force acts in the opposite direction, and we 
have 


$$F_x = -T(\partial\Psi/\partial x)_x \tag{1-11}$$

The net perpendicular force on our string segment is the resultant of these two:

$$F = T[(\partial\Psi/\partial x)_{x+dx} - (\partial\Psi/\partial x)_x] \tag{1-12}$$


The difference in slope at two infinitesimally separated points, divided by dx, is by 
definition the second derivative of a function. Therefore, 


$$F = T\,(\partial^2\Psi/\partial x^2)\,dx \tag{1-13}$$


² Fluids are of relatively low viscosity, so the tendency of one segment to drag along its neighbor is weak. For
this reason fluids are poor transmitters of transverse waves (waves in which the medium oscillates in a direction 
perpendicular to the direction of propagation). In compression waves, one segment displaces the next by pushing 
it. Here the requirement is that the medium possess elasticity for compression. Solids and fluids often meet this 
requirement well enough to transmit compression waves. The ability of rigid solids to transmit both wave types 
while fluids transmit only one type is the basis for using earthquake-induced waves to determine how deep the 
solid part of the earth’s mantle extends. 
Equation (1-13) gives the force on our string segment. If the string has mass $m$ per
unit length, then the segment has mass $m\,dx$, and Newton's equation $F = ma$ may be
written


$$T\,\partial^2\Psi/\partial x^2 = m\,\partial^2\Psi/\partial t^2 \tag{1-14}$$


where we recall that acceleration is the second derivative of position with respect to time. 
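
One way to build confidence in Eq. (1-14) is to test it on the traveling harmonic wave of Eq. (1-2). The sketch below does this with central finite differences. It assumes that the wave speed is related to the string properties by $c = \sqrt{T/m}$, a relation that the text has not stated at this point; the values of the tension, mass per unit length, wavelength, and sample point are likewise arbitrary choices for illustration.

```python
import math

tension, mass_per_length = 4.0, 1.0            # illustrative T and m, arbitrary units
speed = math.sqrt(tension / mass_per_length)   # assumed relation c = sqrt(T/m)
wavelength = 2.0

def psi(x, t):
    """Traveling harmonic wave of Eq. (1-2) with unit amplitude."""
    return math.sin((2 * math.pi / wavelength) * (x - speed * t))

def second_difference(f, u, h=1e-4):
    """Central finite-difference estimate of the second derivative of f at u."""
    return (f(u + h) - 2 * f(u) + f(u - h)) / h**2

x0, t0 = 0.3, 0.2
d2_dx2 = second_difference(lambda x: psi(x, t0), x0)
d2_dt2 = second_difference(lambda t: psi(x0, t), t0)

lhs = tension * d2_dx2            # T * d2Psi/dx2
rhs = mass_per_length * d2_dt2    # m * d2Psi/dt2
print("T d2Psi/dx2 =", lhs)
print("m d2Psi/dt2 =", rhs)
```

The two printed values agree to within the finite-difference error. Choosing a speed other than $\sqrt{T/m}$ would make them differ, which is a simple way to see that the wave equation constrains how fast a disturbance can travel along a given string.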

Equation (1-14) is the wave equation for motion in a string of uniform density 
under tension T. It should be evident that its derivation involves nothing fundamental 
beyond Newton's second law and the fact that the two ends of the segment are linked 
to each other and to a common tensile force. Generalizing this equation to waves in 
three-dimensional media gives 


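$$\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}\right)\Psi(x,y,z,t) = \beta\,\frac{\partial^2}{\partial t^2}\Psi(x,y,z,t) \tag{1-15}$$


where $\beta$ is a composite of physical quantities (analogous to $m/T$) for the particular
system.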

Returning to our string example, we have in Eq. (1-14) a time-dependent differential 
equation. Suppose we wish to limit our consideration to standing waves that can be 
separated into a space-dependent amplitude function and a harmonic time-dependent 
function. Then 


$$\Psi(x,t) = \psi(x)\cos(\omega t) \tag{1-16}$$


and the differential equation becomes 


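$$\frac{\partial^2\psi(x)}{\partial x^2}\cos(\omega t) = -\frac{m}{T}\,\psi(x)\,\omega^2\cos(\omega t) \tag{1-17}$$

or, dividing by $\cos(\omega t)$,

$$d^2\psi(x)/dx^2 = -(\omega^2 m/T)\,\psi(x) \tag{1-18}$$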


This is the classical time-independent wave equation for a string. 

We can see by inspection what kind of function $\psi(x)$ must be to satisfy Eq. (1-18).
$\psi$ is a function that, when twice differentiated, is reproduced with a coefficient of
$-\omega^2 m/T$. One solution is


$$\psi = A\sin\left(\omega\sqrt{m/T}\,x\right) \tag{1-19}$$


This illustrates that Eq. (1-18) has sinusoidally varying solutions such as those discussed
in Section 1-2. Comparing Eq. (1-19) with (1-1) indicates that $2\pi/\lambda = \omega\sqrt{m/T}$.
Substituting this relation into Eq. (1-18) gives


$$d^2\psi(x)/dx^2 = -(2\pi/\lambda)^2\psi(x) \tag{1-20}$$


which is a more useful form for our purposes. 
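
The step from Eq. (1-18) to Eq. (1-20) can be illustrated numerically. In the sketch below, the trial solution of Eq. (1-19) is differentiated twice by finite differences and compared with the right-hand sides of both equations. The values chosen for $T$, $m$, $\omega$, the amplitude, and the sample point are hypothetical and serve only as an example.

```python
import math

tension, mass_per_length, omega = 4.0, 1.0, 6.0    # illustrative T, m, omega
k = omega * math.sqrt(mass_per_length / tension)    # coefficient of x in Eq. (1-19)
wavelength = 2 * math.pi / k                        # from 2 pi/lambda = omega sqrt(m/T)

def psi(x, amplitude=1.5):
    """Trial solution of Eq. (1-19): A sin(omega sqrt(m/T) x)."""
    return amplitude * math.sin(k * x)

def second_derivative(f, x, h=1e-4):
    """Central finite-difference estimate of the second derivative of f at x."""
    return (f(x + h) - 2 * f(x) + f(x - h)) / h**2

x0 = 0.4
lhs = second_derivative(psi, x0)
rhs_1_18 = -(omega**2 * mass_per_length / tension) * psi(x0)   # Eq. (1-18)
rhs_1_20 = -((2 * math.pi / wavelength) ** 2) * psi(x0)        # Eq. (1-20)

print("d2psi/dx2      =", lhs)
print("Eq. (1-18) rhs =", rhs_1_18)
print("Eq. (1-20) rhs =", rhs_1_20)
```

The form in Eq. (1-20) is convenient because it refers only to the wavelength, with no explicit mention of the tension, the mass per unit length, or the frequency factor.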
For three-dimensional systems, the classical time-independent wave equation for an 
isotropic and uniform medium is 


$$(\partial^2/\partial x^2 + \partial^2/\partial y^2 + \partial^2/\partial z^2)\,\psi(x,y,z) = -(2\pi/\lambda)^2\psi(x,y,z) \tag{1-21}$$
where $\lambda$ depends on the elasticity of the medium. The combination of partial derivatives
on the left-hand side of Eq. (1-21) is called the Laplacian, and is often given the
shorthand symbol $\nabla^2$ (del squared). This would give for Eq. (1-21)


$$\nabla^2\psi(x,y,z) = -(2\pi/\lambda)^2\psi(x,y,z) \tag{1-22}$$


1-4 Standing Waves in a Clamped String


We now demonstrate how Eq. (1-20) can be used to predict the nature of standing waves 
in a string. Suppose that the string is clamped at $x = 0$ and $L$. This means that the
string cannot oscillate at these points. Mathematically this means that 


$$\psi(0) = \psi(L) = 0 \tag{1-23}$$


Conditions such as these are called boundary conditions. Our question is, “What
functions $\psi$ satisfy Eq. (1-20) and also Eq. (1-23)?” We begin by trying to find the most
general equation that can satisfy Eq. (1-20). We have already seen that $A\sin(2\pi x/\lambda)$
is a solution, but it is easy to show that $A\cos(2\pi x/\lambda)$ is also a solution. More general
than either of these is the linear combination³


$$\psi(x) = A\sin(2\pi x/\lambda) + B\cos(2\pi x/\lambda) \tag{1-24}$$


By varying $A$ and $B$, we can get different functions $\psi$.

There are two remarks to be made at this point. First, some readers will have
noticed that other functions exist that satisfy Eq. (1-20). These are $A\exp(2\pi i x/\lambda)$
and $A\exp(-2\pi i x/\lambda)$, where $i = \sqrt{-1}$. The reason we have not included these in
the general function (1-24) is that these two exponential functions are mathematically
equivalent to the trigonometric functions. The relationship is


$$\exp(\pm ikx) = \cos(kx) \pm i\sin(kx) \tag{1-25}$$


This means that any trigonometric function may be expressed in terms of such exponentials
and vice versa. Hence, the set of trigonometric functions and the set of exponentials
are redundant, and no additional flexibility would result by including exponentials in
Eq. (1-24) (see Problem 1-1). The two sets of functions are linearly dependent.⁴

The second remark is that for a given $A$ and $B$ the function described by Eq. (1-24)
is a single sinusoidal wave with wavelength $\lambda$. By altering the ratio of $A$ to $B$, we cause
the wave to shift to the left or right with respect to the origin. If $A = 1$ and $B = 0$, the
wave has a node at $x = 0$. If $A = 0$ and $B = 1$, the wave has an antinode at $x = 0$.
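
The second remark can be made concrete with a short calculation. The sketch below rewrites Eq. (1-24) as a single sine wave, $R\sin(2\pi x/\lambda + \phi)$, using the standard trigonometric identity with $R = \sqrt{A^2 + B^2}$ and $\tan\phi = B/A$. The helper names and the sample values of $A$, $B$, and $\lambda$ are assumptions introduced only for this illustration.

```python
import math

wavelength = 2.0                  # illustrative value, arbitrary units
k = 2 * math.pi / wavelength

def general_solution(x, a, b):
    """Eq. (1-24): A sin(2 pi x/lambda) + B cos(2 pi x/lambda)."""
    return a * math.sin(k * x) + b * math.cos(k * x)

def as_single_wave(a, b):
    """Return (R, phi) such that A sin(kx) + B cos(kx) = R sin(kx + phi)."""
    return math.hypot(a, b), math.atan2(b, a)

for a, b in [(1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]:
    r, phi = as_single_wave(a, b)
    shift_left = phi / k          # displacement of the sine wave toward negative x
    print("A =", a, "B =", b, "amplitude =", r,
          "shift =", shift_left, "value at x = 0:", general_solution(0.0, a, b))
```

For $A = 1$, $B = 0$ the value at the origin is zero (a node), and for $A = 0$, $B = 1$ the wave is shifted by a quarter wavelength so that the origin becomes an antinode, in agreement with the text.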

We now proceed by letting the boundary conditions determine the constants A and B. 
The condition at x = 0 gives 


$$\psi(0) = A\sin(0) + B\cos(0) = 0 \tag{1-26}$$


³ Given functions $f_1, f_2, f_3, \ldots$. A linear combination of these functions is $c_1 f_1 + c_2 f_2 + c_3 f_3 + \cdots$, where
$c_1, c_2, c_3, \ldots$ are numbers (which need not be real).

⁴ If one member of a set of functions $(f_1, f_2, f_3, \ldots)$ can be expressed as a linear combination of the remaining
functions (i.e., if $f_1 = c_2 f_2 + c_3 f_3 + \cdots$), the set of functions is said to be linearly dependent. Otherwise, they
are linearly independent.
However, since $\sin(0) = 0$ and $\cos(0) = 1$, this gives

$$B = 0 \tag{1-27}$$

Therefore, our first boundary condition forces $B$ to be zero and leaves us with

$$\psi(x) = A\sin(2\pi x/\lambda) \tag{1-28}$$

Our second boundary condition, at $x = L$, gives

$$\psi(L) = A\sin(2\pi L/\lambda) = 0 \tag{1-29}$$


One solution is provided by setting $A$ equal to zero. This gives $\psi = 0$, which corresponds
to no wave at all in the string. This is possible, but not very interesting. The other
possibility is for $2\pi L/\lambda$ to be equal to $0, \pm\pi, \pm 2\pi, \ldots, \pm n\pi, \ldots$ since the sine function
vanishes then. This gives the relation


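$$2\pi L/\lambda = n\pi, \quad n = 0, \pm 1, \pm 2, \ldots \tag{1-30}$$

or

$$\lambda = 2L/n, \quad n = 0, \pm 1, \pm 2, \ldots \tag{1-31}$$

Substituting this expression for $\lambda$ into Eq. (1-28) gives

$$\psi(x) = A\sin(n\pi x/L), \quad n = 0, \pm 1, \pm 2, \ldots \tag{1-32}$$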


Some of these solutions are sketched in Fig. 1-5. The solution for n = 0 is again the 
uninteresting $\psi = 0$ case. Furthermore, since $\sin(-x)$ equals $-\sin(x)$, it is clear that
the set of functions produced by positive integers n is not physically different from the 
set produced by negative n, so we may arbitrarily restrict our attention to solutions with 
positive n. (The two sets are linearly dependent.) The constant A is still undetermined. 
It affects the amplitude of the wave. To determine A would require knowing how much 
energy is stored in the wave, that is, how hard the string was plucked. 
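
The effect of the boundary conditions can be seen in a small numerical sketch. Assuming a string of unit length (an arbitrary choice, as are the function names), the code below lists the first few wavelengths allowed by Eq. (1-31) for positive $n$, confirms that each satisfies the condition at $x = L$, and shows that a wavelength not of the form $2L/n$ fails it.

```python
import math

L = 1.0   # string length, arbitrary units (illustrative)

def allowed_wavelength(n):
    """Eq. (1-31) for positive n: lambda = 2L/n."""
    return 2 * L / n

def psi(x, wavelength, amplitude=1.0):
    """Eq. (1-28): the solution remaining after the x = 0 boundary condition."""
    return amplitude * math.sin(2 * math.pi * x / wavelength)

for n in range(1, 5):
    lam = allowed_wavelength(n)
    # The boundary value at x = L vanishes (to rounding error) for every allowed n.
    print("n =", n, "lambda =", lam, "psi(L) vanishes:", abs(psi(L, lam)) < 1e-12)

# A wavelength that does not divide 2L an integral number of times is rejected:
rejected = psi(L, 0.75 * L)
print("psi(L) for lambda = 0.75 L:", rejected)
```

The amplitude is left at an arbitrary value here, since, as noted above, fixing $A$ would require knowing how much energy is stored in the wave.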

It is evident that there are an infinite number of acceptable solutions, each one 
corresponding to a different number of half-waves fitting between 0 and L. But an even 
larger infinity of waves has been excluded by the boundary conditions—namely, all 
waves having wavelengths not divisible into $2L$ an integral number of times. The result
of applying boundary conditions has been to restrict the allowed wavelengths to certain
discrete values. As we shall see, this behavior is closely related to the quantization of
energies in quantum mechanics.

Figure 1-5 » Solutions for the time-independent wave equation in one dimension with boundary
conditions $\psi(0) = \psi(L) = 0$.

The example worked out above is an extremely simple one. Nevertheless, it demon- 
strates how a differential equation and boundary conditions are used to define the 
allowed states for a system. One could have arrived at solutions for this case by simple 
physical argument, but this is usually not possible in more complicated cases. The dif- 
ferential equation provides a systematic approach for finding solutions when physical 
intuition is not enough. 


1-5 Light as an Electromagnetic Wave 


Suppose a charged particle is caused to oscillate harmonically on the z axis. If there 
is another charged particle some distance away and initially at rest in the xy plane, 
this second particle will commence oscillating harmonically too. Thus, energy is being 
transferred from the first particle to the second, which indicates that there is an oscil- 
lating electric field emanating from the first particle. We can plot the magnitude of 
this electric field at a given instant as it would be felt by a series of imaginary test 
charges stationed along a line emanating from the source and perpendicular to the axis 
of vibration (Fig. 1-6). 

If there are some magnetic compasses in the neighborhood of the oscillating charge, 
these will be found to swing back and forth in response to the disturbance. This means 
that an oscillating magnetic field is produced by the charge too. Varying the placement 
of the compasses will show that this field oscillates in a plane perpendicular to the 
axis of vibration of the charged particle. The combined electric and magnetic fields 
traveling along one ray in the xy plane appear in Fig. 1-7.

The changes in electric and magnetic fields propagate outward with a characteristic 
velocity c, and are describable as a traveling wave, called an electromagnetic wave. 
Its frequency $\nu$ is the same as the oscillation frequency of the vibrating charge. Its
wavelength is $\lambda = c/\nu$. Visible light, infrared radiation, radio waves, microwaves,
ultraviolet radiation, X rays, and $\gamma$ rays are all forms of electromagnetic radiation,
their only difference being their frequencies $\nu$. We shall continue the discussion in the
context of light, understanding that it applies to all forms of electromagnetic radiation. 
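
To give a feeling for the range of wavelengths involved, the relation $\lambda = c/\nu$ can be evaluated for a few frequencies. The sketch below is only an illustration: the three frequencies are round numbers chosen by us, not values from the text, and the function name is hypothetical. The speed of light in vacuum is taken as its defined value.

```python
SPEED_OF_LIGHT = 2.99792458e8   # m/s, defined value in vacuum

def wavelength_m(frequency_hz):
    """lambda = c / nu for an electromagnetic wave in vacuum."""
    return SPEED_OF_LIGHT / frequency_hz

# Illustrative frequencies (not taken from the text)
for label, nu in [("radio", 1.0e8), ("microwave", 1.0e10), ("visible", 5.0e14)]:
    print(label, "frequency (Hz):", nu, "wavelength (m):", wavelength_m(nu))
```

The three results span roughly seven orders of magnitude, from meters for the radio wave to a fraction of a micrometer for visible light, although all three are the same kind of wave.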




Figure 1-6 » A harmonic electric-field wave emanating from a vibrating electric charge. The wave 
magnitude is proportional to the force felt by the test charges. The charges are only imaginary; if 
they actually existed, they would possess mass and under acceleration would absorb energy from the 
wave, causing it to attenuate. 
Figure 1-7 » A harmonic electromagnetic field produced by an oscillating electric charge. The
arrows without attached charges show the direction in which the north pole of a magnet would be 
attracted. The magnetic field is oriented perpendicular to the electric field. 


If a beam of light is produced so that the orientation of the electric field wave is 
always in the same plane, the light is said to be plane (or linearly) polarized. The plane- 
polarized light shown in Fig. 1-7 is said to be z polarized. If the plane of orientation 
of the electric field wave rotates clockwise or counterclockwise about the axis of travel 
(i.e., if the electric field wave "corkscrews" through space), the light is said to be right 
or left circularly polarized. If the light is a composite of waves having random field 
orientations so that there is no resultant orientation, the light is unpolarized. 

Experiments with light in the nineteenth century and earlier were consistent with 
the view that light is a wave phenomenon. One of the more obvious experimental 
verifications of this is provided by the interference pattern produced when light from a 
point source is allowed to pass through a pair of slits and then to fall on a screen. The 
resulting interference patterns are understandable only in terms of the constructive and 
destructive interference of waves. The differential equations of Maxwell, which pro- 
vided the connection between electromagnetic radiation and the basic laws of physics, 
also indicated that light is a wave. 

But there remained several problems that prevented physicists from closing the book 
on this subject. One was the inability of classical physical theory to explain the intensity 
and wavelength characteristics of light emitted by a glowing “blackbody.” This problem 
was studied by Planck, who was forced to conclude that the vibrating charged particles 
producing the light can exist only in certain discrete (separated) energy states. We 
shall not discuss this problem. Another problem had to do with the interpretation of a 
phenomenon discovered in the late 1800s, called the photoelectric effect. 


1-6 The Photoelectric Effect 


This phenomenon occurs when the exposure of some material to light causes it to eject 
electrons. Many metals do this quite readily. A simple apparatus that could be used to 
study this behavior is drawn schematically in Fig. 1-8. Incident light strikes the metal 
dish in the evacuated chamber. If electrons are ejected, some of them will strike the 
collecting wire, giving rise to a deflection of the galvanometer. In this apparatus, one 
can vary the potential difference between the metal dish and the collecting wire, and 
also the intensity and frequency of the incident light. 

Figure 1-8 » A phototube. 

Suppose that the potential difference is set at zero and a current is detected when 
light of a certain intensity and frequency strikes the dish. This means that electrons 
are being emitted from the dish with finite kinetic energy, enabling them to travel to 
the wire. If a retarding potential is now applied, electrons that are emitted with only a 
small kinetic energy will have insufficient energy to overcome the retarding potential 
and will not travel to the wire. Hence, the current being detected will decrease. The 
retarding potential can be increased gradually until finally even the most energetic 
photoelectrons cannot make it to the collecting wire. This enables one to calculate the 
maximum kinetic energy for photoelectrons produced by the incident light on the metal 
in question. 
The observations from experiments of this sort can be summarized as follows: 


1. Below a certain cutoff frequency of incident light, no photoelectrons are ejected, no 
matter how intense the light. 


2. Above the cutoff frequency, the number of photoelectrons is directly proportional 
to the intensity of the light. 


3. As the frequency of the incident light is increased, the maximum kinetic energy of 
the photoelectrons increases. 


4. In cases where the radiation intensity is extremely low (but frequency is above the 
cutoff value) photoelectrons are emitted from the metal without any time lag. 


Some of these results are summarized graphically in Fig. 1-9. Apparently, the kinetic 
energy of the photoelectron is given by 


$$\text{kinetic energy} = h(\nu - \nu_0) \tag{1-33}$$


where h is a constant. The cutoff frequency $\nu_0$ depends on the metal being studied (and 
also its temperature), but the slope h is the same for all substances. 
We can also write the kinetic energy as 


$$\text{kinetic energy} = \text{energy of light} - \text{energy needed to escape surface} \tag{1-34}$$

Figure 1-9 » Maximum kinetic energy of photoelectrons as a function of incident light frequency, 
where $\nu_0$ is the minimum frequency for which photoelectrons are ejected from the metal in the absence 
of any retarding or accelerating potential. 


The last quantity in Eq. (1-34) is often referred to as the work function W of the metal. 
Equating Eq. (1-33) with (1-34) gives 


$$\text{energy of light} - W = h\nu - h\nu_0 \tag{1-35}$$


The material-dependent term W is identified with the material-dependent term $h\nu_0$, 
yielding 


$$\text{energy of light} = E = h\nu \tag{1-36}$$


where the value of h has been determined to be $6.626176 \times 10^{-34}$ J s. (See Appendix 
10 for units and conversion factors.) 

Physicists found it difficult to reconcile these observations with the classical electro- 
magnetic field theory of light. For example, if light of a certain frequency and intensity 
causes emission of electrons having a certain maximum kinetic energy, one would 
expect increased light intensity (corresponding classically to a greater electromagnetic 
field amplitude and hence greater energy density) to produce photoelectrons of higher 
kinetic energy. However, it only produces more photoelectrons and does not affect their 
energies. Again, if light is a wave, the energy is distributed over the entire wavefront 
and this means that a low light intensity would impart energy at a very low rate to an 
area of surface occupied by one atom. One can calculate that it would take years for an 
individual atom to collect sufficient energy to eject an electron under such conditions. 
No such induction period is observed. 

An explanation for these results was suggested in 1905 by Einstein, who proposed 
that the incident light be viewed as being composed of discrete units of energy. Each 
such unit, or photon, would have an associated energy of $h\nu$, where $\nu$ is the frequency 
of the oscillating emitter. Increasing the intensity of the light would correspond to 
increasing the number of photons, whereas increasing the frequency of the light would 
increase the energy of the photons. If we envision each emitted photoelectron as 
resulting from a photon striking the surface of the metal, it is quite easy to see that 
Einstein's proposal accords with observation. But it creates a new problem: If we are 
to visualize light as a stream of photons, how can we explain the wave properties of 
light, such as the double-slit diffraction pattern? What is the physical meaning of the 
electromagnetic wave? 

Essentially, the problem is that, in the classical view, the square of the electromagnetic 
wave at any point in space is a measure of the energy density at that point. Now 
the square of the electromagnetic wave is a continuous and smoothly varying function, 
and if energy is continuous and infinitely divisible, there is no problem with this the- 
ory. But if the energy cannot be divided into amounts smaller than a photon—if it has 
a particulate rather than a continuous nature—then the classical interpretation cannot 
apply, for it is not possible to produce a smoothly varying energy distribution from 
energy particles any more than it is possible to produce, at the microscopic level, a 
smooth density distribution in gas made from atoms of matter. Einstein suggested that 
the square of the electromagnetic wave at some point (that is, the sum of the squares 
of the electric and magnetic field magnitudes) be taken as the probability density for 
finding a photon in the volume element around that point. The greater the square of 
the wave in some region, the greater is the probability for finding the photon in that 
region. Thus, the classical notion of energy having a definite and smoothly varying 
distribution is replaced by the idea of a smoothly varying probability density for finding 
an atomistic packet of energy. 

Let us explore this probabilistic interpretation within the context of the two-slit 
interference experiment. We know that the pattern of light and darkness observed on 
the screen agrees with the classical picture of interference of waves. Suppose we carry 
out the experiment in the usual way, except we use a light source (of frequency $\nu$) so 
weak that only $h\nu$ units of energy per second pass through the apparatus and strike 
the screen. According to the classical picture, this tiny amount of energy should strike 
the screen in a delocalized manner, producing an extremely faint image of the entire 
diffraction pattern. Over a period of many seconds, this pattern could be accumulated 
(on a photographic plate, say) and would become more intense. According to Einstein's 
view, our experiment corresponds to transmission of one photon per second and each 
photon strikes the screen at a localized point. Each photon strikes a new spot (not to 
imply the same spot cannot be struck more than once) and, over a long period of time, 
they build up the observed diffraction pattern. If we wish to state in advance where the 
next photon will appear, we are unable to do so. The best we can do is to say that the 
next photon is more likely to strike in one area than in another, the relative probabilities 
being quantitatively described by the square of the electromagnetic wave. 
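
The photon-by-photon build-up described above can be imitated numerically. What follows is only a minimal sketch, not a model of any real apparatus: it assumes an idealized two-slit intensity proportional to $\cos^2(\pi x/s)$ on the screen, and the fringe spacing s, the screen half-width, and all function names are illustrative choices that do not come from the text. Each simulated photon lands at one point, drawn by rejection sampling from that probability density.

```python
import math
import random

def intensity(x, fringe_spacing=1.0):
    """Idealized two-slit pattern: relative probability density at screen position x."""
    return math.cos(math.pi * x / fringe_spacing) ** 2

def detect_photons(n_photons, half_width=2.0, seed=0):
    """Draw n_photons landing positions by rejection sampling from intensity(x)."""
    rng = random.Random(seed)
    hits = []
    while len(hits) < n_photons:
        x = rng.uniform(-half_width, half_width)
        # Accept x with probability intensity(x); the maximum of intensity is 1.
        if rng.random() < intensity(x):
            hits.append(x)
    return hits

def histogram(hits, half_width=2.0, n_bins=16):
    """Count hits in equal-width bins across the screen."""
    counts = [0] * n_bins
    width = 2 * half_width / n_bins
    for x in hits:
        index = min(int((x + half_width) / width), n_bins - 1)
        counts[index] += 1
    return counts

if __name__ == "__main__":
    # A few photons look random; many photons reveal the bright and dark bands.
    for n in (10, 100, 10000):
        print(n, histogram(detect_photons(n)))
```

No single landing position can be predicted in this sketch, yet the histogram for a large number of photons follows the squared wave, which is the sense in which the wave gives relative probabilities.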

The interpretation of electromagnetic waves as probability waves often leaves one 
with some feelings of unreality. If the wave only tells us relative probabilities for 
finding a photon at one point or another, one is entitled to ask whether the wave has 
“physical reality,” or if it is merely a mathematical device which allows us to analyze 
photon distribution, the photons being the “physical reality." We will defer discussion 
of this question until a later section on electron diffraction. 


EXAMPLE 1-1 A retarding potential of 2.38 volts just suffices to stop photoelectrons 
emitted from potassium by light of frequency $1.13 \times 10^{15}$ s$^{-1}$. What is the work 
function, W, of potassium? 


SOLUTION » $E_{\text{light}} = h\nu = W + KE_{\text{electron}}$, so $W = h\nu - KE_{\text{electron}} = (4.136 \times 10^{-15}\ \text{eV s})(1.13 \times 10^{15}\ \text{s}^{-1}) - 2.38\ \text{eV} = 4.67\ \text{eV} - 2.38\ \text{eV} = 2.29\ \text{eV}$. [Note convenience of using h in units 
of eV s for this problem. See Appendix 10 for data.] 
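
The arithmetic of Example 1-1 can be checked with a few lines of code. This is a sketch under the assumptions of the example: h is taken as $4.136 \times 10^{-15}$ eV s, and the stopping potential in volts is read directly as the maximum kinetic energy in eV. The function names are illustrative, not from the text.

```python
H_EV_S = 4.136e-15  # Planck's constant in eV s (value used in Example 1-1)

def work_function_ev(frequency_hz, stopping_potential_v):
    """Return W = h*nu - KE_max in eV.

    KE_max in eV is numerically equal to the stopping potential in volts,
    because each photoelectron carries one electronic charge.
    """
    photon_energy_ev = H_EV_S * frequency_hz
    return photon_energy_ev - stopping_potential_v

def cutoff_frequency_hz(work_function):
    """Return nu_0 = W / h, the lowest frequency that ejects photoelectrons."""
    return work_function / H_EV_S

if __name__ == "__main__":
    w = work_function_ev(1.13e15, 2.38)
    print(f"W = {w:.2f} eV")
    print(f"cutoff frequency = {cutoff_frequency_hz(w):.3e} Hz")
```

The second function uses the identification $W = h\nu_0$ from Eq. (1-35) to turn the work function back into a cutoff frequency.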


14 Chapter 1 Classical Waves and the Time-Independent Schrödinger Wave Equation 


EXAMPLE 1-2 Spectroscopists often express $\Delta E$ for a transition between states in 
wavenumbers, e.g., m$^{-1}$ or cm$^{-1}$, rather than in energy units like J or eV. (Usually 
cm$^{-1}$ is favored, so we will proceed with that choice.) 


a) What is the physical meaning of the term wavenumber? 
b) What is the connection between wavenumber and energy? 
c) What wavenumber applies to an energy of 1.000 J? of 1.000 eV? 


SOLUTION » a) Wavenumber is the number of waves that fit into a unit of distance (usually of 
one centimeter). It is sometimes symbolized $\tilde{\nu}$. $\tilde{\nu} = 1/\lambda$, where $\lambda$ is the wavelength in centimeters. 
b) Wavenumber characterizes the light that has photons of the designated energy. $E = h\nu = hc/\lambda = hc\tilde{\nu}$ 
(where c is given in cm/s). 

c) $E = 1.000\ \text{J} = hc\tilde{\nu}$; $\tilde{\nu} = 1.000\ \text{J}/hc = 1.000\ \text{J}/[(6.626 \times 10^{-34}\ \text{J s})(2.998 \times 10^{10}\ \text{cm/s})] = 5.034 \times 10^{22}\ \text{cm}^{-1}$. 
Clearly, this is light of an extremely short wavelength since more than $10^{22}$ 
wavelengths fit into 1 cm. For 1.000 eV, the above equation is repeated using h in eV s. This gives 
$\tilde{\nu} = 8065\ \text{cm}^{-1}$. 
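
The conversion in part c) is just a division by hc, which a short sketch makes explicit. The constants are the rounded values quoted in Examples 1-1 and 1-2; the function names are illustrative.

```python
H_J_S = 6.626e-34      # Planck's constant, J s
H_EV_S = 4.136e-15     # Planck's constant, eV s
C_CM_PER_S = 2.998e10  # speed of light, cm/s (cm so that the result is in cm^-1)

def wavenumber_from_joules(energy_j):
    """Return the wavenumber in cm^-1 of a photon with the given energy in J."""
    return energy_j / (H_J_S * C_CM_PER_S)

def wavenumber_from_ev(energy_ev):
    """Return the wavenumber in cm^-1 of a photon with the given energy in eV."""
    return energy_ev / (H_EV_S * C_CM_PER_S)

if __name__ == "__main__":
    print(f"1.000 J  -> {wavenumber_from_joules(1.0):.3e} cm^-1")
    print(f"1.000 eV -> {wavenumber_from_ev(1.0):.0f} cm^-1")
```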


1-7 The Wave Nature of Matter 


Evidently light has wave and particle aspects, and we can describe it in terms of photons, 
which are associated with waves of frequency $\nu = E/h$. Now photons are rather peculiar 
particles in that they have zero rest mass. In fact, they can exist only when traveling 
at the speed of light. The more normal particles in our experience have nonzero rest 
masses and can exist at any velocity up to the speed-of-light limit. Are there also waves 
associated with such normal particles? 

Imagine a particle having a finite rest mass that somehow can be made lighter and 
lighter, approaching zero in a continuous way. It seems reasonable that the existence 
of a wave associated with the motion of the particle should become more and more 
apparent, rather than the wave coming into existence abruptly when m = 0. De Broglie 
proposed that all material particles are associated with waves, which he called “matter 
waves,” but that the existence of these waves is likely to be observable only in the 
behaviors of extremely light particles. 

De Broglie's relation can be reached as follows. Einstein's relation for photons is 


$$E = h\nu \tag{1-37}$$

But a photon carrying energy E has a relativistic mass given by 

$$E = mc^2 \tag{1-38}$$

Equating these two equations gives 

$$E = mc^2 = h\nu = hc/\lambda \tag{1-39}$$

or 

$$mc = h/\lambda \tag{1-40}$$

A normal particle, with nonzero rest mass, travels at a velocity v. If we regard Eq. (1-40) 
as merely the high-velocity limit of a more general expression, we arrive at an equation 
relating particle momentum p and associated wavelength $\lambda$: 

$$mv = p = h/\lambda \tag{1-41}$$

or 

$$\lambda = h/p \tag{1-42}$$


Here, m refers to the rest mass of the particle plus the relativistic correction, but the 
latter is usually negligible in comparison to the former. 

This relation, proposed by de Broglie in 1922, was demonstrated to be correct shortly 
thereafter when Davisson and Germer showed that a beam of electrons impinging on a 
nickel target produced the scattering patterns one expects from interfering waves. These 
“electron waves” were observed to have wavelengths related to electron momentum in 
just the manner proposed by de Broglie. 

Equation (1-42) relates the de Broglie wavelength $\lambda$ of a matter wave to the momentum 
p of the particle. A higher momentum corresponds to a shorter wavelength. Since 

$$\text{kinetic energy } T = \tfrac{1}{2}mv^2 = (1/2m)(m^2v^2) = p^2/2m \tag{1-43}$$

it follows that 

$$p = \sqrt{2mT} \tag{1-44}$$

Furthermore, since E = T + V, where E is the total energy and V is the potential 
energy, we can rewrite the de Broglie wavelength as 

$$\lambda = \frac{h}{\sqrt{2m(E - V)}} \tag{1-45}$$

Equation (1-45) is useful for understanding the way in which $\lambda$ will change for a 
particle moving with constant total energy in a varying potential. For example, if the 
particle enters a region where its potential energy increases (e.g., an electron approaches 
a negatively charged plate), $E - V$ decreases and $\lambda$ increases (i.e., the particle slows 
down, so its momentum decreases and its associated wavelength increases). We shall 
see examples of this behavior in future chapters. 

Observe that if E > V, $\lambda$ as given by Eq. (1-45) is real. However, if E < V, $\lambda$ 
becomes imaginary. Classically, we never encounter such a situation, but we will find 
it is necessary to consider this possibility in quantum mechanics. 
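
The behavior of Eq. (1-45) in the two regimes can be explored with a short sketch. It uses complex arithmetic so that the case E < V returns an imaginary wavelength instead of raising an error. The function name, the rounded value of h, and the choice to reject E = V (where p = 0) are assumptions made for illustration.

```python
import cmath

H = 6.626e-34  # Planck's constant, J s (rounded)

def de_broglie_wavelength(mass_kg, total_energy_j, potential_energy_j):
    """Return lambda = h / sqrt(2m(E - V)) as a complex number.

    The imaginary part is zero when E > V; the real part is zero when E < V.
    E == V is rejected because the momentum is zero there.
    """
    momentum = cmath.sqrt(2.0 * mass_kg * (total_energy_j - potential_energy_j))
    if momentum == 0:
        raise ValueError("E equals V: momentum is zero, wavelength undefined")
    return H / momentum

if __name__ == "__main__":
    EV = 1.60219e-19            # J per eV
    ELECTRON_MASS = 9.109e-31   # kg, standard rounded value (not given in the text)
    for v_ev in (0.0, 5.0, 15.0):
        lam = de_broglie_wavelength(ELECTRON_MASS, 10.0 * EV, v_ev * EV)
        print(f"E = 10 eV, V = {v_ev:4.1f} eV -> lambda = {lam}")
```

For an electron with E = 10 eV, raising V from 0 to 5 eV halves $E - V$ and so lengthens $\lambda$ by a factor of $\sqrt{2}$; with V = 15 eV the result is purely imaginary.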


EXAMPLE 1-3 A He$^{2+}$ ion is accelerated from rest through a voltage drop of 1.000 
kilovolts. What is its final de Broglie wavelength? Would the wavelike properties 
be very apparent? 


SOLUTION » Since a charge of two electronic units has passed through a voltage drop 
of $1.000 \times 10^{3}$ volts, the final kinetic energy of the ion is $2.000 \times 10^{3}$ eV. To calculate $\lambda$, we first 
convert from eV to joules: $KE = p^2/2m = (2.000 \times 10^{3}\ \text{eV})(1.60219 \times 10^{-19}\ \text{J/eV}) = 3.204 \times 10^{-16}\ \text{J}$. 
$m_{\text{He}} = (4.003\ \text{g/mol})(10^{-3}\ \text{kg/g})(1\ \text{mol}/6.022 \times 10^{23}\ \text{atoms}) = 6.65 \times 10^{-27}\ \text{kg}$; 
$p = \sqrt{2 m_{\text{He}} KE} = [2(6.65 \times 10^{-27}\ \text{kg})(3.204 \times 10^{-16}\ \text{J})]^{1/2} = 2.1 \times 10^{-21}\ \text{kg m/s}$. 
$\lambda = h/p = (6.626 \times 10^{-34}\ \text{J s})/(2.1 \times 10^{-21}\ \text{kg m/s}) = 3.2 \times 10^{-13}\ \text{m} = 0.32\ \text{pm}$. This wavelength is on the 
order of 1% of the radius of a hydrogen atom, too short to produce observable interference results 
when interacting with atom-size scatterers. For most purposes, we can treat this ion as simply a 
high-speed particle. 
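
The chain of conversions in Example 1-3 (voltage to kinetic energy, molar mass to particle mass, then $\lambda = h/p$) can be packaged as one function. This is a nonrelativistic sketch using the rounded constants quoted in the example; the function and parameter names are illustrative.

```python
import math

H = 6.626e-34          # Planck's constant, J s
EV_TO_J = 1.60219e-19  # joules per electron volt
AVOGADRO = 6.022e23    # particles per mole

def wavelength_after_acceleration(charge_units, voltage_v, molar_mass_g_per_mol):
    """Return the de Broglie wavelength in meters of an ion accelerated from rest.

    charge_units is the ion charge in units of the electronic charge, so the
    kinetic energy gained is charge_units * voltage_v in eV.
    """
    kinetic_energy_j = charge_units * voltage_v * EV_TO_J
    mass_kg = molar_mass_g_per_mol * 1e-3 / AVOGADRO
    momentum = math.sqrt(2.0 * mass_kg * kinetic_energy_j)  # p = sqrt(2 m KE)
    return H / momentum

if __name__ == "__main__":
    lam = wavelength_after_acceleration(2, 1.000e3, 4.003)
    print(f"lambda = {lam * 1e12:.2f} pm")
```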


1-8 A Diffraction Experiment with Electrons 


In order to gain a better understanding of the meaning of matter waves, we now consider 
a set of simple experiments. Suppose that we have a source of a beam of monoener- 
getic electrons and a pair of slits, as indicated schematically in Fig. 1-10. Any electron 
arriving at the phosphorescent screen produces a flash of light, just as in a television 
set. For the moment we ignore the light source near the slits (assume that it is turned 
off) and inquire as to the nature of the image on the phosphorescent screen when the 
electron beam is directed at the slits. The observation, consistent with the observations 
of Davisson and Germer already mentioned, is that there are alternating bands of light 
and dark, indicating that the electron beam is being diffracted by the slits. Further- 
more, the distance separating the bands is consistent with the de Broglie wavelength 
corresponding to the energy of the electrons. The variation in light intensity observed 
on the screen is depicted in Fig. 1-11a. 

Evidently, the electrons in this experiment are displaying wave behavior. Does this 
mean that the electrons are spread out like waves when they are detected at the screen? 
We test this by reducing our beam intensity to let only one electron per second through 
the apparatus and observe that each electron gives a localized pinpoint of light, the 
entire diffraction pattern building up gradually by the accumulation of many points. 
Thus, the square of de Broglie’s matter wave has the same kind of statistical significance 
that Einstein proposed for electromagnetic waves and photons, and the electrons really 
are localized particles, at least when they are detected at the screen. 

Figure 1-10 » The electron source produces a beam of electrons, some of which pass through slits 
a and/or b to be detected as flashes of light on the phosphorescent screen. 
[Labeled components: source of monoenergetic beam of electrons, collimating slit, double slit (a, b), light source, microscope, observer, phosphorescent screen.] 

Figure 1-11 » Light intensity at phosphorescent screen under various conditions: (a) a and b open, 
light off; (b) a open, b closed, light off; (c) a closed, b open, light off; (d) a and b open, light on, $\lambda$ 
short; (e) a and b open, light on, $\lambda$ longer. 

However, if they are really particles, it is hard to see how they can be diffracted. 
Consider what happens when slit b is closed. Then all the electrons striking the screen 
must have come through slit a. We observe the result to be a single area of light on 
the screen (Fig. 1-11b). Closing slit a and opening b gives a similar (but displaced) 
light area, as shown in Fig. 1-11c. These patterns are just what we would expect for 
particles. Now, with both slits open, we expect half the particles to pass through slit a 
and half through slit b, the resulting pattern being the sum of the results just described. 
Instead we obtain the diffraction pattern (Fig. 1-11a). How can this happen? It seems 
that, somehow, an electron passing through the apparatus can sense whether one or 
both slits are open, even though as a particle it can explore only one slit or the other. 
One might suppose that we are seeing the result of simultaneous traversal of the two 
slits by two electrons, the path of each electron being affected by the presence of an 
electron in the other slit. This would explain how an electron passing through slit a 
would “know” whether slit b was open or closed. But the fact that the pattern builds 
up even when electrons pass through at the rate of one per second indicates that this 
argument will not do. Could an electron be coming through both slits at once? 

To test this question, we need to have detailed information about the positions of the 
electrons as they pass through the slits. We can get such data by turning on the light 
source and aiming a microscope at the slits. Then photons will bounce off each electron 
as it passes the slits and will be observed through the microscope. The observer thus 
can tell through which slit each electron has passed, and also record its final position 
on the phosphorescent screen. In this experiment, it is necessary to use light having 
a wavelength short in comparison to the interslit distance; otherwise the microscope 
cannot resolve a flash well enough to tell which slit it is nearest. When this experiment 
is performed, we indeed detect each electron as coming through one slit or the other, 
and not both, but we also find that the diffraction pattern on the screen has been lost 
and that we have the broad, featureless distribution shown in Fig. 1-11d, which is 
basically the sum of the single-slit experiments. What has happened is that the photons 
from our light source, in bouncing off the electrons as they emerge from the slits, have 
affected the momenta of the electrons and changed their paths from what they were 
in the absence of light. We can try to counteract this by using photons with lower 
momentum; but this means using photons of lower E, hence longer $\lambda$. As a result, 
the images of the electrons in the microscope get broader, and it becomes more and 
more ambiguous as to which slit a given electron has passed through or that it really 
passed through only one slit. As we become more and more uncertain about the path 
of each electron as it moves past the slits, the accumulating diffraction pattern becomes 
more and more pronounced (Fig. 1-11e). (Since this is a “thought experiment,” we can 
ignore the inconvenient fact that our “light” source must produce X rays or $\gamma$ rays in 
order to have a wavelength short in comparison to the appropriate interslit distance.) 

This conceptual experiment illustrates a basic feature of microscopic systems: we 
cannot measure properties of the system without affecting the future development of the 
system in a nontrivial way. The system with the light turned off is significantly different 
from the system with the light turned on (with short $\lambda$), and so the electrons arrive at the 
screen with different distributions. No matter how cleverly one devises the experiment, 
there is some minimum necessary disturbance involved in any measurement. In this 
example with the light off, the problem is that we know the momentum of each electron 
quite accurately (since the beam is monoenergetic and collimated), but we do not know 
anything about the way the electrons traverse the slits. With the light on, we obtain 
information about electron position just beyond the slits but we change the momentum 
of each electron in an unknown way. The measurement of particle position leads to 
loss of knowledge about particle momentum. This is an example of the uncertainty 
principle of Heisenberg, who stated that the product of the simultaneous uncertainties in 
“conjugate variables,” a and b, can never be smaller than the value of Planck’s constant 
h divided by $4\pi$: 

$$\Delta a \cdot \Delta b \ge h/4\pi \tag{1-46}$$

Here, $\Delta a$ is a measure of the uncertainty in the variable a, etc. (The easiest way to 
recognize conjugate variables is to note that their dimensions must multiply to joule 
seconds. Linear momentum and linear position satisfy this requirement. Two other 
important pairs of conjugate variables are energy-time and angular momentum-angular 
position.) In this example with the light off, our uncertainty in momentum is small 
and our uncertainty in position is unacceptably large, since we cannot say which slit 
each electron traverses. With the light on, we reduce our uncertainty in position to 
an acceptable size, but subsequent to the position of each electron being observed, we 
have much greater uncertainty in momentum. 

Thus, we see that the appearance of an electron (or a photon) as a particle or a wave 
depends on our experiment. Because any observation on so small a particle involves a 
significant perturbation of its state, it is proper to think of the electron plus apparatus 
as a single system. The question, “Is the electron a particle or a wave?" becomes 
meaningful only when the apparatus is defined on which we plan a measurement. 
In some experiments, the apparatus and electrons interact in a way suggestive of the 
electron being a wave, in others, a particle. The question, “What is the electron when 
we’re not looking?” cannot be answered experimentally, since an experiment is a “look” 
at the electron. In recent years experiments of this sort have been carried out using 
single atoms.⁵ 


⁵See F. Flam [1]. 

EXAMPLE 1-4 The lifetime of an excited state of a molecule is $2 \times 10^{-9}$ s. What 
is the uncertainty in its energy in J? In cm$^{-1}$? How would this manifest itself 
experimentally? 


SOLUTION » The Heisenberg uncertainty principle gives, for minimum uncertainty, $\Delta E \cdot \Delta t = h/4\pi$. 
$\Delta E = (6.626 \times 10^{-34}\ \text{J s})/[(4\pi)(2 \times 10^{-9}\ \text{s})] = 2.6 \times 10^{-26}\ \text{J}$. $(2.6 \times 10^{-26}\ \text{J})(5.03 \times 10^{22}\ \text{cm}^{-1}\ \text{J}^{-1}) = 0.001\ \text{cm}^{-1}$. 
(See Appendix 10 for data.) Larger uncertainty in E shows up as greater 
line-width in emission spectra. 
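
The estimate in Example 1-4 follows directly from Eq. (1-46) applied to the energy-time pair. The sketch below assumes the minimum-uncertainty case (equality in the relation) and reuses the J to cm$^{-1}$ factor from Example 1-2; the function name is illustrative.

```python
import math

H = 6.626e-34                   # Planck's constant, J s
WAVENUMBER_PER_JOULE = 5.03e22  # cm^-1 per J, from Example 1-2

def min_energy_uncertainty_j(lifetime_s):
    """Return the minimum Delta E = h / (4 pi Delta t) in joules."""
    return H / (4.0 * math.pi * lifetime_s)

if __name__ == "__main__":
    delta_e = min_energy_uncertainty_j(2e-9)
    print(f"Delta E = {delta_e:.1e} J")
    print(f"Delta E = {delta_e * WAVENUMBER_PER_JOULE:.3f} cm^-1")
```

Because $\Delta E$ varies as $1/\Delta t$, a state that lives ten times longer would show a minimum line width ten times narrower.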


1-9 Schrödinger's Time-Independent Wave Equation 


Earlier we saw that we needed a wave equation in order to solve for the standing waves 
pertaining to a particular classical system and its set of boundary conditions. The same 
need exists for a wave equation to solve for matter waves. Schrödinger obtained such 
an equation by taking the classical time-independent wave equation and substituting 
de Broglie’s relation for $\lambda$. Thus, if 

$$\nabla^2 \psi = -(2\pi/\lambda)^2 \psi \tag{1-47}$$

and 

$$\lambda = \frac{h}{\sqrt{2m(E - V)}} \tag{1-48}$$

then 

$$\left[-(h^2/8\pi^2 m)\nabla^2 + V(x, y, z)\right]\psi(x, y, z) = E\psi(x, y, z) \tag{1-49}$$

Equation (1-49) is Schrödinger's time-independent wave equation for a single particle 
of mass m moving in the three-dimensional potential field V. 
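
The substitution that leads from Eqs. (1-47) and (1-48) to Eq. (1-49) can be written out step by step. The lines below are a sketch of that algebra and use nothing beyond those two equations:

```latex
\begin{align*}
\nabla^2\psi &= -\left(\frac{2\pi}{\lambda}\right)^2\psi
  && \text{Eq. (1-47)}\\
\frac{1}{\lambda^2} &= \frac{2m(E-V)}{h^2}
  && \text{from Eq. (1-48)}\\
\nabla^2\psi &= -\frac{4\pi^2}{h^2}\,2m(E-V)\,\psi
  = -\frac{8\pi^2 m}{h^2}(E-V)\,\psi\\
-\frac{h^2}{8\pi^2 m}\nabla^2\psi &= (E-V)\,\psi\\
\left[-\frac{h^2}{8\pi^2 m}\nabla^2 + V\right]\psi &= E\psi
  && \text{Eq. (1-49)}
\end{align*}
```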

In classical mechanics we have separate equations for wave motion and particle 
motion, whereas in quantum mechanics, in which the distinction between particles and 
waves is not clear-cut, we have a single equation: the Schrödinger equation. We have 
seen that the link between the Schrödinger equation and the classical wave equation is 
the de Broglie relation. Let us now compare Schrödinger's equation with the classical 
equation for particle motion. 

Classically, for a particle moving in three dimensions, the total energy is the sum of 
kinetic and potential energies: 


$$(1/2m)(p_x^2 + p_y^2 + p_z^2) + V = E \tag{1-50}$$

where $p_x$ is the momentum in the x coordinate, etc. We have just seen that the analogous 
Schrödinger equation is [writing out Eq. (1-49)] 

$$\left[\frac{-h^2}{8\pi^2 m}\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}\right) + V(x, y, z)\right]\psi(x, y, z) = E\psi(x, y, z) \tag{1-51}$$

It is easily seen that Eq. (1-50) is linked to the quantity in brackets of Eq. (1-51) by a 
relation associating classical momentum with a partial differential operator: 

$$p_x \leftrightarrow (h/2\pi i)(\partial/\partial x) \tag{1-52}$$

and similarly for $p_y$ and $p_z$. The relations (1-52) will be seen later to be an important 
postulate in a formal development of quantum mechanics. 

The left-hand side of Eq. (1-50) is called the hamiltonian for the system. For this 
reason the operator in square brackets on the LHS of Eq. (1-51) is called the hamiltonian 
operator H. For a given system, we shall see that the construction of H is not difficult. 
The difficulty comes in solving Schrödinger's equation, often written as 

$$H\psi = E\psi \tag{1-53}$$


The classical and quantum-mechanical wave equations that we have discussed are 
members of a special class of equations called eigenvalue equations. Such equations 
have the format 


$$\text{Op}\, f = cf \tag{1-54}$$


where Op is an operator, f is a function, and c is a constant. Thus, eigenvalue equations 
have the property that operating on a function regenerates the same function times a 
constant. The function f that satisfies Eq. (1-54) is called an eigenfunction of the 
operator. The constant c is called the eigenvalue associated with the eigenfunction 
f. Often, an operator will have a large number of eigenfunctions and eigenvalues of 
interest associated with it, and so an index is necessary to keep them sorted, viz. 


$$\text{Op}\, f_i = c_i f_i \tag{1-55}$$


We have already seen an example of this sort of equation, Eq. (1-19) being an eigenfunction 
for Eq. (1-18), with eigenvalue $-\omega^2 m/T$. 

The solutions $\psi$ for Schrödinger's equation (1-53) are referred to as eigenfunctions, 
wavefunctions, or state functions. 


EXAMPLE 1-5 a) Show that sin(3.63x) is not an eigenfunction of the operator 
d/dx. 
b) Show that exp(−3.63ix) is an eigenfunction of the operator d/dx. What is its 
eigenvalue? 
c) Show that $\frac{1}{\pi}\sin(3.63x)$ is an eigenfunction of the operator 
$(-h^2/8\pi^2 m)\,d^2/dx^2$. What is its eigenvalue? 


SOLUTION » a) $\frac{d}{dx}\sin(3.63x) = 3.63\cos(3.63x) \ne$ constant times $\sin(3.63x)$. 
b) $\frac{d}{dx}\exp(-3.63ix) = -3.63i\exp(-3.63ix) =$ constant times $\exp(-3.63ix)$. Eigenvalue = 
$-3.63i$. 
c) $\left(\frac{-h^2}{8\pi^2 m}\frac{d^2}{dx^2}\right)\frac{1}{\pi}\sin(3.63x) = \frac{-h^2}{8\pi^2 m}\cdot\frac{1}{\pi}(3.63)\frac{d}{dx}\cos(3.63x)$ 
$= [(3.63)^2 h^2/8\pi^2 m]\cdot(1/\pi)\sin(3.63x)$ 
= constant times $(1/\pi)\sin(3.63x)$. 
Eigenvalue = $(3.63)^2 h^2/8\pi^2 m$. 
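
The three parts of Example 1-5 can also be checked numerically. The sketch below is an illustration, not a proof: it estimates derivatives by finite differences and tests whether the ratio (Op f)/f is the same at several sample points, which is what Eq. (1-54) requires. The step sizes, tolerance, sample points, and function names are all assumptions chosen for this illustration. For part c) the code reports the eigenvalue of $d^2/dx^2$ alone, which must still be multiplied by $-h^2/8\pi^2 m$.

```python
import cmath
import math

def first_derivative(f, x, step=1e-5):
    """Central-difference estimate of df/dx at x."""
    return (f(x + step) - f(x - step)) / (2.0 * step)

def second_derivative(f, x, step=1e-3):
    """Central-difference estimate of d2f/dx2 at x."""
    return (f(x + step) - 2.0 * f(x) + f(x - step)) / (step * step)

def eigenvalue_if_any(operator, f, points, tolerance=1e-4):
    """Return the common ratio (Op f)/f over the sample points, or None.

    An eigenfunction must give the same ratio at every point; a function
    whose ratio varies from point to point is not an eigenfunction.
    """
    ratios = [operator(f, x) / f(x) for x in points]
    first = ratios[0]
    if all(abs(r - first) <= tolerance * max(1.0, abs(first)) for r in ratios):
        return first
    return None

def sine(x):
    return math.sin(3.63 * x) / math.pi

def wave(x):
    return cmath.exp(-3.63j * x)

POINTS = [0.3, 0.7, 1.1, 1.9]  # arbitrary points where the test functions are nonzero

if __name__ == "__main__":
    print("a)", eigenvalue_if_any(first_derivative, sine, POINTS))
    print("b)", eigenvalue_if_any(first_derivative, wave, POINTS))
    print("c)", eigenvalue_if_any(second_derivative, sine, POINTS))
```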


⁶An operator is a symbol telling us to carry out a certain mathematical operation. Thus, d/dx is a differential 
operator telling us to differentiate anything following it with respect to x. The function 1/x may be viewed as a 
multiplicative operator. Any function on which it operates gets multiplied by 1/x. 

1-10 Conditions on $\psi$ 


We have already indicated that the square of the electromagnetic wave is interpreted as 
the probability density function for finding photons at various places in space. We now 
attribute an analogous meaning to $\psi^2$ for matter waves. Thus, in a one-dimensional 
problem (for example, a particle constrained to move on a line), the probability that the 
particle will be found in the interval dx around the point $x_1$ is taken to be $\psi^2(x_1)\,dx$. 
If $\psi$ is a complex function, then the absolute square, $|\psi|^2 = \psi^*\psi$, is used instead of 
$\psi^2$.⁷ This makes it mathematically impossible for the average mass distribution to be 
negative in any region. 

If an eigenfunction $\psi$ has been found for Eq. (1-53), it is easy to see that $c\psi$ will 
also be an eigenfunction, for any constant c. This is due to the fact that a multiplicative 
constant commutes⁸ with the operator H, that is, 

$$H(c\psi) = cH\psi = cE\psi = E(c\psi) \tag{1-56}$$

The equality of the first and last terms is a statement of the fact that $c\psi$ is an eigenfunction 
of H. The question of which constant to use for the wavefunction is resolved 
by appeal to the probability interpretation of $|\psi|^2$. For a particle moving on the x axis, 
the probability that the particle is between $x = -\infty$ and $x = +\infty$ is unity, that is, a 
certainty. This probability is also equal to the sum of the probabilities for finding the 
particle in each and every infinitesimal interval along x, so this sum (an integral) must 
equal unity: 

$$c^*c \int_{-\infty}^{+\infty} \psi^*(x)\psi(x)\,dx = 1 \tag{1-57}$$

If the selection of the constant multiplier c is made so that Eq. (1-57) is satisfied, 
the wavefunction $\psi' = c\psi$ is said to be normalized. For a three-dimensional function, 
$c\psi(x, y, z)$, the normalization requirement is 

$$c^*c \int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty} \psi^*(x, y, z)\psi(x, y, z)\,dx\,dy\,dz = c^*c \int_{\text{all space}} |\psi|^2\,dv = 1 \tag{1-58}$$
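
As an illustration of the one-dimensional requirement, Eq. (1-57), the sketch below finds the normalization constant numerically. The trial function $\exp(-x^2/2)$, the finite integration limits standing in for $\pm\infty$, and the function names are assumptions chosen for this illustration; they do not come from the text. For this function the integral of $|\psi|^2$ is $\sqrt{\pi}$, so c should come out as $\pi^{-1/4}$.

```python
import math

def integrate(f, lower, upper, n_steps=20000):
    """Trapezoidal-rule estimate of the integral of f from lower to upper."""
    width = (upper - lower) / n_steps
    total = 0.5 * (f(lower) + f(upper))
    for i in range(1, n_steps):
        total += f(lower + i * width)
    return total * width

def normalization_constant(psi, lower, upper):
    """Return a real, positive c such that c*c times the integral of |psi|^2 is 1."""
    norm_squared = integrate(lambda x: abs(psi(x)) ** 2, lower, upper)
    if norm_squared == 0 or math.isinf(norm_squared):
        raise ValueError("function is not normalizable on this interval")
    return 1.0 / math.sqrt(norm_squared)

def gaussian(x):
    """Unnormalized trial function exp(-x^2/2)."""
    return math.exp(-x * x / 2.0)

if __name__ == "__main__":
    # The limits -10 and 10 stand in for -infinity and +infinity; the
    # integrand is negligible beyond them.
    c = normalization_constant(gaussian, -10.0, 10.0)
    check = integrate(lambda x: (c * gaussian(x)) ** 2, -10.0, 10.0)
    print(f"c = {c:.6f}, integral of |c psi|^2 = {check:.6f}")
```

Choosing c real and positive is a convention: any c with the same absolute value satisfies Eq. (1-57) equally well.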


As a result of our physical interpretation of $|\psi|^2$ plus the fact that $\psi$ must be an 
eigenfunction of the hamiltonian operator H, we can reach some general conclusions 
about what sort of mathematical properties $\psi$ can or cannot have. 

First, we require that $\psi$ be a single-valued function because we want $|\psi|^2$ to give an 
unambiguous probability for finding a particle in a given region (see Fig. 1-12). Also, 
we reject functions that are infinite in any region of space because such an infinity 
will always be infinitely greater than any finite region, and $|\psi|^2$ will be useless as a 
measure of comparative probabilities.⁹ In order for $H\psi$ to be defined everywhere, it 
is necessary that the second derivative of $\psi$ be defined everywhere. This requires that 
the first derivative of $\psi$ be piecewise continuous and that $\psi$ itself be continuous as in 
Fig. 1-12d. (We shall see an example of this shortly.) 


⁷If f = u + iv, then f*, the complex conjugate of f, is given by u − iv, where u and v are real functions. 

⁸a and b are said to commute if ab = ba. 

⁹There are cases, particularly in relativistic treatments, where $\psi$ is infinite at single points of zero measure, so 
that $|\psi|^2\,dx$ remains finite. Normally we do not encounter such situations in quantum chemistry. 

[Figure 1-12 panels: (a) not acceptable (not single valued); (b) not acceptable (not continuous); (c) not acceptable (has infinite value at $x = \infty$); (d) acceptable.] 

Figure 1-12 » (a) $\psi$ is triple valued at $x_0$. (b) $\psi$ is discontinuous at $x_0$. (c) $\psi$ grows without limit 
as x approaches $+\infty$ (i.e., $\psi$ “blows up,” or “explodes”). (d) $\psi$ is continuous and has a “cusp” at $x_0$. 
Hence, first derivative of $\psi$ is discontinuous at $x_0$ and is only piecewise continuous. This does not 
prevent $\psi$ from being acceptable. 


Functions that are single-valued, continuous, nowhere infinite, and have piecewise 
continuous first derivatives will be referred to as acceptable functions. The meanings 
of these terms are illustrated by some sample functions in Fig. 1-12. 

In most cases, there is one more general restriction we place on ψ, namely, that 
it be a normalizable function. This means that the integral of |ψ|² over all space 
must not be equal to zero or infinity. A function satisfying this condition is said to 
be square-integrable. 


1-11 Some Insight into the Schrödinger Equation 


There is a fairly simple way to view the physical meaning of the Schrödinger equation 
(1-49). The equation essentially states that E in Hψ = Eψ depends on two things, V 
and the second derivatives of ψ. Since V is the potential energy, the second derivatives 
of ψ must be related to the kinetic energy. Now the second derivative of ψ with respect 
to a given direction is a measure of the rate of change of slope (i.e., the curvature, or 
“wiggliness”) of ψ in that direction. Hence, we see that a more wiggly wavefunction 
leads, through the Schrödinger equation, to a higher kinetic energy. This is in accord 
with the spirit of de Broglie's relation, since a shorter wavelength function is a more 
wiggly function. But the Schrödinger equation is more generally applicable because we 
can take second derivatives of any acceptable function, whereas wavelength is defined 
only for periodic functions. Since E is a constant, the solutions of the Schrödinger 
equation must be more wiggly in regions where V is low and less wiggly where V is 
high. Examples for some one-dimensional cases are shown in Fig. 1-13. 

Figure 1-13 » (a) Since V = 0, E = T. For higher T, ψ is more wiggly, which means that λ is 
shorter. (Since ψ is periodic for a free particle, λ is defined.) (b) As V increases from left to right, ψ 
becomes less wiggly. (c)-(d) ψ is most wiggly where V is lowest and T is greatest. 
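The link between wiggliness and kinetic energy can be made concrete with a short worked case. This is an illustrative sketch rather than one of the text's numbered equations; it assumes a free particle (V = 0) whose wavefunction is a pure sine wave of wavelength λ, and it uses the one-dimensional kinetic energy operator −(h²/8π²m) d²/dx².

$$\psi(x) = \sin\!\left(\frac{2\pi x}{\lambda}\right), \qquad -\frac{h^2}{8\pi^2 m}\frac{d^2\psi}{dx^2} = \frac{h^2}{8\pi^2 m}\left(\frac{2\pi}{\lambda}\right)^{2}\psi = \frac{h^2}{2m\lambda^2}\,\psi$$

The kinetic energy is therefore T = h²/2mλ², which is p²/2m with p = h/λ. Halving the wavelength (doubling the wiggliness) quadruples T.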

In the next chapter we use some fairly simple examples to illustrate the ideas that 
we have already introduced and to bring out some additional points. 


1-12 Summary 


In closing this chapter, we collect and summarize the major points to be used in 
future discussions. 


1. Associated with any particle is a wavefunction having wavelength related to particle 
momentum by λ = h/p = h/√(2m(E − V)). 


2. The wavefunction has the following physical meaning: its absolute square is 
proportional to the probability density for finding the particle. If the wavefunction is 
normalized, its absolute square is equal to the probability density. 


3. The wavefunctions ψ for time-independent states are eigenfunctions of Schrödinger's 
equation, which can be constructed from the classical wave equation by requiring 
λ = h/√(2m(E − V)), or from the classical particle equation by replacing p_k 
with (h/2πi)∂/∂k, k = x, y, z. 


4. For ψ to be acceptable, it must be single-valued, continuous, nowhere infinite, with 
a piecewise continuous first derivative. For most situations, we also require ψ to 
be square-integrable. 


5. The wavefunction for a particle in a varying potential oscillates most rapidly where 
V is low, giving a high T in this region. The low V plus high T equals E. In another 
region, where V is high, the wavefunction oscillates more slowly, giving a low T, 
which, with the high V, equals the same E as in the first region. 


1-12.A Problems¹⁰ 


1-1. Express A cos(kx) + B sin(kx) + C exp(ikx) + D exp(−ikx) purely in terms of 
cos(kx) and sin(kx). 


1-2. Repeat the standing-wave-in-a-string problem worked out in Section 1-4, but 
clamp the string at x = +L/2 and −L/2 instead of at 0 and L. 


1-3. Find the condition that must be satisfied by α and β in order that ψ(x) = 
A sin(αx) + B cos(βx) satisfy Eq. (1-20). 


1-4. The apparatus sketched in Fig. 1-8 is used with a dish plated with zinc and also 
with a dish plated with cesium. The wavelengths of the incident light and the 
corresponding retarding potentials needed to just prevent the photoelectrons from 
reaching the collecting wire are given in Table P1-4. Plot incident light frequency 
versus retarding potential for these two metals. Evaluate their work functions 
(in eV) and the proportionality constant h (in eV s). 


TABLE P1-4 » 
              Retarding potential (V) 

λ (Å)         Cs         Zn 
6000          0.167      – 
3000          2.235      0.435 
2000          4.302      2.502 
1500          6.369      4.569 
1200          8.436      6.636 


1-5. Calculate the de Broglie wavelength in nanometers for each of the following: 

a) An electron that has been accelerated from rest through a potential change of 
500 V. 
b) A bullet weighing 5 g and traveling at 400 m s⁻¹. 


1-6. Arguing from Eq. (1-7), what is the time needed for a standing wave to go through 
one complete cycle? 


1-7. The equation for a standing wave in a string has the form 

Ψ(x, t) = ψ(x) cos(ωt) 

a) Calculate the time-averaged potential energy (PE) for this motion. [Hint: Use 
PE = −∫F dΨ; F = ma; a = ∂²Ψ/∂t².] 

b) Calculate the time-averaged kinetic energy (KE) for this motion. [Hint: Use 
KE = (1/2)mv² and v = ∂Ψ/∂t.] 

c) Show that this harmonically vibrating string stores its energy on the average 
half as kinetic and half as potential energy, and that E(av) ∝ ψ²(x). 


1-8. Indicate which of the following functions are “acceptable.” If one is not, give 
a reason. 

a) ψ = x 
b) ψ = x² 
c) ψ = sin x 
d) ψ = exp(−x) 
e) ψ = exp(−x²) 


1-9. An acceptable function is never infinite. Does this mean that an acceptable 
function must be square integrable? If you think these are not the same, try to 
find an example of a function (other than zero) that is never infinite but is not 
square integrable. 


1-10. Explain why the fact that sin(x) = −sin(−x) means that we can restrict 
Eq. (1-32) to nonnegative n without loss of physical content. 


1-11. Which of the following are eigenfunctions for d/dx? 

a) x² 
b) exp(−3.4x²) 
c) 37 
d) exp(x) 
e) sin(ax) 
f) cos(4x) + i sin(4x) 


1-12. Calculate the minimum de Broglie wavelength for a photoelectron that is 
produced when light of wavelength 140.0 nm strikes zinc metal. (Work function of 
zinc = 3.63 eV.) 


¹⁰Hints for a few problems may be found in Appendix 12 and answers for almost all of them appear in 
Appendix 13. 


Multiple Choice Questions 


(Intended to be answered without use of pencil and paper.) 


1. A particle satisfying the time-independent Schrödinger equation must have 


a) an eigenfunction that is normalized. 
b) a potential energy that is independent of location. 
c) a de Broglie wavelength that is independent of location. 
d) a total energy that is independent of location. 
e) None of the above is a true statement. 


2. When one operates with d²/dx² on the function 6 sin(4x), one finds that 


a) the function is an eigenfunction with eigenvalue —96. 
b) the function is an eigenfunction with eigenvalue 16. 
c) the function is an eigenfunction with eigenvalue —16. 
d) the function is not an eigenfunction. 

e) None of the above is a true statement. 


3. Which one of the following concepts did Einstein propose in order to explain the 
photoelectric effect? 


a) A particle of rest mass m and velocity v has an associated wavelength λ given by 
λ = h/mv. 

b) Doubling the intensity of light doubles the energy of each photon. 

c) Increasing the wavelength of light increases the energy of each photon. 

d) The photoelectron is a particle. 

e) None of the above is a concept proposed by Einstein to explain the photoelectric 
effect. 


4. Light of frequency ν strikes a metal and causes photoelectrons to be emitted having 
maximum kinetic energy of 0.90 hν. From this we can say that 


a) light of frequency ν/2 will not produce any photoelectrons. 

b) light of frequency 2ν will produce photoelectrons having maximum kinetic 
energy of 1.80 hν. 

c) doubling the intensity of light of frequency ν will produce photoelectrons having 
maximum kinetic energy of 1.80 hν. 

d) the work function of the metal is 0.90 hν. 

e) None of the above statements is correct. 


5. The reason for normalizing a wavefunction ψ is 


a) to guarantee that ψ is square-integrable. 
b) to make ψ*ψ equal to the probability distribution function for the particle. 
c) to make ψ an eigenfunction for the Hamiltonian operator. 
d) to make ψ satisfy the boundary conditions for the problem. 
e) to make ψ display the proper symmetry characteristics. 


Chapter 2 


Quantum Mechanics of Some 
Simple Systems 


2-1 The Particle in a One-Dimensional “Box” 


Imagine that a particle of mass m is free to move along the x axis between x = 0 and 
x = L, with no change in potential (set V = 0 for 0 < x < L). At x = 0 and L and at all 
points beyond these limits the particle encounters an infinitely repulsive barrier (V = ∞ 
for x < 0, x > L). The situation is illustrated in Fig. 2-1. Because of the shape of this 
potential, this problem is often referred to as a particle in a square well or a particle in 
a box problem. It is well to bear in mind, however, that the situation is really like that 
of a particle confined to movement along a finite length of wire. 

When the potential is discontinuous, as it is here, it is convenient to write a wave 
equation for each region. For the two regions beyond the ends of the box 

$$-\frac{h^2}{8\pi^2 m}\frac{d^2\psi}{dx^2} + \infty\,\psi = E\psi, \qquad x<0,\ x>L \tag{2-1}$$

Within the box, ψ must satisfy the equation 

$$-\frac{h^2}{8\pi^2 m}\frac{d^2\psi}{dx^2} = E\psi, \qquad 0<x<L \tag{2-2}$$

It should be realized that E must take on the same values for both of these equations; 
the eigenvalue E pertains to the entire range of the particle and is not influenced by 
divisions we make for mathematical convenience. 

Figure 2-1 » The potential felt by a particle as a function of its x coordinate. 

Let us examine Eq. (2-1) first. Suppose that, at some point within the infinite barrier, 
say x = L + dx, ψ is finite. Then the second term on the left-hand side of Eq. (2-1) will 
be infinite. If the first term on the left-hand side is finite or zero, it follows immediately 
that E is infinite at the point L + dx (and hence everywhere in the system). Is it 
possible that a solution exists such that E is finite? One possibility is that ψ = 0 at all 
points where V = ∞. The other possibility is that the first term on the left-hand side 
of Eq. (2-1) can be made to cancel the infinite second term. This might happen if the 
second derivative of the wavefunction is infinite at all points where V = ∞ and ψ ≠ 0. 
For the second derivative to be infinite, the first derivative must be discontinuous, and 
so ψ itself must be nonsmooth (i.e., it must have a sharp corner; see Fig. 2-2). Thus, 
we see that it may be possible to obtain a finite value for both E and ψ at x = L + dx, 
provided that ψ is nonsmooth there. But what about the next point, x = L + 2dx, and 
all the other points outside the box? If we try to use the same device, we end up with 
the requirement that ψ be nonsmooth at every point where V = ∞. A function that is 
continuous but which has a point-wise discontinuous first derivative is a contradiction 
in terms (i.e., a continuous f cannot be 100% corners. To have recognizable corners, 
we must have some (continuous) edges. We say that the first derivative of ψ must be 
piecewise continuous.) Hence, if V = ∞ at a single point, we might find a solution ψ 
which is finite at that point, with finite energy. If V = ∞ over a finite range of connected 
points, however, either E for the system is infinite, and ψ is finite over this region or 
E is not infinite (but is indeterminate) and ψ is zero over this region. 

We are not concerned with particles of infinite energy, and so we will say that the 
solution to Eq. (2-1) is ψ = 0.¹ 

Turning now to Eq. (2-2) we ask what solutions ψ exist in the box having associated 
eigenvalues E that are finite and positive. Any function that, when twice differentiated, 
yields a negative constant times the selfsame function is a possible candidate for ψ. 
Such functions are sin(kx), cos(kx), and exp(±ikx). But these functions are not all 
independent since, as we noted in Chapter 1, 

$$\exp(\pm ikx) = \cos(kx) \pm i\sin(kx) \tag{2-3}$$

We thus are free to express ψ in terms of exp(±ikx) or else in terms of sin(kx) and 
cos(kx). We choose the latter because of their greater familiarity, although the final 
answer must be independent of this choice. 

The most general form for the solution is 

$$\psi(x) = A\sin(kx) + B\cos(kx) \tag{2-4}$$

where A, B, and k remain to be determined. As we have already shown, ψ is zero at 
x < 0, x > L and so we have as boundary conditions 

$$\psi(0) = 0 \tag{2-5}$$

$$\psi(L) = 0 \tag{2-6}$$

Mathematically, this is precisely the same problem we have already solved in 
Chapter 1 for the standing waves in a clamped string. The solutions are 

$$\psi(x) = A\sin(n\pi x/L), \quad n = 1, 2, \ldots, \quad 0<x<L; \qquad \psi(x) = 0, \quad x<0,\ x>L \tag{2-7}$$


¹Thus, the particle never gets into these regions. It is meaningless to talk of the energy of the particle in such 
regions, and our earlier statement that E must be identical in Eqs. (2-1) and (2-2) must be modified; E is constant 
in all regions where ψ is finite. 

Figure 2-2 » As the function f(x) approaches being nonsmooth, δ approaches zero (the width of 
one point) and n approaches infinity. 


One difference between Eq. (2-7) and the string solutions is that we have rejected the 
n = 0 solution in Eq. (2-7). For the string, this solution was for no vibration at all, 
a physically realizable circumstance. For the particle-in-a-box problem, this solution 
is rejected because it is not square-integrable. (It gives ψ = 0, which means no particle 
on the x axis, contradicting our starting premise. One could also reject this solution 
for the classical case since it means no energy in the string, which might contradict a 
starting premise depending on how the problem is worded.) 

Let us check to be sure these functions satisfy Schrödinger's equation: 

$$H\psi(x) = -\frac{h^2}{8\pi^2 m}\frac{d^2[A\sin(n\pi x/L)]}{dx^2} = -\frac{h^2}{8\pi^2 m}\left[-A\,\frac{n^2\pi^2}{L^2}\sin\!\left(\frac{n\pi x}{L}\right)\right] = \frac{n^2h^2}{8mL^2}\left[A\sin\!\left(\frac{n\pi x}{L}\right)\right] = E\psi(x) \tag{2-8}$$

This shows that the functions (2-7) are indeed eigenfunctions of H. We note in passing 
that these functions are acceptable in the sense of Chapter 1. 

The only remaining parameter is the constant A. We set this to make the probability 
of finding the particle in the well equal to unity: 

$$\int_0^L \psi^2(x)\,dx = A^2\int_0^L \sin^2(n\pi x/L)\,dx = 1 \tag{2-9}$$

This leads to (Problem 2-2) 

$$A = \sqrt{2/L} \tag{2-10}$$

which completes the solving of Schrödinger's time-independent equation for the 
problem. Our results are the normalized eigenfunctions 

$$\psi_n(x) = \sqrt{2/L}\,\sin(n\pi x/L), \qquad n = 1, 2, 3, \ldots \tag{2-11}$$

and the corresponding eigenvalues, from Eq. (2-8), 

$$E_n = n^2h^2/8mL^2, \qquad n = 1, 2, 3, \ldots \tag{2-12}$$

Each different value of n corresponds to a different stationary state of this system. 
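The eigenvalue relation can be spot-checked numerically. The sketch below is illustrative only: the function names, the choice of an electron in a 258 pm box, the sample point x = 0.3L, and the finite-difference step are assumptions, not part of the text. It applies −(h²/8π²m) d²/dx² to the normalized sine functions by central differences and compares the result with n²h²/8mL².

```python
import math

PLANCK_H = 6.626e-34  # Planck constant in J s (rounded value)

def box_energy(n, m, L):
    """Eigenvalue E_n = n^2 h^2 / (8 m L^2) for the particle in a box."""
    return n ** 2 * PLANCK_H ** 2 / (8.0 * m * L ** 2)

def box_psi(n, x, L):
    """Normalized eigenfunction sqrt(2/L) sin(n pi x / L); zero outside the box."""
    if x < 0.0 or x > L:
        return 0.0
    return math.sqrt(2.0 / L) * math.sin(n * math.pi * x / L)

def local_energy(n, x, m, L, dx):
    """Finite-difference estimate of (H psi)/psi at an interior point x."""
    second = (box_psi(n, x + dx, L) - 2.0 * box_psi(n, x, L)
              + box_psi(n, x - dx, L)) / dx ** 2
    return -PLANCK_H ** 2 / (8.0 * math.pi ** 2 * m) * second / box_psi(n, x, L)

# Illustrative (assumed) parameters: an electron in a 258 pm box.
m_e, L = 9.11e-31, 258e-12
for n in (1, 2, 3):
    exact = box_energy(n, m_e, L)
    approx = local_energy(n, 0.3 * L, m_e, L, dx=1e-4 * L)
    # Columns: n, E_n in units of E_1, finite-difference estimate / exact value
    print(n, exact / box_energy(1, m_e, L), approx / exact)
```

The second column should follow the 1, 4, 9 pattern of n², and the third should stay very close to 1, since (Hψ)/ψ is the same constant E at every interior point for an eigenfunction.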


2-2 Detailed Examination of Particle-in-a-Box Solutions 


Having solved the Schrödinger equation for the particle in the infinitely deep square- 
well potential, we now examine the results in more detail. 


2-2.A Energies 


The most obvious feature of the energies is that, as we move through the allowed states 
(n = 1, 2, 3, ...), E skips from one discrete, well-separated value to another (1, 4, 9 
in units of h²/8mL²). Thus, the particle can have only certain discrete energies; the 
energy is quantized. This situation is normally indicated by sketching the allowed 
energy levels as horizontal lines superimposed on the potential energy sketch, as in 
Fig. 2-3a. The fact that each energy level is a horizontal line emphasizes the fact that E is 
a constant and is the same regardless of the x coordinate of the particle. For this reason, 
E is called a constant of motion. The dependence of E on n? is displayed in the increased 
spacing between levels with increasing n in Fig. 2-3a. The number n is called a quantum 
number. 

We note also that E is proportional to L⁻². This means that the more tightly a particle 
is confined, the greater is the spacing between the allowed energy levels. Alternatively, 
as the box is made wider, the separation between energies decreases and, in the limit 
of an infinitely wide box, disappears entirely. Thus, we associate quantized energies 
with spatial confinement. 

For some systems, the degree of confinement of a particle depends on its total energy. 
For example, a pendulum swings over a longer trajectory if it has higher energy. The 
potential energy for a pendulum is given by V = ½kx² and is sketched in Fig. 2-3b. If 
one solves the Schrödinger equation for this system (see Chapter 3), one finds that the 
energies are proportional to n rather than n². We can rationalize this by thinking of 
the particle as occupying successively bigger boxes as we go to higher energies. This 
counteracts the n² increase in energy levels found for constant box width. For the 
potential V = −1/|x| (which is the one-dimensional analog of a hydrogen atom) E 
varies as 1/n² (Fig. 2-3c), and this is also consistent with the effective increase in L 
with increasing E. 

Figure 2-3 » Allowed energies for a particle in various one-dimensional potentials. (a) “Box” with 
infinite walls. (b) Quadratic potential, V = ½kx². (c) V = −1/|x|. Tendency for higher levels in (b) 
and (c) not to diverge as in (a) is due to larger “effective box size” for higher energies in (b) and (c). 

The energy is proportional to 1/m. This means that the separation between allowed 
energy levels decreases as m increases. Ultimately, for a macroscopic object, m is so 
large that the levels are too closely spaced to be distinguished from the continuum of 
levels expected in classical mechanics. This is an example of the correspondence prin- 
ciple, which, in its most general form, states that the predictions of quantum mechanics 
must pass smoothly into those of classical mechanics whenever we progress in a con- 
tinuous way from the microscopic to the macroscopic realm. 

Notice that the lowest possible energy for this system occurs for n = 1 and is E = 
h²/8mL². This remarkable result means that a constrained particle (i.e., L not infinite) 
can never have an energy of zero. Evidently, the particle continues to move about in 
the region 0 to L, even at a temperature of absolute zero. For this reason, h²/8mL² 
is called the zero-point energy for this system. In general, a finite zero-point energy 
occurs in any system having a restriction for motion in any coordinate. (Note that finite 
here means not equal to zero.) 

It is possible to show that, for L ≠ ∞, our particle in a box would have to violate the 
Heisenberg uncertainty principle to achieve an energy of zero. For, suppose the energy 
is precisely zero. Then the momentum must be precisely zero too. (In this system, 
all energy of the particle is kinetic since V = 0 in the box.) If the momentum p_x is 
precisely zero, however, our uncertainty in the value of the momentum Δp_x is also 
zero. If Δp_x is zero, the uncertainty principle [Eq. (2-46)] requires that the uncertainty 
in position Δx be infinite. But we know that the particle is between x = 0 and x = L. 
Hence, our uncertainty is on the order of L, not infinity, and the uncertainty principle 
is not satisfied. However, when L = ∞ (the particle is unconstrained), it is possible for 
the uncertainty principle to be satisfied simultaneously with having E = 0, and this is in 
satisfying accord with the fact that E = h²/8mL² goes to zero as L approaches infinity. 

Finally, we note that each separate value of n leads to a different energy. Thus, 
no two states have the same energy, and the states are said to be nondegenerate with 
respect to energy. 

EXAMPLE 2-1 Consider an electron in a one-dimensional box of length 258 pm. 
a) What is the zero-point energy (ZPE) for this system? For a mole of such systems? 
b) What electronic speed classically corresponds to this ZPE? Compare to the speed 
of light. 


SOLUTION » a) ZPE = E_lowest = E_{n=1} 

$$E_{n=1} = \frac{1^2h^2}{8mL^2} = \frac{(1)^2(6.626\times10^{-34}\ \text{J s})^2}{8(9.11\times10^{-31}\ \text{kg})(258\times10^{-12}\ \text{m})^2} = 9.05\times10^{-19}\ \text{J}$$

Per mole, this equals 
(9.05 × 10⁻¹⁹ J)(6.022 × 10²³ mol⁻¹)(1 kJ/10³ J) = 545 kJ mol⁻¹ 


b) E is all kinetic energy since V = 0 in the box, so E = mv²/2. Hence, 

$$v = \left(\frac{2E}{m}\right)^{1/2} = \left[\frac{2(9.05\times10^{-19}\ \text{J})}{9.11\times10^{-31}\ \text{kg}}\right]^{1/2} = 1.41\times10^{6}\ \text{m s}^{-1}$$

Compared to the speed of light, this is (1.41 × 10⁶ m s⁻¹)/(2.998 × 10⁸ m s⁻¹) = 0.0047, or about 
0.5% of the speed of light. 
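The arithmetic of Example 2-1 can be reproduced in a few lines. This is a sketch for checking the numbers, using the same rounded constants as the example; the variable names are our own.

```python
import math

h = 6.626e-34        # Planck constant, J s
m_e = 9.11e-31       # electron mass, kg
N_A = 6.022e23       # Avogadro constant, mol^-1
c_light = 2.998e8    # speed of light, m s^-1
L = 258e-12          # box length, m

zpe = h ** 2 / (8.0 * m_e * L ** 2)       # n = 1 energy of the box, J
zpe_molar = zpe * N_A / 1000.0            # kJ mol^-1
speed = math.sqrt(2.0 * zpe / m_e)        # from E = m v^2 / 2, m s^-1
fraction_of_c = speed / c_light

print(f"ZPE = {zpe:.3e} J ({zpe_molar:.0f} kJ/mol)")
print(f"v = {speed:.3e} m/s, v/c = {fraction_of_c:.4f}")
```

Because the speed comes out well under 1% of the speed of light, the nonrelativistic expression E = mv²/2 used in part b) is a reasonable approximation here.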


2-2.B Wavefunctions 


We turn now to the eigenfunctions (2-11) for this problem. These are typically displayed 
by superimposing them on the energy levels as shown in Fig. 2-4 for the three 
lowest-energy wavefunctions. (It should be recognized that the energy units of the vertical 
axis do not pertain to the amplitudes of the wavefunctions.) 

Figure 2-4 » The eigenfunctions corresponding to n = 1, 2, 3, plotted on the corresponding energy 
levels. The energy units of the ordinate do not refer to the wavefunctions ψ. Each wavefunction has 
a zero value wherever it intersects its own energy level, and a maximum value of √(2/L). 

It is apparent from Fig. 2-4 that the allowed wavefunctions for this system could 
have been produced merely by placing an integral number of half sine waves in the 
range 0 to L. The resulting wavelengths would then yield the energy of each state through 
application of de Broglie's relation (1-42). Thus, by inspection of Fig. 2-4, the allowed 
wavelengths are 

$$\lambda = 2L/n, \qquad n = 1, 2, 3, \ldots \tag{2-13}$$

Therefore 

$$p = h/\lambda = nh/2L \tag{2-14}$$

and 

$$E = p^2/2m = n^2h^2/8mL^2 \tag{2-15}$$

in agreement with Eq. (2-12). As pointed out in Section 1-11, the wavefunctions having 
higher kinetic energy oscillate more rapidly. (Here V = 0, and E is all kinetic energy.) 

Let us now consider the physical meaning of the eigenfunctions ψ. According to our 
earlier discussion, ψ² summarizes the results of many determinations of the position 
of the particle. Suppose that we had a particle-in-a-box system that we had somehow 
prepared in such a way that we knew it to be in the state with n = 1. We can imagine 
some sort of experiment, such as flashing a powerful γ-ray flashbulb and taking an 
instantaneous photograph, which tells us where the particle was at the instant of the 
flash. Now, suppose we wish to determine the position of the particle again. We want 
this second determination to be for the n = 1 state also, but we cannot use our original 
system for this because we have “spoiled” it by our first measurement process. Hitting 
the particle with one or more γ-ray photons has knocked it into some other state, and 
we do not even know which one. Therefore, we must either reprepare our system, or 
else use a separate system whose preparation is identical to that of the first system. In 
general we shall assume that we have an inexhaustible supply of identically prepared 
systems. Therefore, we take a second photograph (on our second system) using the 
same photographic plate. Then we photograph a third system, a fourth, etc., until we 
have amassed a large number of images of the particle on the film. The distribution 
of these images is given by ψ₁². (Since ψ is always a real function for this system, 
we do not need to bother with ψ*ψ.) Other states, like ψ₂, ψ₃, will lead to different 
distributions of images. The results for the several states are depicted in Fig. 2-5. It 
is obvious that the probability for finding the particle near the center of the box is 
predicted to be much larger for the n = 1 than the n = 2 state. 

The probability for finding the particle at the midpoint of the “wire” in the n = 2 state 
approaches zero in the limit of our measurement becoming precise enough to observe 
a single point. This troubles many students at first encounter because they worry about 
how the particle can get from one side of the box to the other in the n = 2 state. In fact, 
this question can be raised for any state whose wavefunction has any nodes. However, 
our discussion in the preceding paragraph shows that this question, like the question, 
“Is an electron a particle or a wave when we are not looking?” has no meaningful 
answer because no experiment can be conceived that would answer it. To test whether 
or not the particle does travel from one side of the box to the other, we would have to 
prepare the system in the n = 2 state and measure the position of the particle enough 
times so that we either (a) always find it on the same side (requires many measurements 
for confidence), or (b) find it on different sides (requires at least two measurements). 
But for our question to be answered, the system must be in the n = 2 state throughout 
this entire experiment, and we have seen that the process of measuring particle position 
prevents this. (If we find the particle first on the left and later on the right, we cannot 
be sure it did not travel across the midpoint while the system was perturbed by the first 
measurement.) Thus, the sketches in Fig. 2-5 are most safely regarded as a summary 
of the results of measurements on an ensemble of systems. 

Figure 2-5 » ψ² and observed particle distribution for the three lowest-energy and one high-energy 
(n very large) state of the particle in a one-dimensional box. 

Classically, since the particle has constant energy, hence constant speed, we would 
expect the particle to spend equal time in each line segment dx between 0 and L, but 
Fig. 2-5 shows that the quantum system with n = 1 predicts that the particle spends 
more time in segments near the center. It is characteristic of lower-energy states of 
quantum-mechanical systems to display "anti-classical" distributions. With higher 
quantum numbers, the distribution becomes difficult to distinguish from the distribution 
predicted by classical physics (see Fig. 2-5). This is another example of the tendency 
of quantum-mechanical predictions to approach classical predictions when one goes 
toward the macroscopic realm (here large n and therefore large E). 


EXAMPLE 2-2 For a particle in the n = 2 state in a one-dimensional box of length L, 
a) estimate the probability, p, for finding the particle between x = 0 and x = 0.20L. 
b) calculate the probability that you estimated in part a. 
c) what probability for finding the particle between x = 0 and x = 0.20L is predicted 
by classical physics? 

SOLUTION » a) The sketch of ψ₂² in Fig. 2-5 makes it clear that the probability for finding the 
particle in the range 0 ≤ x ≤ L/4 is 0.25 (equal to the area under the curve). The range 0 ≤ x ≤ 0.20L 
is 20% shorter, and the missing 20% of the range is associated with a relatively large probability, 
almost double the average in the range. That means we are missing nearly 40% of the probability, so 
slightly more than 60% remains. 60% of 0.25 is 0.15, so the probability, p, in the range 0 to 0.20L 
is slightly larger than 0.15. 


b) 

$$\begin{aligned}
p &= \int_0^{0.2L} \psi_2^2\,dx = \frac{2}{L}\int_0^{0.2L} \sin^2\!\left(\frac{2\pi x}{L}\right) dx \\
&= \left(\frac{2}{L}\right)\left(\frac{L}{2\pi}\right)\int_0^{0.2L} \sin^2\!\left(\frac{2\pi x}{L}\right) d\!\left(\frac{2\pi x}{L}\right) = \frac{1}{\pi}\int_0^{0.4\pi} \sin^2 y\,dy \\
&= \frac{1}{\pi}\left[\frac{y}{2} - \frac{1}{4}\sin 2y\right]_0^{0.4\pi} \quad \text{(from Appendix 1)} \\
&= \frac{1}{\pi}\left\{0.2\pi - \frac{1}{4}\sin 0.8\pi\right\} = 0.20 - \frac{1}{4\pi}\sin 0.8\pi = 0.153
\end{aligned}$$


c) The classical particle travels with constant speed, hence has a constant probability function. 
Therefore, the probability for finding the particle in any 20% of the box is 0.20. 
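The result of part b), and the approach to the classical value at high quantum number, can be checked with a short sketch. The function name `box_probability` is our own, the closed form is simply the analytic integral of (2/L) sin²(nπx/L), and n = 101 is an arbitrary illustrative choice of "large" n.

```python
import math

def box_probability(n, a, b):
    """Probability of finding the particle between x = a*L and x = b*L in state n.

    Uses the analytic integral of (2/L) sin^2(n pi x / L); a and b are
    fractions of the box length, with 0 <= a <= b <= 1.
    """
    def antiderivative(u):
        return u - math.sin(2.0 * n * math.pi * u) / (2.0 * n * math.pi)
    return antiderivative(b) - antiderivative(a)

p_quantum = box_probability(2, 0.0, 0.20)    # part b) of Example 2-2
p_quarter = box_probability(2, 0.0, 0.25)    # the 0.25 used in the estimate of part a)
p_high_n = box_probability(101, 0.0, 0.20)   # a high-energy state
p_classical = 0.20                           # part c): uniform distribution

print(round(p_quantum, 3), round(p_quarter, 3), round(p_high_n, 4), p_classical)
```

For n = 2 this gives 0.153, in agreement with the worked integral, while for n = 101 the probability differs from the classical 0.20 by less than 0.002.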


2-2.C Symmetry of Wavefunctions 


Inspection of Fig. 2-5 shows that the particle has equal probabilities for being observed 
in the left half and right half of the box, regardless of state. This seems reasonable 
because there is no physical factor discriminating between these halves. We shall now 
show that the hamiltonian operator is invariant for a reflection through the box center, 
and that a necessary consequence of this is that y has certain symmetry properties. 

First, we show that H is invariant. Reflection through the box center is accomplished 
by replacing x by —x + L. We can define a reflection operator R such that R f(x) = 
f(—x + L); i.e., R reflects any function through a plane normal to x at x = L/2 (see 
Fig. 2-6). 


Figure 2-6 » A function f(x) and its mirror image reflected at x = L/2 (the imaginary reflecting plane). 

The kinetic part of the hamiltonian, T, is unchanged by R: 

$$RT = R\left[-\frac{h^2}{8\pi^2 m}\frac{d^2}{dx^2}\right] = -\frac{h^2}{8\pi^2 m}\frac{d^2}{d(-x+L)^2} = -\frac{h^2}{8\pi^2 m}\frac{d^2}{dx^2} = T \tag{2-16}$$

where we have used the fact that L is constant and d/d(−x) = −d/dx. That the 
potential part of H is unchanged by reflection through L/2 is easily seen; the identical 
infinite barriers merely interchange position. Therefore, RT = T and RV = V, and 
RH = R(T + V) = RT + RV = T + V = H. 

Now let us see what this means for eigenfunctions of H. Assume we have a nor- 
malized eigenfunction v 


HW=Ew (2-17) 


The two sides of Eq. (2-17) will still be equal if we reflect our coordinate system 
throughout the equation. (If two functions are identical in one coordinate system, say 
a right handed system, then they are identical in any coordinate system.) Therefore,” 


(RA)(Ry) = CRE)(RV) (2-18) 


But £ is simply a constant, and so it is immune to R. Furthermore, we have just seen 
that RH — H. Therefore, 


H(Ry) = E(Ry) Q-19) 


which shows that the function Ry is an eigenfunction of H with the same eigenvalue 
as y. 

We have already mentioned that the eigenfunctions of this system are nondegenerate 
with respect to energy. This is equivalent to saying that no two linearly independent 
eigenfunctions having the same eigenvalue exist for this system. But we have just shown 
that y and Rw are both eigenfunctions having the same eigenvalue E. Therefore, we 
are forced to conclude that y and Ry are linearly dependent, that is, 


Ry —cy (2-20) 


where c is a constant. A moment’s thought shows that $R\psi$ must still be normalized
(since reflecting a function does not change its area or the area under its square), and
it also must still be real (since reflecting a real function does not introduce imaginary
character). Therefore,


$$\int_0^L (R\psi)^2\,dx = 1 = \int_0^L (c\psi)^2\,dx = c^2\int_0^L \psi^2\,dx = c^2 \tag{2-21}$$


where we have made use of the fact that $\psi$ is normalized. Since $c^2 = 1$, we have $c = \pm 1$ and


$$R\psi = \pm\psi \tag{2-22}$$


²The parentheses in Eq. (2-18) are meant to show the restricted extent of operation of R. This is a departure
from the usual mathematical notational convention, but it is hoped that this temporary departure results in greater
clarity for the student.

When $R\psi = +\psi$, as is the case for $\psi_1$ or $\psi_3$ (Fig. 2-4), $\psi$ is said to be symmetric, or
even, for reflection. If $R\psi = -\psi$, as for $\psi_2$, $\psi$ is said to be antisymmetric, or odd.
(A function that is neither symmetric nor antisymmetric is said to be unsymmetric, or
asymmetric. Be careful to avoid confusing "asymmetric" with "antisymmetric.")

We have proved a very important property of wavefunctions. In general, if $\psi$ is the
wavefunction for a nondegenerate state, it must be symmetric or antisymmetric under
any transformation that leaves H unchanged.
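The reflection property can be checked numerically. The sketch below assumes the standard normalized box eigenfunctions, $\psi_n = \sqrt{2/L}\,\sin(n\pi x/L)$, and a box length of L = 1 in arbitrary units; the function names are illustrative only. It applies R by evaluating $\psi_n$ at $-x + L$ and recovers the constant c of Eq. (2-20).

```python
import math

L = 1.0  # box length in arbitrary units (an assumption for illustration)

def psi(n, x):
    """Normalized particle-in-a-box eigenfunction sqrt(2/L) sin(n pi x / L)."""
    return math.sqrt(2.0 / L) * math.sin(n * math.pi * x / L)

def reflection_constant(n, samples=200):
    """Return c such that R psi_n = c psi_n, where (R f)(x) = f(-x + L)."""
    ratios = []
    for i in range(1, samples):
        x = L * i / samples
        value = psi(n, x)
        if abs(value) > 1e-6:              # skip points sitting on a node
            ratios.append(psi(n, L - x) / value)
    # For a nondegenerate eigenfunction the ratio is the same at every x.
    assert max(ratios) - min(ratios) < 1e-9
    return round(sum(ratios) / len(ratios))

for n in range(1, 5):
    c = reflection_constant(n)
    print(n, "symmetric" if c == 1 else "antisymmetric")
```

Odd n should come out symmetric and even n antisymmetric, in line with Fig. 2-4.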


2-2.D Orthogonality of Wavefunctions 


It is possible to show that integration over the product of two different particle-in-a-box
eigenfunctions, $\psi_n$ and $\psi_m$, must give zero as the result:


$$\int_0^L \psi_n \psi_m\,dx = 0, \qquad n \neq m \tag{2-23}$$


When functions have this property—that their product gives zero when integrated over 
the entire range of coordinates—they are said to be orthogonal. (Since the “box” 
eigenfunctions vanish for x < 0 or x > L, integration from 0 to L suffices.)

We can use symmetry arguments to demonstrate orthogonality among certain pairs
of “box” eigenfunctions, for example, $\psi_1$ and $\psi_2$. Figure 2-7 shows that, since $\psi_1$
is symmetric and $\psi_2$ is antisymmetric for reflection, the product of these functions is
antisymmetric. (In fact, it is not difficult to show in general that the product of two
symmetric or of two antisymmetric functions is symmetric, and that an antisymmetric
function times a symmetric function gives an antisymmetric product. See Problem 2-7.)
Integration over an antisymmetric function must give zero as the result since an
antisymmetric function has to have equal amounts of positive and negative area. Therefore, $\psi_1$
and $\psi_2$ are orthogonal “by symmetry” as, indeed, are all the symmetric-antisymmetric
pairs of wavefunctions. Since all $\psi$’s having odd quantum number n are symmetric,
and all $\psi$’s having even n are antisymmetric, we have used symmetry to prove $\psi_n$ and
$\psi_m$ orthogonal for n even and m odd. To show orthogonality for n and m both even or
both odd requires doing the integral out explicitly (Problem 2-9).

Figure 2-7 » $\psi_1$ is symmetric, $\psi_2$ is antisymmetric, and $\psi_1\psi_2$ is antisymmetric. The total signed
area bounded by the odd functions is zero since complete cancellation of positive and negative
components occurs.

Figure 2-8 » The potential for a one-dimensional “box” with one infinite barrier at x < 0 and a
barrier of V = U at x > L. (Region I is the interior of the box, where V = 0; region II is x > L, where V = U.)

The eigenfunctions (2-11) are orthogonal to each other and individually normalized,
and we refer to them as orthonormal functions. Mathematically, this is summarized as


$$\int_0^L \psi_n \psi_m\,dx = \delta_{n,m} \equiv \begin{cases} 0, & n \neq m \\ 1, & n = m \end{cases} \tag{2-24}$$


The quantity $\delta_{n,m}$ is called the Kronecker delta function, and it is merely a convenient
shorthand for the information in the braces.
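Equation (2-24) is easy to confirm by numerical integration. The following sketch again assumes $\psi_n = \sqrt{2/L}\,\sin(n\pi x/L)$ with L = 1 (arbitrary units) and uses a composite Simpson rule; the helper name `overlap` is hypothetical.

```python
import math

L = 1.0  # box length, arbitrary units (assumed)

def psi(n, x):
    """Normalized particle-in-a-box eigenfunction."""
    return math.sqrt(2.0 / L) * math.sin(n * math.pi * x / L)

def overlap(n, m, steps=2000):
    """Composite Simpson estimate of the integral of psi_n psi_m from 0 to L."""
    h = L / steps                      # steps must be even for Simpson's rule
    total = psi(n, 0.0) * psi(m, 0.0) + psi(n, L) * psi(m, L)
    for i in range(1, steps):
        weight = 4.0 if i % 2 else 2.0
        total += weight * psi(n, i * h) * psi(m, i * h)
    return total * h / 3.0

# Rows of the overlap matrix for n, m = 1..3; it should approximate the identity.
for n in range(1, 4):
    print([f"{overlap(n, m):+.6f}" for m in range(1, 4)])
```

Note that `overlap(1, 3)` vanishes even though the integrand is symmetric: symmetry guarantees a zero integral only for antisymmetric integrands, and the both-odd or both-even cases have to be integrated explicitly.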


EXAMPLE 2-3 $\psi_1$ and $\psi_3$ are both symmetric functions. Therefore their product,
$\psi_1\psi_3$, is symmetric. How, then, can the integral of this product vanish?


SOLUTION » A sketch of the product of these functions shows it to be symmetric, with negative
values in the central region and wings of positive values on each side. Since $\psi_1$ and $\psi_3$ are known to
be orthogonal, the negative region must exactly cancel the sum of the two positive regions (though
we wouldn't know this for sure from a sketch). This shows that, whereas the integral over an
antisymmetric integrand must equal zero, an integral over a symmetric (or unsymmetric) integrand
may or may not equal zero.


2-3 The Particle in a One-Dimensional “Box” with One Finite Wall


Let us now modify the system just discussed by lowering the potential on one side of
the “box” to some finite value U. The resulting potential is shown in Fig. 2-8. We can
think of a bead on a wire encountering infinite repulsion at x = 0 and finite repulsion
for x > L. As before, it is convenient to break up the problem into separate regions of
x. For the region x < 0 where V is infinite, $\psi$ must be zero for the same reasons as
before (Section 2-1).

When the particle is in region I of Fig. 2-8, V = 0 and all is identical to our earlier
box. Therefore, in this region we will have harmonic waves of the general form


$$\psi_{\mathrm{I}} = A_{\mathrm{I}}\sin(2\pi x/\lambda_{\mathrm{I}}) + B_{\mathrm{I}}\cos(2\pi x/\lambda_{\mathrm{I}}) \tag{2-25}$$

where we have used the form (1-24) in which the wavelength appears explicitly. As
before, the boundary condition that $\psi$ vanish at x = 0 forces $B_{\mathrm{I}}$ to vanish, leaving


$$\psi_{\mathrm{I}} = A_{\mathrm{I}}\sin(2\pi x/\lambda_{\mathrm{I}}) \tag{2-26}$$


For the moment we have no other boundary condition on $\psi_{\mathrm{I}}$ because we do not know
that $\psi$ equals zero at the finite barrier. We do know, however, that the wavelength $\lambda_{\mathrm{I}}$,
whatever it turns out to be, will be related to the energy through


$$\lambda_{\mathrm{I}} = h/\sqrt{2m(E - V_{\mathrm{I}})} \tag{2-27}$$


and, since $V_{\mathrm{I}} = 0$ (in region I),

$$\lambda_{\mathrm{I}} = h/\sqrt{2mE} \tag{2-28}$$


which is a real number for positive E.
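We now turn to region II. Since V is constant here also, $\psi$ will again be a harmonic
wave. As before, we have our choice of two general forms:


$$\psi_{\mathrm{II}} = A_{\mathrm{II}}\sin(2\pi x/\lambda_{\mathrm{II}}) + B_{\mathrm{II}}\cos(2\pi x/\lambda_{\mathrm{II}}) \tag{2-29}$$

or [see Eq. (2-3)]

$$\psi_{\mathrm{II}} = C_{\mathrm{II}}\exp(+2\pi i x/\lambda_{\mathrm{II}}) + D_{\mathrm{II}}\exp(-2\pi i x/\lambda_{\mathrm{II}}) \tag{2-30}$$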


There are two possibilities for the energy of the particle: E < U and E > U. The first 
of these corresponds to the classical situation where the particle has insufficient energy 
to escape from the box and get into region II. Let us see what quantum mechanics says 
about this case in region II. 

For this case, $\lambda_{\mathrm{II}}$ is imaginary since


$$\lambda_{\mathrm{II}} = h/\sqrt{2m(E - U)} \tag{2-31}$$


and E − U is negative. Because $\lambda_{\mathrm{II}}$ is imaginary, it is more convenient to use the
general form (2-30) because then the i in the exponential argument can combine with
the i of $\lambda_{\mathrm{II}}$ to produce a real argument. Let us assume that $\lambda_{\mathrm{II}}$ is equal to i times a
positive number. (This will not affect our results.)

Let us now examine the properties of the two exponential functions in Eq. (2-30).
The first exponential has an argument that is real (because the i's cancel) and positive
(because of our above assumption). As x increases, this exponential increases rapidly,
approaching infinity. Since acceptable functions do not blow up like this, we set $C_{\mathrm{II}}$
equal to zero to prevent it. The second exponential has a negative, real argument, so it
decays exponentially toward zero as x approaches infinity. This is acceptable behavior,
and we are left with


$$\psi_{\mathrm{II}} = D_{\mathrm{II}}\exp(-2\pi i x/\lambda_{\mathrm{II}}) \tag{2-32}$$


We now have formulas describing fragments of the wavefunction for the two regions.
All that remains is to join these together at x = L in such a way that the resulting
wavefunction is continuous at x = L and has a continuous first derivative there. (Recall
from Section 2-1 that this second requirement results from the fact that the potential is
finite at x = L. Hence, $\psi$ must be smooth at x = L.)
The continuity requirement gives


$$A_{\mathrm{I}}\sin(2\pi L/\lambda_{\mathrm{I}}) = D_{\mathrm{II}}\exp(-2\pi i L/\lambda_{\mathrm{II}}) \tag{2-33}$$


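Taking the derivatives of $\psi_{\mathrm{I}}$ and $\psi_{\mathrm{II}}$ and setting these equal at x = L (to force
smoothness) gives


$$(2\pi/\lambda_{\mathrm{I}})\,A_{\mathrm{I}}\cos(2\pi L/\lambda_{\mathrm{I}}) = (-2\pi i/\lambda_{\mathrm{II}})\,D_{\mathrm{II}}\exp(-2\pi i L/\lambda_{\mathrm{II}}) \tag{2-34}$$


The exponential term is common to both Eqs. (2-33) and (2-34), providing the basis
for another equality:


$$A_{\mathrm{I}}\sin(2\pi L/\lambda_{\mathrm{I}}) = (-A_{\mathrm{I}}\lambda_{\mathrm{II}}/i\lambda_{\mathrm{I}})\cos(2\pi L/\lambda_{\mathrm{I}}) \tag{2-35}$$

or

$$\tan(2\pi L/\lambda_{\mathrm{I}}) = i\lambda_{\mathrm{II}}/\lambda_{\mathrm{I}} \tag{2-36}$$

Substituting for $\lambda_{\mathrm{I}}$ and $\lambda_{\mathrm{II}}$ as indicated by Eqs. (2-28) and (2-31) gives

$$\tan(2\pi L\sqrt{2mE}/h) = -\sqrt{E}/\sqrt{U - E} \tag{2-37}$$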


The only unknown in Eq. (2-37) is the total energy E. For given values of L, m, and 
U, only certain values of E < U will satisfy Eq. (2-37). Thus, the particle can have 
only certain energies when it is trapped in the “box.” These allowed energies can be 
found by graphing the left-hand side and right-hand side of Eq. (2-37) as functions of 
E. The values of E where the plots intersect satisfy Eq. (2-37). Figure 2-9 illustrates 
the graphical solution of Eq. (2-37) for a particular set of values for L, m, and U. 

Once a value of E is selected, $\lambda_{\mathrm{I}}$ and $\lambda_{\mathrm{II}}$ are known [from Eqs. (2-28) and (2-31)] and
it remains only to find $A_{\mathrm{I}}$ and $D_{\mathrm{II}}$. The ratio $A_{\mathrm{I}}/D_{\mathrm{II}}$ may be found from Eq. (2-33). The
numerical values of $A_{\mathrm{I}}$ and $D_{\mathrm{II}}$ will then be obtainable if we require that the
wavefunction be normalized. A set of such solutions is shown in Fig. 2-10.
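The graphical solution of Eq. (2-37) can also be carried out numerically. The sketch below is one possible approach, not the method used to prepare Fig. 2-9: it takes L, m, and U from the caption of Fig. 2-9, multiplies Eq. (2-37) through by $\sqrt{U-E}\,\cos(2\pi L\sqrt{2mE}/h)$ to remove the poles of the tangent, and locates the sign changes by a grid scan followed by bisection. The function names and the grid size are assumptions made for illustration.

```python
import math

H = 6.626e-34   # Planck constant, J s
M = 9.11e-31    # particle mass, kg (Fig. 2-9)
L = 2.50e-9     # box width, m (Fig. 2-9)
U = 16.02e-20   # height of the finite wall, J (Fig. 2-9)

def mismatch(E):
    """sqrt(U - E) sin(kL) + sqrt(E) cos(kL), with kL = 2 pi L sqrt(2mE) / h.

    This is Eq. (2-37) multiplied through by sqrt(U - E) cos(kL), so it is
    zero at the allowed energies but has no poles on 0 < E < U.
    """
    kL = 2.0 * math.pi * L * math.sqrt(2.0 * M * E) / H
    return math.sqrt(U - E) * math.sin(kL) + math.sqrt(E) * math.cos(kL)

def bisect(f, lo, hi, iterations=100):
    """Shrink a bracket [lo, hi] on which f changes sign."""
    f_lo = f(lo)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_mid > 0.0) == (f_lo > 0.0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def bound_state_energies(n_grid=4000):
    """Scan 0 < E < U for sign changes of mismatch and refine each one."""
    grid = [U * (i + 0.5) / n_grid for i in range(n_grid)]
    return [bisect(mismatch, a, b)
            for a, b in zip(grid, grid[1:])
            if mismatch(a) * mismatch(b) < 0.0]

ENERGIES = bound_state_energies()
for n, energy in enumerate(ENERGIES, start=1):
    print(n, f"{energy / 1e-20:.3f} x 10^-20 J")
```

The roots found this way can be compared with the intersections quoted in the caption of Fig. 2-9.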

Before solving for the case where E > U, let us discuss in detail the results just 
obtained. 

In the first place, the energies are quantized, much as they were in the infinitely deep 
square well. There is some difference, however. In the infinitely deep well or box, the 
energy levels increased with the square of the quantum number n. Here they increase 
less rapidly (the dashed lines in Fig. 2-10 show the allowed energy levels which result
when $U = \infty$) because the barrier becomes effectively less restrictive for particles with
higher energies (see the following). For the lowest solution, for example, slightly less 
than one-half a sine wave is needed in one box width of distance. Thus, the wavelength 
here is slightly longer than in an infinitely deep well of equal width, and so, by de 
Broglie's relation, the energy is slightly lower. Notice that the effect of lowering the 
height of one wall is least for the levels lying deepest in the well. 

The solutions sketched in Fig. 2-10 indicate that there is a finite probability for 
finding the particle in the region x > L even though it must have a negative kinetic 
energy there. Thus, quantum mechanics allows the particle to penetrate into regions 
where classical mechanics claims it cannot go. Notice that the penetration becomes 
more appreciable as the energy of the particle approaches that of the barrier. This results
from the fact that E − U determines the rate at which the exponential in $\psi_{\mathrm{II}}$ decays
[see Eqs. (2-31) and (2-32)]. In the limit that $U \to \infty$, the wavefunction vanishes at
the barrier, in agreement with the results of the infinite square well of Section 2-1.

Figure 2-9 » Graphical solution of the equation $-\tan(2\pi L\sqrt{2mE}/h) = \sqrt{E}/\sqrt{U - E}$, plotted against E in units of $10^{-20}$ J. Here
L = 2.50 nm, m = $9.11 \times 10^{-31}$ kg, U = 1 eV = $16.02 \times 10^{-20}$ J. Intersections occur at E =
$0.828 \times 10^{-20}$ J, $3.30 \times 10^{-20}$ J, $7.36 \times 10^{-20}$ J and $12.8 \times 10^{-20}$ J.

If the barrier in Fig. 2-8 has finite thickness (V becomes zero again at, say, x = 2L), 
then there is a finite probability that a particle in the well will penetrate through the 
barrier and appear on the other side. This phenomenon is called quantum-mechanical 
tunneling, and this is the way, for example, an $\alpha$ particle escapes from a nucleus even
though it classically lacks sufficient energy to overcome the attractive nuclear forces. 
We emphasize that the tunneling referred to in this example is really not a stationary 
state phenomenon. We have an initial condition (particle in the well) and ask what the 
half-life is for the escape of the particle—a time-dependent problem. 

We saw earlier that the energy quantization for the particle in the infinitely deep well 
could be thought of as resulting from fitting integral numbers of half sine waves into 
a fixed width. Most sine waves just will not fit perfectly, and so most energies are not 
allowed. In this problem the waves are allowed to leak past one of the well walls, but 
we can still see why only certain energies are allowed. Suppose that we pick some
arbitrary energy E for the particle. We know that $\psi$ must be zero at the left wall of the
well where $V = \infty$. Starting there, we can draw a sine wave of wavelength determined
by E across the well to the right wall, as shown in Fig. 2-11. When the wave hits the 
right wall, it must join on smoothly to a decaying exponential, which also depends on 
E. Most of the time, it will be impossible to effect a smooth junction, and that particular 
value of E will be disallowed. 

Figure 2-10 » Solutions for particle in well with one finite wall (see Fig. 2-9 for details). Dashed
lines correspond to energy levels which would exist if $U = \infty$.

Let us now consider the case where E > U. In region I, the considerations are the
same as before. Then, $\psi_{\mathrm{I}}$ is a sine wave that can be drawn from the left wall and has a
wavelength determined by E (= T) from de Broglie’s relation. This sine wave arrives
at x = L with a certain magnitude and a certain derivative (assuming that the multiplier
$A_{\mathrm{I}}$ has been fixed at some arbitrary value). In region II, we also have a solution of the
usual form


$$\psi_{\mathrm{II}} = A_{\mathrm{II}}\sin(2\pi x/\lambda_{\mathrm{II}}) + B_{\mathrm{II}}\cos(2\pi x/\lambda_{\mathrm{II}}) \tag{2-38}$$


where $\lambda_{\mathrm{II}}$ is real and determined by E − U, which is now positive. The question is, can
we always adjust $\psi_{\mathrm{II}}$ (by changing $A_{\mathrm{II}}$ and $B_{\mathrm{II}}$) so that it has the same value and slope
at x = L that $\psi_{\mathrm{I}}$ has? A little thought shows that such adjustment is indeed always
possible. The two adjustments allowed in $\psi_{\mathrm{II}}$ correspond to a change of phase for $\psi_{\mathrm{II}}$
(a shift in the horizontal direction) and a change in amplitude for $\psi_{\mathrm{II}}$. The only thing
about $\psi_{\mathrm{II}}$ we cannot change is the wavelength, since this is determined by E − U. This
is just a physical description of the mathematical circumstance in which we have two
adjustable parameters and two requirements to fit: a soluble problem. The essential
difference between this case and that of the trapped particle is that here we have fewer
boundary conditions. Before, our square-integrability requirement was used to remove
a positive exponential term. That requirement is, in effect, a boundary condition ($\psi$
must vanish at $x = \infty$), and it led to energy quantization. Then we used the normality
requirement to achieve unique values for $A_{\mathrm{I}}$ and $D_{\mathrm{II}}$. In this case we cannot get a
square-integrable solution. $\psi_{\mathrm{II}}$ goes on oscillating as $x \to \infty$, and so we have no
boundary condition there. As a result, E is not quantized and $\psi$ is not normalizable,
so that only ratios of $A_{\mathrm{I}}$, $A_{\mathrm{II}}$, and $B_{\mathrm{II}}$ are obtainable.

Figure 2-11 » An example of partial wavefunctions for an arbitrary energy E. These functions
cannot be joined smoothly at x = L and so this value of E is not allowed.

The energy scheme for the particle in the potential well with one finite wall, then, is 
discrete when E < U, and continuous when E > U. 
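That a smooth junction is always possible when E > U can be made concrete. With $A_{\mathrm{I}}$ fixed, the two matching conditions at x = L are two linear equations in $A_{\mathrm{II}}$ and $B_{\mathrm{II}}$ of Eq. (2-38), and they can be solved in closed form for any E > U. The sketch below assumes the parameters of Fig. 2-9 and writes $k = 2\pi/\lambda$; the function names are hypothetical.

```python
import math

H = 6.626e-34   # Planck constant, J s
M = 9.11e-31    # particle mass, kg (Fig. 2-9)
L = 2.50e-9     # position of the finite wall, m (Fig. 2-9)
U = 16.02e-20   # barrier height, J (Fig. 2-9)

def wavenumbers(E):
    """Return (2 pi / lambda_I, 2 pi / lambda_II) for an energy E > U."""
    k1 = 2.0 * math.pi * math.sqrt(2.0 * M * E) / H
    k2 = 2.0 * math.pi * math.sqrt(2.0 * M * (E - U)) / H
    return k1, k2

def match_region_two(E, a_one=1.0):
    """Return (A_II, B_II) that join psi_II to psi_I = A_I sin(k1 x) at x = L."""
    k1, k2 = wavenumbers(E)
    value = a_one * math.sin(k1 * L)           # psi_I at the wall
    slope = a_one * k1 * math.cos(k1 * L)      # d(psi_I)/dx at the wall
    a_two = value * math.sin(k2 * L) + (slope / k2) * math.cos(k2 * L)
    b_two = value * math.cos(k2 * L) - (slope / k2) * math.sin(k2 * L)
    return a_two, b_two

def wall_residuals(E, a_one=1.0):
    """Mismatch in value and in slope (scaled by k1) at x = L; both should be ~0."""
    k1, k2 = wavenumbers(E)
    a_two, b_two = match_region_two(E, a_one)
    psi_one = a_one * math.sin(k1 * L)
    psi_two = a_two * math.sin(k2 * L) + b_two * math.cos(k2 * L)
    dpsi_one = a_one * k1 * math.cos(k1 * L)
    dpsi_two = k2 * (a_two * math.cos(k2 * L) - b_two * math.sin(k2 * L))
    return psi_two - psi_one, (dpsi_two - dpsi_one) / k1

for factor in (1.1, 1.5, 2.0, 3.7):
    print(factor, wall_residuals(factor * U))
```

Every energy above U yields a valid pair $(A_{\mathrm{II}}, B_{\mathrm{II}})$, which is the numerical counterpart of the statement that the spectrum is continuous for E > U. Only the ratios to $A_{\mathrm{I}}$ are meaningful, since `a_one` is arbitrary.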

Notice the way in which the wavelengths vary in Fig. 2-10. We have already seen
that the time-independent Schrödinger equation states that the total energy for a particle
in a stationary state is the same at all particle positions (i.e., a constant of motion). The
kinetic and potential energies must vary together, then, in such a way that their sum is
constant. This is reflected by the fact that the wavelength of an unbound solution is
shorter in region I than it is in region II. In region I, V = 0, so that all energy of the
particle is kinetic (T = E). In region II, V > 0, so that the kinetic energy (T = E − V)
is less than it was in region I. Therefore, the de Broglie wavelength, which is related to
kinetic energy, must be greater in region II.


EXAMPLE 2-4 For the system described in the caption for Fig. 2-9, calculate the
percentage drop of the lowest-energy state that results from barrier penetration.


SOLUTION » For this, we need to solve the problem for the simple particle-in-a-box system
(for which $U = \infty$).

$$E_1 = \frac{n^2h^2}{8mL^2} = \frac{(1)^2(6.626\times10^{-34}\ \mathrm{J\,s})^2}{8(9.11\times10^{-31}\ \mathrm{kg})(2.500\times10^{-9}\ \mathrm{m})^2} = 9.64\times10^{-21}\ \mathrm{J}$$

compared to $8.28 \times 10^{-21}$ J. Barrier penetration lowers $E_1$ by 14%.
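The arithmetic of this example can be reproduced in a few lines. The sketch uses the constants quoted in the caption of Fig. 2-9 (L = 2.50 nm) and the lowest intersection energy given there; the variable names are illustrative.

```python
H = 6.626e-34          # Planck constant, J s
M = 9.11e-31           # particle mass, kg (Fig. 2-9)
L = 2.50e-9            # box width, m (Fig. 2-9)
E_FINITE = 8.28e-21    # lowest level with the finite wall, J (Fig. 2-9)

# Lowest level of the infinitely deep box, E_1 = h^2 / (8 m L^2).
e_infinite = H ** 2 / (8.0 * M * L ** 2)

# Fractional lowering caused by penetration into the finite wall.
percent_drop = 100.0 * (e_infinite - E_FINITE) / e_infinite

print(f"E1 (infinite box) = {e_infinite:.3e} J, drop = {percent_drop:.0f}%")
```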
2-4 The Particle in an Infinite “Box” with a Finite Central Barrier


Another example of barrier penetration in a stationary state of a system is provided by 
inserting a barrier of finite height and thickness at the midpoint of the infinite square 
well of Section 2-1 (see Fig. 2-12). 

The boundary conditions for this problem are easily obtained by obvious extensions 
of the considerations already discussed. Rather than solve this case directly, we shall 
make use of our insights from previous systems to deduce the main characteristics of 
the solutions. Let us begin by considering the case where the barrier is infinitely high. 


Then the problem becomes merely that of two isolated infinite square wells, each well
having solutions as described in Sections 2-1 and 2-2.

Figure 2-12 » (a) Solutions for identical infinite square wells. (b) Effect of finite partition on half
waves. (c) Symmetric combination of half waves. (d) Antisymmetric combination of half waves.

Now, as the height of the barrier is lowered from infinity, what happens? The levels 
lying deepest in the two sections should be least affected by the change. They must 
still vanish at the outer walls but now they can penetrate slightly into the finite barrier. 
Thus, the lowest state in, say, the left-hand section of the well will begin to look as 
given in Fig. 2-12b. The solution on the right side will do likewise, of course. As 
this happens, their energies will decrease slightly since their wavelengths increase. 
However, since the two wells are no longer separated by an infinite barrier, they are no 
longer independent. We can no longer talk about separate solutions for the two halves. 
Each solution for the Schrödinger equation is now a solution for the whole system from
$x = -L$ to $+L$. Furthermore, symmetry arguments state that, since the hamiltonian for
this problem is symmetric for reflection through x = 0, the solutions, if nondegenerate,
must be either symmetric or antisymmetric through x = 0.

This requirement must be reconciled with the barrier-penetration behavior indicated 
by Fig. 2-12b, which is also occurring. One way to accomplish this is by summing the 
two half waves as shown in Fig. 2-12c, giving a symmetric wavefunction. Alternatively, 
subtraction gives the antisymmetric form shown in Fig. 2-12d. Both of these solutions 
will be lower in energy than their infinite-well counterparts, because the wavelengths 
in Fig. 2-12c and d continue to be longer than in 2-12a. Will their energies be equal to 
each other? Not quite. By close inspection, we can figure out which solution will have 
the lower energy. 

In Figs. 2-12b to 2-12d, the slopes of the half wave, the symmetric, and the antisymmetric
combinations at the finite barrier are labeled respectively m, m', and m". What
can we say about their relative values? The slope m' should be less negative than m 
because the decaying exponential producing m has an increasing exponential added to 
it when producing m’. Slope m" should be more negative than m since the decaying 
exponential has an increasing exponential subtracted from it in case d, causing it to 
decay faster. This means that the sine curve on the left-hand side of Fig. 2-12c cannot 
be identical with that on the left side of Fig. 2-12d since they must arrive at the barrier 
with different slopes. (The same is true for the right-hand sides, of course.) How 
can we make the sine wave arrive with a less negative slope m'?—by increasing the 
wavelength slightly so that not quite so much of the sine wave fits into the left well 
(see Fig. 2-13a). Increasing the wavelength slightly means, by de Broglie’s relation, 
that the energy of the particle is decreased. Similarly, the sine curve in Fig. 2-12d must 
be shortened so that it will arrive at the barrier with slope m", which corresponds to 
an energy increase. Of course, now that the energy has changed outside the barrier, it 
must change inside the barrier too. This would require going back and modifying the 
exponentials inside the barrier. But the first step is sufficient to indicate the qualitative 
results: The symmetric solution has lower energy. In Fig. 2-13a is a detailed sketch of 
the final solution for the two lowest states. 
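The qualitative ordering argued above can be illustrated with a small calculation. The text does not solve this system explicitly, so everything in the sketch below is an assumption made for illustration: dimensionless units in which $h^2/8\pi^2 m = 1$ (so that $E = k^2$), two wells of width W, a central barrier of half-thickness A and height U, a sine wave in each well that vanishes at the outer wall, and a cosh (symmetric) or sinh (antisymmetric) function inside the barrier. Matching value and slope at the barrier edge then gives one condition for each symmetry.

```python
import math

W = 1.0     # width of each well (assumed, dimensionless)
A = 0.25    # half-thickness of the central barrier (assumed)
U = 100.0   # barrier height in units where h^2 / (8 pi^2 m) = 1 (assumed)

def symmetric_condition(E):
    """Zero when the outer sine wave joins smoothly to cosh(kappa x) in the barrier."""
    k, kappa = math.sqrt(E), math.sqrt(U - E)
    return k * math.cos(k * W) + kappa * math.tanh(kappa * A) * math.sin(k * W)

def antisymmetric_condition(E):
    """Zero when the outer sine wave joins smoothly to sinh(kappa x) in the barrier."""
    k, kappa = math.sqrt(E), math.sqrt(U - E)
    return k * math.tanh(kappa * A) * math.cos(k * W) + kappa * math.sin(k * W)

def bisect(f, lo, hi, iterations=200):
    """Shrink a bracket [lo, hi] on which f changes sign."""
    f_lo = f(lo)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_mid > 0.0) == (f_lo > 0.0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Lowest level of one isolated well of width W (infinitely high barrier).
E_INFINITE = (math.pi / W) ** 2

# With a finite barrier the lowest pair lies below E_INFINITE, so bracket there.
E_SYM = bisect(symmetric_condition, 1e-9, E_INFINITE)
E_ANTI = bisect(antisymmetric_condition, 1e-9, E_INFINITE)

print(f"symmetric {E_SYM:.4f}  antisymmetric {E_ANTI:.4f}  isolated well {E_INFINITE:.4f}")
```

Under these assumptions both levels fall below the isolated-well value and the symmetric level is the lower of the two, as the slope argument predicts. Lowering `U` or `A` should widen the splitting.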

There is a simpler way to decide that the symmetric solution has lower energy. As 
the barrier height becomes lower and lower, the two solutions become more and more 
separated in energy, but they always remain symmetric or antisymmetric with respect to 
reflection since the hamiltonian always has reflection symmetry. In the limit when the 
barrier completely disappears we have a simple square well again (but larger), the lowest 
solution of which is symmetric. (See Fig. 2-13b.) This lowest symmetric solution must
"come from" the symmetric combination of smaller-well wavefunctions sketched at the
left of Fig. 2-13b; similarly, the second lowest, antisymmetric solution of the large well
correlates with the antisymmetric small-well combination (also at left in Fig. 2-13b).
A figure of the kind shown in Fig. 2-13b is called a correlation diagram. It shows how
the energy eigenvalues change throughout a continuous, symmetry-conserving process.
We shall see that the correlation of wavefunction symmetries in such a manner as this
is a powerful technique in understanding and predicting chemical behavior.

Figure 2-13 » (a) Detailed sketch of the two lowest solutions for the infinite square-well divided
by a finite barrier at the midpoint. The waves are sketched from a common energy value for ease of
comparison. Actually, the symmetric wave has a lower energy. (b) A correlation diagram relating
energies when the barrier is infinite (left side) with those when the barrier vanishes. Letters A and S
refer to antisymmetric and symmetric solutions, respectively.

The splitting of energy levels resulting from barrier penetration is an extremely 
pervasive phenomenon in quantum chemistry. It occurs regardless of whether the barrier 
separates identical or nonidentical potential regions, i.e., regardless of whether the final 
system is symmetric or unsymmetric. When two atoms (N and N, or C and O) interact 
to form a molecule, the original atomic wavefunctions combine to form molecular 
wavefunctions in much the same way as was just described. One of these molecular 
wavefunctions may have an energy markedly lower than those in the corresponding 
atoms. Electrons having such a wavefunction will stabilize the molecule relative to the 
separated atoms. 

Another case in which energy level splitting occurs is in the vibrational spectrum 
of ammonia. Ammonia is most stable in a pyramidal configuration, but is capable 
of inverting through a higher-energy planar configuration into an equivalent "mirror 
image" pyramid. Thus, vibrations tending to flatten out the ammonia molecule occur 
in a potential similar to the double well, except that in ammonia the potential is not 
discontinuous. The lowest vibrational energy levels are not sufficiently high to allow 
classical inversion of ammonia. However, these vibrational levels are split by
interaction through barrier penetration just as quantum mechanics predicts. The energy
required to excite ammonia from the lowest of these sublevels to its associated sublevel 
can be accurately measured through microwave spectroscopy. Knowledge of the level 
splittings in turn allows a precise determination of the height of the barrier to inversion 
in ammonia (see Fig. 2-14). 

It is easy to anticipate the appearance of the solutions for the square well with central
barrier for energies greater than the partition height. They will be sinusoidal waves, 
symmetric or antisymmetric in the well, and vanishing at the walls. Their wavelengths 
will be somewhat longer in the region of the partition than elsewhere because some of 
the kinetic energy of the particle is transformed to potential energy there. A sketch of 
the final results is given in Fig. 2-15. 


EXAMPLE 2-5 Fig. 2-15 shows energy levels for states when the barrier has finite
height. When the barrier is made infinitely high, the levels at $E_1$ and $E_2$ merge
into one level. Where does the energy of that one level lie (below $E_1$, between
$E_1$ and $E_2$, or above $E_2$), and why?


SOLUTION » It lies above $E_2$. When the barrier is finite, there is always some penetration, so
$\lambda$ is always at least a little larger than is the case for the infinite barrier. If $\lambda$ is larger, E is lower.


Figure 2-14 » Sketch of potential for inversion vibrational mode in ammonia. The lowest levels
are split by tunneling. The low energy transition $\Delta E_1$ is visible in the microwave region whereas the
second transition $\Delta E_2$ is visible in the infrared. $\Delta E_1 = 0.16 \times 10^{-22}$ J; $\Delta E_2 = 7.15 \times 10^{-22}$ J.


2-5 The Free Particle in One Dimension


Suppose a particle of mass m moves in one dimension in a potential that is everywhere
zero. The Schrödinger equation becomes


$$\frac{-h^2}{8\pi^2 m}\frac{d^2\psi}{dx^2} = E\psi \tag{2-39}$$

which has as solutions

$$\psi = A\exp(\pm 2\pi i\sqrt{2mE}\,x/h) \tag{2-40}$$

or alternatively, trigonometric solutions

$$\psi = A'\sin(2\pi\sqrt{2mE}\,x/h), \qquad \psi = A'\cos(2\pi\sqrt{2mE}\,x/h) \tag{2-41}$$


As is most easily seen from the exponential forms (2-40), if E is negative, $\psi$ will
blow up at either $+\infty$ or $-\infty$, and so we reject negative energies. Since there are
no boundary conditions, it follows that E can take on any positive value; the energies
of the free particle are not quantized. This result would be expected from our earlier
results on constrained particles. There we saw that quantization resulted from spatial
constraints, and here we have none.

The constants A and A′ of Eqs. (2-40) and (2-41) cannot be evaluated in the usual
way, since the solutions do not vanish at $x = \pm\infty$. Sometimes it is convenient to
evaluate them to correspond to some experimental situation. For instance, suppose that
one was working with a monoenergetic beam of electrons having an intensity of one
electron every $10^{-6}$ m. Then we could normalize $\psi$ of Eq. (2-40) so that


$$\int_0^{10^{-6}\,\mathrm{m}} |\psi|^2\,dx = 1$$


There is a surprising difference in the particle distributions predicted from
expressions (2-40) and (2-41). The absolute square $\psi^*\psi$ of the exponentials is a constant
($A^*A$), whereas the squares of the trigonometric functions are fluctuating functions
of x. It seems sensible for a particle moving without restriction to have a constant
probability distribution, and it seems absurd for it to have a varying probability
distribution. What causes this peculiar behavior? It results from there being two independent
solutions for each value of E (except E = 0). This degeneracy with respect to energy
means that from a degenerate pair, $\psi$ and $\psi'$, one can produce any number of new
eigenfunctions, $\psi'' = a\psi + b\psi'$ (Problem 2-11). In such a situation, the symmetry
proof of Section 2-2 does not hold. However, there will always be an independent pair
of degenerate wavefunctions that will satisfy certain symmetry requirements. Thus, in
the problem at hand, we have one pair of solutions, the exponentials, which do have
the proper symmetry since their absolute squares are constant. From this pair we can
produce any number of linear combinations [one set being given by Eq. (2-41)], but
these need not display the symmetry properties anymore.

Figure 2-15 » Wavefunctions for the infinite square well with finite partition.

The exponential solutions have another special attribute: A particle whose state is 
described by one of the exponentials has a definite linear momentum, whereas, when 
described by a trigonometric function, it does not. In Section 1-9, it was shown that 
the connection between classical and wave mechanics could be made if one related the 
classical momentum, $p_x$, with a quantum mechanical operator $(h/2\pi i)\,d/dx$. Now,
for a particle to have a definite (sharp) value p for its momentum really means that,
if we measure the momentum at some instant, there is no possibility of getting any
value other than p. This means that the particle in the state described by $\psi$ always has
momentum p, no matter where it is in x; i.e., its momentum is a constant of motion,
just as its energy is. This corresponds to saying that there is an eigenvalue equation for
momentum, just as for energy. Thus


$$\frac{h}{2\pi i}\frac{d\psi}{dx} = p\psi \tag{2-42}$$


The statement made earlier, that the exponential solutions correspond to the particle
having sharp momentum, means that the exponentials (2-40) must be solutions to
Eq. (2-42). This is easily verified:


$$\frac{h}{2\pi i}\frac{d}{dx}\left[A\exp\left(\frac{\pm 2\pi i\sqrt{2mE}\,x}{h}\right)\right] = \pm\sqrt{2mE}\,A\exp\left(\frac{\pm 2\pi i\sqrt{2mE}\,x}{h}\right)$$


Thus, the positive and negative exponential solutions correspond to momentum values
of $+\sqrt{2mE}$ and $-\sqrt{2mE}$, respectively, and are interpreted as referring to particle
motion toward $+\infty$ and $-\infty$ respectively. Since energy is related to the square of
the momentum, these two solutions have identical energies. (The solution for E = 0
corresponds to no momentum at all, and the directional degeneracy is removed.) A
mixture of these states contains contributions from two different momenta but only one
energy, so linear combinations of the exponentials fail to maintain a sharp value for
momentum but do maintain a sharp value for energy.
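The contrast between the exponential and trigonometric forms can be seen numerically by applying the momentum operator with a finite-difference derivative and dividing by the function itself. For an eigenfunction this ratio is the same constant at every x. The sketch below assumes an electron with E = 1 eV purely for illustration; the function names and the step size are hypothetical choices.

```python
import cmath
import math

H = 6.626e-34    # Planck constant, J s
M = 9.11e-31     # electron mass, kg (assumed particle)
E = 1.602e-19    # energy, J (1 eV, assumed)
P = math.sqrt(2.0 * M * E)   # magnitude of the momentum

def plane_wave(x, sign=+1):
    """exp(+/- 2 pi i sqrt(2mE) x / h), the form of Eq. (2-40) with A = 1."""
    return cmath.exp(sign * 2j * math.pi * P * x / H)

def cosine_wave(x):
    """cos(2 pi sqrt(2mE) x / h), one of the forms of Eq. (2-41) with A' = 1."""
    return math.cos(2.0 * math.pi * P * x / H)

def momentum_ratio(f, x, dx=1e-13):
    """(h / 2 pi i) f'(x) / f(x), with f' from a central difference."""
    derivative = (f(x + dx) - f(x - dx)) / (2.0 * dx)
    return (H / (2j * math.pi)) * derivative / f(x)

for x in (0.1e-9, 0.2e-9, 0.7e-9):
    forward = momentum_ratio(plane_wave, x) / P
    standing = momentum_ratio(cosine_wave, x) / P
    print(f"x = {x:.1e} m  exponential: {forward:.6f}  cosine: {standing:.6f}")
```

The exponential returns the same value, $+\sqrt{2mE}$, at every point, whereas the ratio for the cosine changes with x, so the cosine is not an eigenfunction of the momentum operator.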


EXAMPLE 2-6 An electron is accelerated along the x axis towards $x = \infty$ from rest
through a potential drop of 1.000 kV.


a) What is its final momentum?
b) What is its final de Broglie wavelength?
c) What is its final wavefunction?


SOLUTION » a) $p_x = \sqrt{2mE} = [2(9.105 \times 10^{-31}\ \mathrm{kg})(1.602 \times 10^{-16}\ \mathrm{J})]^{1/2} = 1.708 \times 10^{-23}\ \mathrm{kg\,m\,s^{-1}}$

b) $\lambda = \dfrac{h}{p} = \dfrac{6.626 \times 10^{-34}\ \mathrm{J\,s}}{1.708 \times 10^{-23}\ \mathrm{kg\,m\,s^{-1}}} = 3.879 \times 10^{-11}\ \mathrm{m}$

c) $\psi = A\exp(\pm 2\pi i\sqrt{2mE}\,x/h)$ (choose + because moving towards $x = +\infty$) $= A\exp(2\pi i p x/h)
= A\exp(2\pi i x/\lambda) = A\exp[2\pi i(2.578 \times 10^{10}\ \mathrm{m^{-1}})x]$.
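The numbers in this example follow directly from $p = \sqrt{2mE}$ and $\lambda = h/p$. A short check, using the constants quoted in the solution (variable names are illustrative):

```python
import math

H = 6.626e-34    # Planck constant, J s
M = 9.105e-31    # electron mass as used in the example, kg
E = 1.602e-16    # kinetic energy after a 1.000 kV drop, J

momentum = math.sqrt(2.0 * M * E)        # part (a), kg m / s
wavelength = H / momentum                # part (b), m
inverse_wavelength = 1.0 / wavelength    # coefficient of 2 pi i x in part (c), 1/m

print(f"p = {momentum:.3e} kg m/s")
print(f"lambda = {wavelength:.3e} m")
print(f"1/lambda = {inverse_wavelength:.3e} 1/m")
```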


2-6 The Particle in a Ring of Constant Potential


Suppose that a particle of mass m is free to move around a ring of radius r and zero
potential, but that it requires infinite energy to get off the ring. This system has only
one variable coordinate, the angle $\phi$. In classical mechanics, the useful quantities and
relationships for describing such circular motion are those given in Table 2-1.

TABLE 2-1 »
Quantity                        Formula                                 Units
Moment of inertia               $I = mr^2$                              g cm² or kg m²
Angular velocity                $\omega = \Delta\phi/\Delta t = v/r$    s⁻¹
Angular momentum (linear        $mvr = I\omega$                         g cm²/s or erg s or J s
momentum times orbit radius)

Comparing formulas for linear momentum and angular momentum reveals that the
variables mass and linear velocity are analogous to moment of inertia and angular
velocity in circular motion, where the coordinate $\phi$ replaces x. The Schrödinger equation
for circular motion, then, is


$$\frac{-h^2}{8\pi^2 I}\frac{d^2\psi(\phi)}{d\phi^2} = E\psi(\phi) \tag{2-43}$$

which has, as solutions

$$\psi = A\exp(\pm ik\phi) \tag{2-44}$$

or alternatively

$$\psi = A'\sin(k\phi) \tag{2-45}$$

and

$$\psi = A'\cos(k\phi) \tag{2-46}$$


where [substituting Eq. (2-45) or (2-46) into (2-43) and operating]

$$k = 2\pi\sqrt{2IE}/h \tag{2-47}$$

Let us solve the problem first with the trigonometric functions. Starting at some
arbitrary point on the ring and moving around the circumference with a sinusoidal
function, we shall eventually reencounter the initial point. In order that our wavefunction
be single valued, it is necessary that $\psi$ repeat itself every time $\phi$ changes by $2\pi$ radians.
Thus, for $\psi$ given by Eq. (2-45),

    $\sin(k\phi) = \sin[k(\phi + 2\pi)]$    (2-48)

Similarly, for $\psi$ given by Eq. (2-46),

    $\cos(k\phi) = \cos(k\phi + 2k\pi)$    (2-49)


Either of these relations is satisfied only if k is an integer. The case in which k = 0 is 
not allowed for the sine function since it then vanishes everywhere and is unsuitable. 
However, k = 0 is allowed for the cosine form. The normalized solutions are, then, 


    $\psi = (1/\sqrt{\pi}) \sin(k\phi)$,  k = 1, 2, 3, ...
    $\psi = (1/\sqrt{\pi}) \cos(k\phi)$,  k = 1, 2, 3, ...
    $\psi = 1/\sqrt{2\pi}$  (from the k = 0 case for the cosine)    (2-50)


Now let us examine the exponential form of $\psi$ (Eq. 2-44). The requirement that $\psi$
repeat itself for $\phi \to \phi + 2\pi$ gives

    $A \exp(\pm ik\phi) = A \exp[\pm ik(\phi + 2\pi)] = A \exp(\pm ik\phi) \exp(\pm 2\pi ik)$

or

    $\exp(\pm 2\pi ik) = 1$

Taking the positive case and utilizing Eq. (2-3), we obtain

    $\cos(2\pi k) + i \sin(2\pi k) = 1$,  or  $\cos(2\pi k) = 1$ and $\sin(2\pi k) = 0$


Again, k must be an integer. (The same result arises by requiring that $d\psi/d\phi$ repeat
for $\phi \to \phi + 2\pi$.) Thus, an alternative set of normalized solutions is

    $\psi = (1/\sqrt{2\pi}) \exp(ik\phi)$,  k = 0, ±1, ±2, ±3, ...    (2-51)

The energies for the particle in the ring are easily obtained from Eq. (2-47):

    $E = k^2 h^2 / 8\pi^2 I$,  k = 0, ±1, ±2, ±3, ...    (2-52)


The energies increase with the square of k, just as in the case of the infinite square 
well potential. Here we have a single state with E = 0, and doubly degenerate states 
above, whereas, in the square well, we had no solution at E = 0, and all solutions were 
nondegenerate. The solution at E = 0 means that there is no finite zero point energy
to be associated with free rotation, and this is in accord with uncertainty principle
arguments since there is no constraint in the coordinate $\phi$.
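The level pattern described above can be tabulated directly from Eq. (2-52). The following is a minimal sketch; the function names and the sample mass and radius (an electron on a ring of radius 1.39 Å) are illustrative assumptions, not values taken from the text.

```python
import math

H = 6.626e-34  # Planck constant, J s

def ring_energy(k, mass, radius):
    """Energy from Eq. (2-52): E = k^2 h^2 / (8 pi^2 I), with I = m r^2."""
    inertia = mass * radius ** 2
    return k ** 2 * H ** 2 / (8.0 * math.pi ** 2 * inertia)

def ring_levels(kmax, mass, radius):
    """Return (|k|, energy, degeneracy) for |k| = 0..kmax."""
    levels = []
    for n in range(kmax + 1):
        degeneracy = 1 if n == 0 else 2   # k = +n and k = -n share one energy
        levels.append((n, ring_energy(n, mass, radius), degeneracy))
    return levels

# Illustrative (assumed) values: electron mass, ring radius 1.39 angstrom
levels = ring_levels(3, 9.109e-31, 1.39e-10)
for n, energy, g in levels:
    print(n, f"{energy:.3e} J", g)
```

The output shows one state at E = 0 followed by doubly degenerate levels whose energies scale as $k^2$.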

The similarity between the particle in a ring and the free particle problems is
striking. Aside from the fact that in the ring the energies are quantized and the solutions
are normalizable, there are few differences. The exponential solutions (2-51) are
eigenfunctions for the angular momentum operator $(h/2\pi i)\, d/d\phi$. The two angular
momenta for a pair of degenerate solutions correspond to particle motion clockwise or
counterclockwise in the ring. (The nondegenerate solution for E = 0 has no angular
momentum, hence no ability to achieve degeneracy through directional behavior.) The
particle density predicted by the exponentials is uniform in the ring, while that for the
trigonometric solutions is not. Since the trigonometric functions tend to localize the
particle into part of the ring, thereby causing $\Delta\phi \neq \infty$, it is consistent that they are
impure momentum states ($\Delta$ ang. mom. $\neq 0$). (Infinite uncertainty in the coordinate
$\phi$ means that all values of $\phi$ in the range 0 to $2\pi$ are equally likely.)
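These statements can be checked numerically. The sketch below (function names are illustrative, and the derivative is estimated by a central difference rather than taken analytically) evaluates the density of an exponential solution at several angles and tests whether $(1/i)\, d\psi/d\phi$ is a constant multiple of $\psi$, i.e., whether $\psi$ is an angular momentum eigenfunction with eigenvalue in units of $h/2\pi$.

```python
import cmath
import math

def psi_exp(k, phi):
    """Normalized exponential solution, Eq. (2-51)."""
    return cmath.exp(1j * k * phi) / math.sqrt(2.0 * math.pi)

def psi_sin(k, phi):
    """Normalized sine solution, Eq. (2-50)."""
    return math.sin(k * phi) / math.sqrt(math.pi)

def ang_mom_ratio(psi, k, phi, step=1e-5):
    """Estimate [(1/i) d(psi)/d(phi)] / psi by central difference.
    The result is in units of h/2pi; it is independent of phi only
    if psi is an eigenfunction of the angular momentum operator."""
    dpsi = (psi(k, phi + step) - psi(k, phi - step)) / (2.0 * step)
    return dpsi / (1j * psi(k, phi))

angles = [0.3, 1.1, 2.0]
densities = [abs(psi_exp(2, a)) ** 2 for a in angles]
ratios_exp = [ang_mom_ratio(psi_exp, 2, a) for a in angles]
ratios_sin = [ang_mom_ratio(psi_sin, 2, a) for a in angles]
print(densities)    # the same value at every angle
print(ratios_exp)   # the same value (k) at every angle
print(ratios_sin)   # changes with angle: not an eigenfunction
```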


EXAMPLE 2-7 Demonstrate that any two degenerate exponential eigenfunctions
for a particle in a ring are orthogonal.


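SOLUTION » Such a pair of degenerate wavefunctions can be written as $\psi_+ = (1/\sqrt{2\pi}) \exp(ik\phi)$,
$\psi_- = (1/\sqrt{2\pi}) \exp(-ik\phi)$, with $k \neq 0$. These are orthogonal if $\int_0^{2\pi} \psi_+^* \psi_-\, d\phi = 0$, where we must use the complex
conjugate of either one of the wavefunctions since $\psi$ is complex. But $\psi_+^* = \psi_-$, so

    $\int_0^{2\pi} \psi_- \psi_-\, d\phi = \frac{1}{2\pi} \int_0^{2\pi} \exp(-2ik\phi)\, d\phi = \frac{1}{2\pi} \int_0^{2\pi} [\cos(2k\phi) - i \sin(2k\phi)]\, d\phi$

    $= \frac{1}{4\pi k} \left[ \sin(2k\phi) + i \cos(2k\phi) \right]_0^{2\pi}$

    $= \frac{1}{4\pi k} [\sin(4\pi k) - \sin(0) + i \cos(4\pi k) - i \cos(0)]$

    $= \frac{1}{4\pi k} [0 - 0 + i - i] = 0$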
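The same conclusion can be confirmed numerically. This is a small sketch, assuming a simple rectangle-rule sum over one full period (adequate here because the integrand is periodic); the function name `overlap` is chosen for illustration.

```python
import cmath
import math

def overlap(k1, k2, n=2000):
    """Approximate the integral over 0..2pi of conj(psi_k1) * psi_k2,
    where psi_k = exp(i k phi) / sqrt(2 pi), using an n-point rectangle rule."""
    dphi = 2.0 * math.pi / n
    total = 0j
    for j in range(n):
        phi = j * dphi
        total += cmath.exp(-1j * k1 * phi) * cmath.exp(1j * k2 * phi)
    return total * dphi / (2.0 * math.pi)

print(abs(overlap(3, -3)))   # degenerate pair: overlap vanishes
print(abs(overlap(3, 3)))    # same function: normalized to one
```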
2-7 The Particle in a Three-Dimensional Box: Separation of Variables


Let us now consider the three-dimensional analog of the square well of Section 2-1. 
This would be a three-dimensional box with zero potential inside and infinite potential 
outside. As before, the particle has no probability for penetrating beyond the box. 
Therefore, the Schrödinger equation is just

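    $-\frac{h^2}{8\pi^2 m} \left( \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2} \right) \psi = E\psi$    (2-53)

and $\psi$ vanishes at the box edges.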

The hamiltonian operator on the left side of Eq. (2-53) can be written as a sum of
operators, one in each variable [e.g., $H_x = (-h^2/8\pi^2 m)\, \partial^2/\partial x^2$]. Let us assume for
the moment that $\psi$ can be written as a product of three functions, each one being a
function of a different variable, x, y, or z [i.e., $\psi = X(x)Y(y)Z(z)$]. If we can show
that such a $\psi$ satisfies Eq. (2-53), we will have a much simpler problem to solve. Using
this assumption, Eq. (2-53) becomes


    $(H_x + H_y + H_z) XYZ = E\, XYZ$    (2-54)

This can be expanded and then divided through by XYZ to obtain

    $\frac{H_x XYZ}{XYZ} + \frac{H_y XYZ}{XYZ} + \frac{H_z XYZ}{XYZ} = E$  (a constant)    (2-55)

Now, since $H_x$, for example, operates only on functions of x, but not y or z, we
can carry out some limited cancellation. Those functions that are not operated on
in a numerator can be canceled against the denominator. Those that are operated on
cannot be canceled since these are differential operators [e.g., in $(1/x)\, dx^2/dx$ it is not
permissible to cancel $1/x$ against $x^2$ before differentiating: $(1/x)\, dx^2/dx \neq dx/dx$].
Such cancellation gives

    $\frac{H_x X}{X} + \frac{H_y Y}{Y} + \frac{H_z Z}{Z} = E$  (a constant)    (2-56)

Now, suppose the particle is moving in the box parallel to the x axis so that the variables
y and z are not changing. Then, of course, the functions Y and Z are also not changing,
so $H_y Y/Y$ and $H_z Z/Z$ are both constant. Only $H_x X/X$ can vary. But does it vary?
Not according to Eq. (2-56), which reduces under these conditions to

    $\frac{H_x X}{X} + \text{constant} + \text{constant} = E$  (a constant)    (2-57)

Therefore, even though the particle is moving in the x direction, $H_x X/X$ must also be a
constant, which we shall call $E_x$. Similar reasoning leads to analogous constants $E_y$ and
$E_z$. Furthermore, the behavior of $H_x X/X$ must really be independent of whether the
particle is moving parallel to the y and z axes. Even if y and z do change, they do not
appear in the quantity $H_x X/X$ anyway. Thus we may write, without restriction,

    $\frac{H_x X}{X} = E_x, \quad \frac{H_y Y}{Y} = E_y, \quad \frac{H_z Z}{Z} = E_z$    (2-58)




and, from Eq. (2-56),

    $E_x + E_y + E_z = E$    (2-59)


Our original equation in three variables has been separated into three equations, one in 
each variable. The first of these equations may be rewritten 


    $H_x X = E_x X$    (2-60)


which is just the Schrödinger equation for the particle in the one-dimensional square 
well, which we have already solved. For a rectangular box with $L_x \neq L_y \neq L_z$ we have
the general solution

    $\psi = XYZ = \sqrt{2/L_x} \sin(n_x \pi x/L_x)\, \sqrt{2/L_y} \sin(n_y \pi y/L_y)\, \sqrt{2/L_z} \sin(n_z \pi z/L_z)$    (2-61)

and

    $E = E_x + E_y + E_z = \frac{h^2}{8m} \left( \frac{n_x^2}{L_x^2} + \frac{n_y^2}{L_y^2} + \frac{n_z^2}{L_z^2} \right)$    (2-62)

For a cubical box, $L_x = L_y = L_z = L$, and the energy expression simplifies to

    $E = (h^2/8mL^2)(n_x^2 + n_y^2 + n_z^2)$    (2-63)

The lowest energy occurs when $n_x = n_y = n_z = 1$, and so

    $E_1(1) = 3h^2/8mL^2$    (2-64)


Thus, the cubical box has three times the zero point energy of the corresponding
one-dimensional well, one-third coming from each independent coordinate for motion (i.e.,
"degree of freedom"). The one in parentheses indicates that this level is nondegenerate.
The next level is produced when one of the quantum numbers n has a value of two
while the others have values of one. There are three independent ways of doing this;
therefore, the second level is triply degenerate, and $E_2(3) = 6h^2/8mL^2$. Proceeding,
$E_3(3) = 9h^2/8mL^2$, $E_4(3) = 11h^2/8mL^2$, $E_5(1) = 12h^2/8mL^2$, $E_6(6) = 14h^2/8mL^2$,
etc. Apparently, the energy level scheme and degeneracies of these levels do not proceed
in the regular manner which is found in the one-dimensional cases we have studied.


EXAMPLE 2-8 Verify that $E_6$ is six-fold degenerate.


SOLUTION » $E_6 = 14h^2/8mL^2$, so $n_x^2 + n_y^2 + n_z^2 = 14$. There is only one combination of
integers that satisfies this relation, namely 1, 2, and 3. So we simply need to deduce how many
unique ways we can assign these integers to $n_x$, $n_y$, and $n_z$. There are three ways to assign 1. For
each of these three choices, there remain but two ways to assign 2, and then there is only one way
to assign 3. So the number of unique ways is 3 × 2 × 1 = 6. (Or one can simply write down all of
the possibilities and observe that there are six of them.)
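The "write down all of the possibilities" approach is easy to automate. The sketch below (the function name is chosen here for illustration) counts the states belonging to each value of $n_x^2 + n_y^2 + n_z^2$, which is the energy of a cubical box in units of $h^2/8mL^2$. Only levels well below the cutoff `nmax` are complete.

```python
from collections import Counter
from itertools import product

def cubic_box_levels(nmax):
    """Return sorted (energy, degeneracy) pairs for a cubical box, with
    energy = nx^2 + ny^2 + nz^2 in units of h^2/(8 m L^2) and each
    quantum number running from 1 to nmax."""
    counts = Counter(
        nx * nx + ny * ny + nz * nz
        for nx, ny, nz in product(range(1, nmax + 1), repeat=3)
    )
    return sorted(counts.items())

levels = cubic_box_levels(6)
for energy, degeneracy in levels[:6]:
    print(energy, degeneracy)
```

The first six entries reproduce the sequence of energies and degeneracies quoted above, including the six-fold degeneracy at 14.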


We shall now briefly consider what probability distributions for the particle are
predicted by these solutions. The lowest-energy solution has its largest value at the
box center where all three sine functions are simultaneously largest. The particle
distribution is sketched in Fig. 2-16a. The second level may be produced by setting
$n_x = 2$ and $n_y = n_z = 1$. Then there will be a nodal plane running through the box
perpendicular to the x axis, producing the split distribution shown in Fig. 2-16b. Since
there are three ways this node can be oriented to produce distinct but energetically
equal distributions, this energy level is triply degenerate. The particle distribution for
the state where $n_x = n_y = n_z = 2$ is sketched in Fig. 2-16c. It is apparent that, in the
high energy limit, the particle distribution becomes spread out uniformly throughout
the box in accord with the classical prediction.

Figure 2-16 » Sketches of particle probability distributions for a particle in a cubical box.
(a) $n_x = n_y = n_z = 1$. (b) $n_x = 2$, $n_y = n_z = 1$. (c) $n_x = n_y = n_z = 2$.

The separation of variables technique which we have used to convert our three- 
dimensional problem into three independent one-dimensional problems will recur in 
other quantum-chemical applications. Reviewing the procedure makes it apparent that 
this technique will work whenever the hamiltonian operator can be cleanly broken into 
parts dependent on completely different coordinates. This is always possible for the 
kinetic energy operator in cartesian coordinates. However, the potential energy operator 
often prevents separation of variables in physical systems of interest. 

It is useful to state the general results of separation of variables. Suppose we have 
a hamiltonian operator, with associated eigenfunctions and eigenvalues: 


    $H\psi_i = E_i \psi_i$    (2-65)

Suppose this hamiltonian can be separated, for example,

    $H(\alpha, \beta) = H_\alpha(\alpha) + H_\beta(\beta)$    (2-66)

where $\alpha$ and $\beta$ stand for two different coordinates or groups of coordinates. Then it
follows that

    $\psi_{j,k} = f_j(\alpha)\, g_k(\beta)$    (2-67)

where

    $H_\alpha f_j = a_j f_j$    (2-68)

and

    $H_\beta g_k = b_k g_k$    (2-69)

Furthermore,

    $E_{j,k} = a_j + b_k$    (2-70)


In other words, if a hamiltonian is separable, then the eigenfunctions will be products 
of eigenfunctions of the subhamiltonians, and the eigenvalues will be sums of the 
subeigenvalues. 
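The general statement can be verified in a few lines by direct substitution. The following is a sketch of that check, using only the fact that $H_\alpha$ acts on $\alpha$ alone (so $g_k(\beta)$ passes through it as a constant) and likewise for $H_\beta$:

```latex
\begin{aligned}
H\,\psi_{j,k}
  &= \left[ H_\alpha(\alpha) + H_\beta(\beta) \right] f_j(\alpha)\, g_k(\beta) \\
  &= g_k(\beta)\, H_\alpha f_j(\alpha) + f_j(\alpha)\, H_\beta g_k(\beta) \\
  &= a_j\, f_j(\alpha)\, g_k(\beta) + b_k\, f_j(\alpha)\, g_k(\beta) \\
  &= (a_j + b_k)\, \psi_{j,k} = E_{j,k}\, \psi_{j,k}.
\end{aligned}
```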


2-8 The Scattering of Particles in One Dimension


Consider the potential shown in Fig. 2-17a. We imagine that a beam of particles, each
having energy E, originates from the left and travels toward $x = \infty$, experiencing a
constant potential everywhere except at the potential step at x = 0. We are interested in
what becomes of these particles: what fraction makes it all the way to the "end" (some
kind of particle trap to the right of the step) and what fraction is reflected back toward
$x = -\infty$. Problems of this type are related to scattering experiments where electrons,
for example, travel through potential jumps produced by electronic devices or through
a dilute gas where potential changes occur in the neighborhood of atoms.

This kind of problem differs from most of those discussed earlier because the particle
is not trapped (classically), so all nonnegative energy values are possible. We already
know what the form of the eigenfunctions is for the constant potentials to the left and
right of the step for any choice of E. On the left they are linear combinations of
$\exp(\pm i\sqrt{2mE}\, x/\hbar)$, where $\hbar = h/2\pi$, and to the right they are linear combinations
of $\exp(\pm i\sqrt{2m(E - U)}\, x/\hbar)$. The only thing we do not yet know is which linear
combinations to take. That is, we need to find A, B, C, and D in

    $\psi_{\text{left}} = A \exp(ikx/\hbar) + B \exp(-ikx/\hbar)$,  $x < 0$,  $k = \sqrt{2mE}$    (2-71)
    $\psi_{\text{right}} = C \exp(ik'x/\hbar) + D \exp(-ik'x/\hbar)$,  $x > 0$,  $k' = \sqrt{2m(E - U)}$    (2-72)


We have seen earlier that the exponentials having positive arguments correspond to
particles traveling from left to right, etc. We signify this with arrows in Fig. 2-17a.
The nature of $\psi_{\text{right}}$ is qualitatively different depending on whether E is larger or
smaller than the step height U. If E < U, the exponential arguments become real.
One of the exponentials decays and the other explodes as x increases, just as we saw
in Section 2-3. We discard the exploding exponential. The decaying exponential on
the right must now be made to join smoothly onto $\psi_{\text{left}}$ at the step. That is, $\psi_{\text{left}}$ and
$\psi_{\text{right}}$ must have the same value and slope at x = 0 (Fig. 2-17b). This means that
$\psi_{\text{left}}$ [Eq. (2-71)] must have real value and slope at the step, which forces it to be a
trigonometric wave. Because there is no additional boundary condition farther left, this
trigonometric wave can always be shifted in phase and amplitude to join smoothly onto
the decaying exponential at the right. (Compare Fig. 2-17b to Fig. 2-11.) The final
values of A and B are simply those that give the appropriate phase and amplitude.

Figure 2-17 » (a) A potential step of height U at x = 0. (b) A wavefunction having E = U/2.
(c) Fraction of particles reflected as a function of E. (d) Fraction of particles transmitted as a function
of E. (Note: The vertical line at left of parts (a) and (b) is not a barrier. It is merely an energy
ordinate.)

The ratio $A^*A/B^*B$ gives the relative fluxes of particles traveling toward the right and
left in the region to the left of the step. If $|A| = |B|$, the fluxes are equal, corresponding
to total reflection of the beam from the step potential.

It is not difficult to show (Problem 2-24) that $|A| = |B|$ whenever $\psi_{\text{left}}$ has real
value and slope at any point, i.e., for any trigonometric wave, and so the potential of
Fig. 2-17a gives total reflection if E < U. (The fact that some particle density exists
at x > 0 due to barrier penetration does not affect this conclusion. The evaluation of
extent of reflection assumes that a time-independent (steady-state) situation has been
achieved, so the penetration population remains constant and none of the new particles
entering from the left are "lost" due to barrier penetration.)

The situation changes when we consider cases for E > U. For any such E, we now
have two acceptable exponential functions on both the right and the left. We proceed
by realizing that the function with coefficient D in $\psi_{\text{right}}$ should be rejected since it
corresponds to particles traveling from right to left, i.e., to particles that have been
reflected from the trap. But we assume the trap to be 100% effective, so once again we
have only one term in $\psi_{\text{right}}$. As before, we set about forcing the values and slopes of
Eqs. (2-71) and (2-72) (with D = 0) to be equal at x = 0. This time, however, there is
no requirement that these values be real. We arrive at the relations (Problem 2-25)


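    $\frac{B}{A} = \frac{k - k'}{k + k'}$    (2-73)

    $\frac{C}{A} = \frac{2k}{k + k'}$    (2-74)

The extent of reflection is equal to

    $\frac{|B|^2}{|A|^2} = \left( \frac{k - k'}{k + k'} \right)^2$    (2-75)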

This can be seen to range from zero, when $k' = k$, to one, when $k' = 0$. $k' = k$ when
E = E − U, i.e., when U is negligible compared to E. So zero reflection (total transmission)
is approached in the high-energy limit. Only when E = U does $k' = 0$, so total reflection
occurs only when E equals the barrier height (or, as we saw previously, is lower).
A plot of the fraction of particles reflected versus E/U appears in Fig. 2-17c. The
transmission, equal to 1.0 − reflection, is plotted in Fig. 2-17d.
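The reflected fraction depends only on the ratio E/U, because the common factor $\sqrt{2mU}$ cancels between numerator and denominator of $(k - k')/(k + k')$. A short sketch follows (the function name is chosen here for illustration; the beam is assumed to come from the left, as in the text):

```python
import math

def step_reflection(e_over_u):
    """Fraction of the beam reflected by a potential step of height U
    for particles of energy E = e_over_u * U incident from the left."""
    if e_over_u <= 0.0:
        raise ValueError("E/U must be positive")
    if e_over_u <= 1.0:
        return 1.0                        # total reflection for E <= U
    k = math.sqrt(e_over_u)               # proportional to sqrt(2mE)
    kp = math.sqrt(e_over_u - 1.0)        # proportional to sqrt(2m(E - U))
    return ((k - kp) / (k + kp)) ** 2

for ratio in (0.5, 1.0, 1.1, 2.0, 10.0):
    r = step_reflection(ratio)
    print(f"E/U = {ratio:5.1f}  reflected = {r:.4f}  transmitted = {1.0 - r:.4f}")
```

The reflected fraction falls steeply just above E = U and approaches zero in the high-energy limit, as described above.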

The approach represented by this scattering problem is to identify the two terms 
that can contribute to the wavefunction in each region; then to recognize that one of 
the terms in one region is lost, either because it explodes or because it corresponds 
to reflection from the particle trap; then to force a smooth junction at the position of 
the discontinuity in the potential; and finally to draw conclusions about reflection and 
transmission from the values of the absolute squares of the coefficients. Notice that, 
for E > U, we could just as well have postulated the beam to be coming from the 
right, with the trap at the left. This would lead us to set A = 0 in Eq. (2-71). Even in
cases like this, where the particles are passing over the edge of a potential cliff, there 
is backscatter (Problem 2-26). 

An additional feature appears when we consider potentials that change at two points
in x, as in Fig. 2-18a. The solution now involves three regions and two places ($x = \pm a$)
where $\psi$ and $d\psi/dx$ must be made equal. As before, we decide where the particle
source and trap are and set one coefficient equal to zero (say G). Detailed solution
of this problem is tedious.³ For our purposes, it is the nature of the result that is
important. The extent of transmission as a function of E/U is plotted in Fig. 2-18b.
There are two obvious ways this differs from the step-potential transmission function
of Fig. 2-17d: First, some of the particles are transmitted even when their energy is less
than U. This is the result of barrier penetration leading to finite particle density at the
right-hand side of the barrier, i.e., transmission due to tunneling. (The extent of tunneling
transmission depends on the thickness of the barrier.) Second, there are oscillations
in the transmission function for E > U, with 100% transmission occurring at intervals
in E instead of only in the infinite limit. These come about because of interference
between waves reflecting off the front and back edges of the barrier. This is most easily
understood by recognizing that 100% transmission corresponds to no reflection, so then
B = 0. This occurs when the wave reflecting back from x = −a is of opposite phase
to that reflecting back from x = +a, and this happens whenever there is an integral
number of de Broglie half-wavelengths between x = −a and x = a. At energies where this
happens, the beam behaves as though the potential barrier is not there.

³See Merzbacher [2, Chapter 6].

Figure 2-18 » (a) A rectangular-hill potential of height U and width 2a. (b) Fraction of beam
transmitted as a function of E/U, where $U = h^2/2\pi^2 m a^2$.
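Both features can be explored numerically. The sketch below does not derive anything: it assumes the standard closed-form transmission expression for a rectangular barrier (the result of the "tedious" matching procedure, not given in this text), written in terms of E/U and a dimensionless barrier strength $2a\sqrt{2mU}/\hbar$. For the barrier of Fig. 2-18, with $U = h^2/2\pi^2 m a^2$, this strength works out to 4. The function and parameter names are illustrative.

```python
import math

def barrier_transmission(e_over_u, strength=4.0):
    """Transmitted fraction for a rectangular barrier of height U and width 2a.
    strength = 2a * sqrt(2mU) / hbar (assumed parameterization); it equals 4
    when U = h^2 / (2 pi^2 m a^2)."""
    eps = e_over_u
    if eps <= 0.0:
        raise ValueError("E/U must be positive")
    if abs(eps - 1.0) < 1e-12:
        return 1.0 / (1.0 + strength ** 2 / 4.0)      # limiting value at E = U
    if eps < 1.0:
        # tunneling regime: the wave decays inside the barrier
        s = math.sinh(strength * math.sqrt(1.0 - eps))
        return 1.0 / (1.0 + s * s / (4.0 * eps * (1.0 - eps)))
    # E > U: oscillatory wave inside the barrier region
    s = math.sin(strength * math.sqrt(eps - 1.0))
    return 1.0 / (1.0 + s * s / (4.0 * eps * (eps - 1.0)))

for ratio in (0.25, 0.5, 0.9, 1.0, 1.3, 2.0, 3.0):
    print(f"E/U = {ratio:4.2f}  transmitted = {barrier_transmission(ratio):.4f}")
```

Scanning E/U > 1 with this function shows the oscillations: the sine factor vanishes whenever an integral number of half-wavelengths fits across the barrier, and the transmitted fraction then equals one.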

The variation of reflection from thin films (e.g., soap bubbles) of light of different 
wavelengths results in the perception of colors and is a familiar example of scattering 
interference. Less familiar is the variation in reflection of a particle beam, outlined 
above. However, once we recognize the wave nature of matter, we must expect particles 
to manifest the same sort of wave properties we associate with light. 


2-9 Summary


In this chapter we have discussed the following points: 


1. A particle constrained in the classical sense (i.e., lacking the energy to overcome 
barriers preventing its motion over the entire coordinate range) will have quantized 
energy levels and a finite zero-point energy. In the mathematical analysis, this arises 
from requirements on y at boundaries. 


2. $\psi$ can be nonsmooth, or cusped, where V is infinite at a point. If V is infinite over
a finite range, $\psi$ must be zero there.


3. Nondegenerate eigenfunctions of H must be symmetric or antisymmetric for any
operation that leaves H unchanged.


4. $|\psi|^2$ may be regarded as a statistical measure: a summary of many measurements
of position on independent, but identically prepared, systems.


5. Quantum-mechanical predictions approach classical predictions in the limits of
large E, or large mass, or very high quantum number values.


6. Integrals with antisymmetric integrands must vanish.


7. $|\psi|^2$ does not vanish in regions where V > E if V is finite. This is called "barrier
penetration."


8. One-dimensional motion of a free particle has a continuum of energy levels. Except
for E = 0, the states are doubly degenerate. Therefore, any mixture of such a
pair of states is still an eigenfunction of H. But only two eigenfunctions (for a
given $E \neq 0$) are also eigenfunctions for the momentum operator. These are the
exponential functions. Since they correspond to different momenta, mixing them
produces functions that are not eigenfunctions for the momentum operator.


9. Motion of a particle on a ring has quantum-mechanical solutions very similar to
those for free-particle motion in one dimension. In both cases, there is no zero-point
energy. Both are doubly degenerate for E > 0 because two directional possibilities
are present. Both have a set of exponential solutions that are eigenfunctions for
momentum. The main difference is that the particle-in-a-ring energies are
quantized, due to head-to-tail "joining conditions" on $\psi$.


10. Increasing the dimensionality of a particle's range of motion increases the number
of quantum numbers needed to define the wavefunctions. In cases where the
hamiltonian operator can be written as a sum of operators for different coordinates
(i.e., is "separable"), the problem greatly simplifies; the wavefunctions become
products, and the energies become sums.


11. Scattering problems are treated by selecting an energy of interest from the
continuum of possibilities, removing functions that describe nonphysical processes
such as backscatter from the trap, and matching wave values and slopes at region
boundaries. Resulting wavefunctions show wave interference effects similar to
those observed for light.


2-9.A Problems 


2-1. Ascertain that the expression (2-12) for energy has the proper dimensions. 


2-2. Solve Eq. (2-9) for A. 


2-3. There is a simple way to show that A in Eq. (2-9) must equal $\sqrt{2/L}$. It involves
sketching $\psi^2$, recognizing that $\sin^2 x + \cos^2 x = 1$, and asking what A must equal
in order to make the area under $\psi^2$ equal 1. Show this for n = 1, and argue why
it must give the same result for all n.


2-4. Evaluate the probability for finding a particle in the middle third of a
one-dimensional box in states with n = 1, 2, 3, 104. Compare your answers with the
sketches in Fig. 2-5 to see if they are reasonable.


2-5. a) Estimate the probability for finding a particle in the n = 1 state in the line
element $\Delta x$ centered at the midpoint of a one-dimensional box if $\Delta x = 0.01L$.
How does this compare to the classical probability?

b) Repeat the problem, but with $\Delta x$ centered one third of the way from a box
edge.


2-6. a) Use common sense to evaluate the following integral for the particle in a
one-dimensional box, assuming that $\psi_5$ is normalized.

    $\int_0^{L/5} \psi_5^2\, dx$

b) How does this value compare to that for the integral over the same range, but
using $\psi_1$ instead of $\psi_5$? (Larger, smaller, or equal?) Use a sketch to defend
your answer.


2-7. Let S and A be respectively symmetric and antisymmetric functions for the
operator R. Evaluate the following, where R operates on every function to its
right: (a) RS (b) RA (c) RSS (d) RAA (e) RAS (f) RAASASSA (g) RAASASAA.
Can you think of a simple general rule for telling when a product of symmetric
and antisymmetric functions will be antisymmetric?


2-8. Using the concept of odd and even functions, ascertain by inspection of sketches
whether the following need be identically zero:


a) fj sin cos 6 dé 

b) (7, sin cos0 dé 

c) Juge cos x dx 

d) f, cos ysin? ydy 
e) Jo sin? 0 cos? 0 d0 
Dus sin? 0 cos? 0 dO 
g) fi fA xh ydxdy 
h) f7; xsinxcosxdx 
i) ie sinx A sin? x dx 
j JZ sin? x 5- sin x dx 

2-9. Verify Eq. (2-23) for the general case $n \neq m$ by explicit integration.


2-10. For the potential of Fig. 2-8, when E < U the energies are discrete, and when
E > U, they are continuous. Is there a solution with E = U? What special
requirements are there, if any, for such a solution to exist?
2-11. Prove the following statement: any linear combination of degenerate
eigenfunctions of H is also an eigenfunction of H.


2-12. In a few words, indicate what is wrong with the wavefunctions sketched in the
potentials shown in Fig. P2-12. If the solution appears to be acceptable, indicate
this fact.

Figure P2-12 »


2-13. A double-well potential ranges from x = 0 to x = 2L and has a thin (width =
0.01L) rectangular barrier of finite height centered at L.

a) Sketch the wavefunction that goes with the fourth energy level in this system,
assuming that its energy is less than the height of the barrier.
b) Estimate the energy of this level for a particle of mass m.


2-14. Use the simple approach presented in Problem 2-3 to demonstrate that $A =
1/\sqrt{\pi}$ for the trigonometric particle-in-a-ring eigenfunctions and $1/\sqrt{2\pi}$ for the
exponential eigenfunctions.


2-15. Explain why $(2\pi)^{-1/2} \exp(i\sqrt{2}\,\phi)$ is unacceptable as a wavefunction for the
particle in a ring.


2-16. For a particle in a ring, an eigenfunction is $\psi = (1/\sqrt{\pi}) \cos(3\phi)$.

a) Write down H.

b) Evaluate $H\psi$ and identify the energy.

c) Is this a state for which angular momentum is a constant of motion?
Demonstrate that your answer is correct.
2-17. Consider two related systems: a particle in a ring of constant potential and
another just like it except for a very thin, infinitely high barrier inserted at $\phi = 0$.
When this barrier is inserted,

a) are any energies added or lost?

b) do any degeneracies change?

c) are exponential and sine-cosine forms both still acceptable for
eigenfunctions?

d) is angular momentum still a constant of motion?


2-18. Consider a particle of mass m in a two-dimensional box having side lengths $L_x$
and $L_y$ with $L_x = 2L_y$ and V = 0 in the box, $\infty$ outside.

a) Write an expression for the allowed energy levels of this system.

b) What is the zero-point energy?

c) Calculate the energies and degeneracies for the lowest eight energy levels.

d) Sketch the wavefunction for the fourth level.

e) Suppose V = 10 J in the box. What effect has this on (i) the eigenvalues?
(ii) the eigenfunctions?


2-19. Consider the particle in a three-dimensional rectangular box with $L_x = L_y =
L_z/2$. What would be the energy when $n_x = 1$, $n_y = 2$, $n_z = 2$? For $n_x = 1$,
$n_y = 1$, $n_z = 4$? Can you guess the meaning of the term "accidental degeneracy"?


2-20. Consider a particle of mass m in a cubical box with V = 0 at 0 < x, y, z < L.

a) Is $(1/\sqrt{2})(\psi_{5,1,1} - \psi_{3,3,3})$ an eigenfunction for this system? Explain your
reasoning.

b) Estimate the probability for finding the particle in a volume element $\Delta V =
0.001V$ at the box center when the system is in its lowest-energy state. What
is the classical value?


2-21. Kuhn [1] has suggested that the mobile $\pi$ electrons in polymethine dyes can be
modeled after the one-dimensional box. Consider the symmetric carbocyanine
dyes (I) where the positive charge "resonates" between the two nitrogen atoms.
The zigzag polymethine path along which the $\pi$ electrons are relatively free to
move extends along the conjugated system between the two nitrogens. Kuhn
assumed a box length L equal to this path length plus one extra bond length
on each end (so that the nitrogens would not be at the very edge of the box
where they would be prevented from having any $\pi$-electron charge). This
gives $L = (2n + 10)l$ where $l$ is 1.39 Å, the bond length of an intermediate
(i.e., between single and double) C-C bond. The number of $\pi$ electrons in the
polymethine region is 2n + 10. Assume that each energy level in the box is
capable of holding no more than two electrons and that the electronic transition
responsible for the dye color corresponds to the promotion of an electron from
the highest filled to the lowest empty level, the levels having initially been
filled starting with the lowest, as shown in Fig. P2-13. Calculate $\Delta E$ and $\lambda$
for the cases n = 0, 1, 2, 3 and compare with the observed values of maximum
absorption of about 5750, 7150, 8180, and 9250 Å, respectively.
Figure P2-13 »


2-22. Show whether momentum in the x direction is a constant of motion for a free
particle of mass m in states described by the following functions. In cases where it is
a constant of motion, give its value. In all cases, evaluate the kinetic energy of the
particle.

a) $\psi = \sin 3x$
b) $\psi = \exp(3ix)$
c) $\psi = \cos 3x$
d) $\psi = \exp(-3ix)$


2-23. Show whether angular momentum perpendicular to the plane of rotation is a
constant of motion for a particle of mass m moving in a ring of constant potential
in states described by the following functions. In cases where it is a constant of
motion, give its value. In all cases, evaluate the kinetic energy of the particle.

a) $\psi = (1/\sqrt{\pi}) \cos 3\phi$
b) $\psi = (1/\sqrt{2\pi}) \exp(-3i\phi)$
c) $\psi = (1/\sqrt{\pi}) \sin 3\phi$
d) $\psi = (1/\sqrt{2\pi}) \exp(3i\phi)$


2-24. Demonstrate that the requirement that $\psi = A \exp(ikx) + B \exp(-ikx)$ have real
value and slope at a point in x suffices to make $|A| = |B|$.


2-25. Derive relations (2-73) and (2-74) by matching wave values and slopes at the step.
2-26. Solve the problem for the step potential shown in Fig. 2-17a, but with the beam
traveling from a source at right toward a trap at left.


2-27. The reflection coefficient is defined as $|B|^2/|A|^2$ for a beam originating from the
left of Fig. 2-17a. The transmission coefficient is defined as $(k'/k)\,|C|^2/|A|^2$.


a) Why is the factor $k'/k$ needed?
b) Show that the sum of these coefficients equals unity, consistent with a
steady-state situation.


2-28. For scattering from a potential such as that in Fig. 2-18a, 100% transmission
occurs at various finite particle energies. Find the lowest two values of E/U
for which this occurs for particles of mass m, barrier width d, and barrier height
$U = 2h^2/\pi^2 m d^2$.


2-29. Calculate "frequencies" in cm$^{-1}$ needed to accomplish the transitions $\Delta E_1$ and
$\Delta E_2$ in Fig. 2-14.


Multiple Choice Questions 


(Try to answer these by inspection.) 


1. The integral $\int_{-a}^{a} \cos(x) \sin(x)\, dx$


a) equals zero for any value of a, and cos(x) is antisymmetric in the range of the 
integral. 

b) is unequal to zero except for certain values of a, and cos(x) is symmetric in the 
range of the integral. 

c) equals zero for any value of a and cos(x) is symmetric in the range of the integral. 

d) is unequal to zero except for certain values of a, and sin(x) is antisymmetric in 
the range of the integral. 

e) equals zero for any value of a, and sin(x) is symmetric in the range of the integral. 


P is x sin(x) cos(x) dx 


Which one of the following statements is true about the above integral and the three 
functions in its integrand? 


a) All three functions are antisymmetric in the range and the integral equals zero. 

b) Two functions are antisymmetric and one is symmetric in the range, and the
integral is unequal to zero.

c) Two functions are symmetric and one is antisymmetric in the range, and the 
integral is equal to zero. 

d) One function is symmetric, one is antisymmetric, and one is unsymmetric in the 
range, and the integral is unequal to zero. 

e) None of the above is a true statement. 
3. In solving the particle in a one-dimensional box problem with infinite repulsive
walls at x = 0 and L, we started with the function A sin(kx) + B cos(kx). Which
one of the following is a true statement?


a) The value of k is found by requiring that the solution be normalized. 

b) Adding C exp(ikx) to the above function would prevent it from being an eigen- 
function of the hamiltonian operator. 

c) It is necessary that this function equal L when x = 0.

d) The boundary condition at x = L is used to show that B = 0.

e) None of the above is a true statement. 


4. It is found that a particle in a one-dimensional box of length L can be excited from 
the n = 1 to the n = 2 state by light of frequency ν. If the box length is doubled, the 
frequency needed to produce the n = 1 to n = 2 transition becomes 


a) v/4 
b) v/2 
c) 2v 
d) 4v 
e) None of the above is correct. 


5. For a particle in a one-dimensional box with infinite walls at x = 0 and L, and in the 
n = 3 state, the probability for finding the particle in the range 0 < x < L/4 is 


a) greater than 1/3. 

b) exactly 1/6. 

c) exactly 1/3. 

d) less than 1/6. 

e) None of the above is correct. 


6. A student calculates the probability for finding a particle in the left-most 10% of a 
one-dimensional box for the n = 1 state. Which one of the following answers could 
be correct? 


a) 1.742 
b) 0.024 
c) 0.243 
d) 0.100 
e) None of the above is reasonable. 


7. Which one of the following statements is true about the particle in a one-dimensional 
box with infinite walls? (All integrals range over the full length of the box.) 


a) f UE dx — 0 because these wavefunctions are orthogonal. 

b) f Via dx =0 because both of these wavefunctions are symmetric. 
c) f V1 dx — 0 because these wavefunctions are normalized. 

d) $\int \psi_1 \psi_3\,dx = 0$ because these wavefunctions are orthogonal. 

e) None of the above is a true statement. 


8. Which one of the following is the correct formula for the lowest-energy eigenfunction 
for a particle in a one-dimensional box having infinite barriers at x = −L/2 
and L/2? 


a) $\sqrt{2/L}\,\sin(\pi x/L)$ 
b) $\sqrt{2/L}\,\cos(\pi x/L)$ 
c) J? expanx/L) 


d) J} expinx/L) 
e) None of the above is correct. 


9. A particle is free to move in the x dimension without constraint (i.e., under the 
influence of a constant potential, which we assume to be zero). For this system, 
the wavefunction ψ(x) = exp(3.4x) is not acceptable because 


a) it is not an eigenfunction of the hamiltonian operator. 
b) it is multi-valued. 

c) it is discontinuous. 

d) it approaches infinity as x approaches infinity. 

e) it goes to zero as x approaches minus infinity. 


10. Consider two identical one-dimensional square wells connected by a finite barrier. 
Which one of the following statements about the quantum-mechanical time-independent 
solutions for this system is true when two equivalent “half-solutions” 
in the two wells are joined together to produce two overall solutions? 


a) The combination that is symmetric for reflection through the central barrier 
always has the lower energy of the two. 

b) The combination that places a node in the barrier always has the lower energy 
of the two. 

c) The sum of the two half-solutions always has the lower energy of the two. 

d) The difference of the two half-solutions always has the lower energy of the two. 

e) None of the above is a true statement. 


11. For a single particle-in-a-ring system having energy $9h^2/8\pi^2 I$ we can say that the 
angular momentum, when measured, will equal 


a) $3\hbar$ 

b) $\sqrt{12}\,\hbar$ 

c) either $3\hbar$ or $-3\hbar$ 

d) zero 

e) None of the above is a true statement. 


12. A particle in a ring has wavefunctions that 


a) result from placing an integral number of half-waves in the circumference of 
the ring. 

b) must be eigenfunctions for $(h/2\pi i)\,d/d\phi$. 

c) are all doubly degenerate, due to two rotational directions. 

d) correspond to energies that increase with the square of the quantum number. 
e) None of the above is a true statement. 


13. The hamiltonian operator for a system is 
$H = -(h^2/8\pi^2 m)\nabla^2 + x^2 + y^2 + z^2$ 
For this system we should expect 


a) two quantum numbers at most. 

b) eigenfunctions that are sums of functions, each depending on only one of the 
variables. 

c) eigenvalues that are products of eigenvalues of separated equations. 

d) eigenvalues that are sums of eigenvalues of separated equations. 

e) None of the above is a correct statement. 


14. For a particle in a one-dimensional box with one infinite barrier and one finite 
barrier of height U, 


a) ψ = 0 at both barriers if E is less than U. 

b) barrier penetration is smallest for the lowest-energy state. 

c) no more than one quantized state can exist, and its energy is less than U. 
d) only states having E greater than U can exist. 

e) None of the above statements is correct. 


15. For a particle of mass m in a cubical box having edge length L, 


a) the zero point energy is $3h^2/8mL^2$. 

b) the probability density has its maximum value at the box center for $\psi_{211}$. 
c) the degeneracy of the ground state is 3. 

d) $\psi_{111}$ has three nodal planes. 

e) None of the above is a true statement. 


Chapter 3 


The One-Dimensional Harmonic Oscillator 


3-1 Introduction 


In Chapter 2 we examined several systems with discontinuous potential energies. In this 
chapter we consider the simple harmonic oscillator—a system with a continuously vary- 
ing potential. There are several reasons for studying this problem in detail. First, the 
quantum-mechanical harmonic oscillator plays an essential role in our understanding 
of molecular vibrations, their spectra, and their influence on thermodynamic properties. 
Second, the qualitative results of the problem exemplify the concepts we have presented 
in Chapters 1 and 2. Finally, the problem provides a good demonstration of mathemat- 
ical techniques that are important in quantum chemistry. Since many chemists are not 
overly familiar with some of these mathematical concepts, we shall deal with them in 
detail in the context of this problem. 


3-2 Some Characteristics of the Classical 
One-Dimensional Harmonic Oscillator 


A pendulum consisting of a large mass hanging by an almost weightless wire, and 
swinging through a very small angle, is a close approximation to a classical harmonic 
oscillator. It is an oscillator since its motion is back and forth over the same path. 
It is harmonic to the extent that the restoring force on the mass is proportional to the 
horizontal component of the displacement of the mass from its rest position. This force 
law, known as Hooke's law, is the common first approximation made in the analysis of 
a system vibrating about an equilibrium position. If we let the x axis be the coordinate 
of displacement of the mass, with x = 0 as the rest position, then we may write the 
restoring force as 


\[ F = -kx, \tag{3-1} \]


where k is the force constant. The minus sign assures that the force on the displaced 
mass is always directed toward the rest position. 

We can use this force expression to determine an equation of motion for the mass, 
that is, an equation relating its location in space, x, to its location in time, t: 

\[ F = -kx(t) = ma = m\,\frac{d^2 x(t)}{dt^2} \tag{3-2} \]

or 

\[ \frac{d^2 x(t)}{dt^2} = (-k/m)\,x(t) \tag{3-3} \]

According to this equation, x(t) is a function that, when differentiated twice, is 
regenerated with the multiplier −k/m. A general solution is 


\[ x(t) = a\sin\left(\sqrt{\frac{k}{m}}\,t\right) + b\cos\left(\sqrt{\frac{k}{m}}\,t\right) \tag{3-4} \]

If we require that x(t) be at its maximum value L at t = 0 (as though the pendulum 
is held at its position of maximum displacement and then released at t = 0), then it 
follows that b = L. Since the pendulum is also motionless at t = 0, $(dx(t)/dt)_{t=0} = 0$, 
and so a = 0. Hence, 

\[ x(t) = L\cos\left(\sqrt{\frac{k}{m}}\,t\right) \tag{3-5} \]


Thus, the equation of motion [Eq. (3-3)] leads to a function, x(t), that describes the 
trajectory of the oscillator. This function has the trigonometric time dependence 
characteristic of harmonic motion. From this expression, we see that x(t) repeats itself 
whenever the argument of the cosine increases by 2π. This will require a certain time 
interval t′. Thus, the pendulum makes one complete back and forth motion in a time t′ 
given by 

\[ \sqrt{\frac{k}{m}}\,t' = 2\pi, \qquad t' = 2\pi\sqrt{\frac{m}{k}} \tag{3-6} \]

so the frequency of the oscillation ν is 

\[ \nu = 1/t' = \frac{1}{2\pi}\sqrt{\frac{k}{m}} \tag{3-7} \]

Suppose that one were to take a multiflash photograph of a swinging pendulum from 
above. The result would look as shown in Fig. 3-1a, the number of images being much 
greater near the termini of the swing (called the “turning points”) than at the middle 
because the pendulum is moving fastest as it crosses the middle. This, in turn, results 
from the fact that all the potential energy of the mass has been converted to kinetic 
energy at the middle point. We thus arrive at a classical prediction for the time-averaged 
distribution of the projection of the harmonic oscillator in the displacement coordinate: 
This distribution function is greatest in regions where the potential energy is highest 
(Fig. 3-1b) (Problem 3-1). 
Let us calculate and compare the time-averaged potential and kinetic energies for the 
classical harmonic oscillator. When the particle is at some instantaneous displacement 
x′, its potential energy is 

\[ V(x') = (\text{applied force times distance to return to } x = 0) = \int_0^{x'} kx\,dx = \frac{1}{2}kx'^2 \tag{3-8} \]

Figure 3-1 » (a) Results of a uniform multiflash photograph of a swinging pendulum as 
photographed from above. (b) Distribution function corresponding to the continuous limit of 
discrete distribution shown in (a). 


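The cumulative value of the potential energy over one complete oscillation, $V_c$, is given 
by the integral 

\[ V_c(t' - 0) = \int_0^{t'} V(t)\,dt = \frac{k}{2}\int_0^{t'} x(t)^2\,dt \tag{3-9} \]

Substituting for x(t) as indicated in Eq. (3-5) 

\[ V_c(t' - 0) = \frac{1}{2}kL^2\int_0^{t'} \cos^2\left(\sqrt{\frac{k}{m}}\,t\right) dt
 = \frac{1}{2}kL^2\sqrt{\frac{m}{k}}\int_0^{t'} \cos^2\left(\sqrt{\frac{k}{m}}\,t\right) d\left(\sqrt{\frac{k}{m}}\,t\right) \tag{3-10} \]

When t = t′, $\sqrt{k/m}\,t = 2\pi$, and we may rewrite Eq. (3-10) as 

\[ V_c(t' - 0) = \frac{1}{2}kL^2\sqrt{\frac{m}{k}}\int_0^{2\pi} \cos^2 y\,dy = (\pi/2)\,kL^2\sqrt{m/k} \tag{3-11} \]

If we now divide by t′ to get the average potential energy per unit time, we find 

\[ \bar{V} = \frac{V_c(t' - 0)}{t'} = \frac{(\pi/2)\,kL^2\sqrt{m/k}}{2\pi\sqrt{m/k}} = \frac{kL^2}{4} \tag{3-12} \]
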


Thus, we have the average potential energy. If we knew the total energy, which is a 
constant of motion, we could get the average kinetic energy by taking the difference. 

It is easy to evaluate the total energy by taking advantage of its constancy over time. 
Since we can choose any point in time to evaluate it, we select the moment of release 
(t = 0 and x = L). The mass is motionless so that the total energy is identical to the 
potential energy: 

\[ E = \frac{1}{2}kL^2 \tag{3-13} \]

Comparing Eqs. (3-12) and (3-13) we see that $\bar{T} = kL^2/4$. On the average, then, the 
classical harmonic oscillator stores half of its total energy as potential energy and half 
as kinetic energy. 
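
The equal split between potential and kinetic energy can be checked numerically. The sketch below is only an illustration, not part of the original derivation: it averages V = kx²/2 and T = mv²/2 over one period of Eq. (3-5) with a midpoint sum. The function name `time_averaged_energies`, the step count, and the sample values of k, m, and L are assumptions chosen for the demonstration.

```python
import math

def time_averaged_energies(k, m, L, steps=100_000):
    """Average V = k x^2 / 2 and T = m v^2 / 2 over one period of
    x(t) = L cos(sqrt(k/m) t), Eq. (3-5), using a midpoint sum."""
    omega = math.sqrt(k / m)
    period = 2.0 * math.pi / omega           # t' of Eq. (3-6)
    dt = period / steps
    v_sum = 0.0
    t_sum = 0.0
    for i in range(steps):
        t = (i + 0.5) * dt
        x = L * math.cos(omega * t)
        velocity = -L * omega * math.sin(omega * t)   # dx/dt
        v_sum += 0.5 * k * x * x * dt
        t_sum += 0.5 * m * velocity * velocity * dt
    return v_sum / period, t_sum / period

# Illustrative values (assumed): k in N/m, m in kg, L in m.
k, m, L = 0.0246, 0.0100, 0.400
v_avg, t_avg = time_averaged_energies(k, m, L)
print(v_avg, t_avg, k * L**2 / 4)
```

Both averages should agree with kL²/4 of Eq. (3-12) to within rounding error, whatever positive values of k, m, and L are chosen.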


EXAMPLE 3-1 A 10.00 g mass on a Hooke's law spring with force constant k = 
0.0246 N m$^{-1}$ is pulled from rest at x = 0 to x = 0.400 m and released. 


a) What is its total energy? 
b) What is its frequency of oscillation? 
c) How do these answers change if the mass weighs 40.00 g? 


SOLUTION » a) E = T + V. At x = 0.400 m, T = 0, so E = V = kx²/2 = 0.50(0.0246 N m$^{-1}$) 
(0.400 m)² = 0.0020 J. 

b) \[ \nu = \frac{1}{2\pi}\left(\frac{k}{m}\right)^{1/2} = \frac{1}{2\pi}\left(\frac{0.0246\ \mathrm{N\,m^{-1}}}{0.0100\ \mathrm{kg}}\right)^{1/2} = 0.25\ \mathrm{s^{-1}} \]

c) E is unchanged, and ν is halved to 0.125 s$^{-1}$. The energy depends on the force constant and 
the displacement of the oscillator, but not on the mass of the oscillator. The frequency depends not 
only on the force constant, but also on the mass because the frequency is affected by the inertia of 
the oscillator. A greater mass has a lower frequency of oscillation for the same force constant. 
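
The arithmetic of Example 3-1 can be reproduced with a few lines of Python. This is a minimal sketch; the helper names `oscillator_energy` and `oscillator_frequency` are illustrative and simply encode E = kx²/2 and Eq. (3-7).

```python
import math

def oscillator_energy(k, amplitude):
    """Total energy of a classical oscillator released from rest: E = k x^2 / 2."""
    return 0.5 * k * amplitude**2

def oscillator_frequency(k, mass):
    """Classical frequency from Eq. (3-7): nu = (1 / 2 pi) sqrt(k / m)."""
    return math.sqrt(k / mass) / (2.0 * math.pi)

k = 0.0246      # force constant, N/m
x0 = 0.400      # initial displacement, m
for mass in (0.0100, 0.0400):   # kg
    print(mass, oscillator_energy(k, x0), oscillator_frequency(k, mass))
```

Quadrupling the mass leaves the energy untouched and halves the frequency, as stated in part (c).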


3-3 The Quantum-Mechanical Harmonic Oscillator 


We have already seen [Eq. (3-8)] that the potential energy of a harmonic oscillator is 
given by 


\[ V(x) = \frac{1}{2}kx^2 \tag{3-14} \]

so we can immediately write down the one-dimensional Schrödinger equation for the 
harmonic oscillator: 

\[ \left[-\frac{h^2}{8\pi^2 m}\frac{d^2}{dx^2} + \frac{1}{2}kx^2\right]\psi(x) = E\psi(x) \tag{3-15} \]


The detailed solution of this differential equation is taken up in the next section. 
Here we show that we can understand a great deal about the nature of the solutions to 
this equation by analogy with the systems studied in Chapter 2. 

In Fig. 3-2a are shown the potential, some eigenvalues, and some eigenfunctions for 
the harmonic oscillator. The potential function is a parabola [Eq. (3-14)] centered at 
x = 0 and having a value of zero at its lowest point. For comparison, similar information 

Figure 3-2 » The potential function, energy levels, and wavefunctions for (a) the harmonic 
oscillator, (b) the particle in the infinitely deep square well. (c) ψ² for the harmonic 
oscillator in the state n = 5. 


is graphed in Fig. 3-2b for the particle in a box with infinitely high walls. Some 
important features of the harmonic oscillator are: 


1. The energy-level spacing for the harmonic oscillator is constant. As mentioned 
in Chapter 2, we expect the energy levels for the harmonic oscillator to diverge 
less rapidly than those for the square well because the higher energy states in the 
harmonic oscillator have effectively larger “boxes” than do the lower states (that is, 
the more energetic the oscillator, the more widely separated are its classical turning 
points). That the spacing for the oscillator grows less rapidly than that for the box 
is therefore reasonable. That it is actually constant is something we will show 
mathematically in the next section. 


2. The wavefunctions for the harmonic oscillator are either symmetric or antisymmetric 
under reflection through x = 0. This is necessary because the hamiltonian is invariant 
to reflection through x = 0 and because the eigenfunctions are nondegenerate. Both 
of these conditions apply also to the box problem. If we imagine moving from the 
box potential to the parabolic potential by a process of continuous deformation, the 
symmetry is not altered and there is no reason to expect nondegenerate box levels 
to come together and become degenerate. Therefore, the oscillator wavefunction 
symmetries are not surprising. As a consequence, we have at once that the symmetric 
wavefunctions are automatically orthogonal to antisymmetric wavefunctions, as 
discussed earlier. (Actually, all the wavefunctions are orthogonal to each other. This 
is proved in Section 3-4.) 


3. The harmonic oscillator has finite zero-point energy. (The evidence for this in 
Fig. 3-2a is the observation that the line for the lowest (n = 0) energy level lies 
above the lowest point of the parabola, where V = 0.) This is expected since the 
change from square well to parabolic well does not remove the restrictions on particle 
position; it merely changes them. 


4. The particle has a finite probability of being found beyond the classical turning 
points; it penetrates the barrier. This is to be expected on the basis of earlier 
considerations since the barrier is not infinite at the classical turning point. (The 
potential becomes infinite only at x = ±∞.) 


5. In the lowest-energy state the probability distribution favors the particle being in 
the low-potential central region of the well, while at higher energies the distribution 
approaches more nearly the classical result of favoring the higher potential regions 
(Fig. 3-2c). 


3-4 Solution of the Harmonic Oscillator 
Schrödinger Equation 


3-4.A Simplifying the Schrödinger Equation 
Equation (3-15) is simplified by substituting in the following relations: 

\[ \alpha = 8\pi^2 mE/h^2 \tag{3-16} \]
\[ \beta^2 = 4\pi^2 mk/h^2 \tag{3-17} \]

The quantities α and β have units of m$^{-2}$. We will assume that β is the positive root 
of β². The quantity α is necessarily positive. The Schrödinger equation now can be 
written 

\[ \frac{d^2\psi(x)}{dx^2} + (\alpha - \beta^2 x^2)\,\psi(x) = 0 \tag{3-18} \]


3-4.B Establishing the Correct Asymptotic Behavior 


At very large values of |x|, the quantity α (which is a constant since E is a constant) 
becomes negligible compared to β²x². That is, the Schrödinger equation (3-18) 
approaches more and more closely the asymptotic form 

\[ d^2\psi(x)/dx^2 = \beta^2 x^2\,\psi(x), \qquad |x| \to \infty \tag{3-19} \]

What we need, then, are solutions ψ(x) that approach the solutions of Eq. (3-19) at 
large values of |x|. The solutions for Eq. (3-19) can be figured out from the general 
rule for differentiating exponentials: 

\[ (d/dx)\exp(u(x)) = \exp(u(x))\,du(x)/dx \tag{3-20} \]
Then 
\[ (d^2/dx^2)\exp(u(x)) = \left[(du(x)/dx)^2 + d^2u(x)/dx^2\right]\exp(u(x)) \tag{3-21} \]


We want the term in square brackets to become equal to β²x² in the limit of large |x|. 
We can arrange for this to happen by setting 

\[ u(x) = \pm\beta x^2/2 \tag{3-22} \]
for then 
\[ (d^2/dx^2)\exp(u(x)) = (\beta^2 x^2 \pm \beta)\exp(\pm\beta x^2/2) \tag{3-23} \]

At large values of |x|, β is negligible compared to β²x², and so exp(±βx²/2) are 
asymptotic solutions for Eq. (3-19). As |x| increases, the positive exponential increases 
rapidly whereas the negative exponential dies away. We have seen that, for the 
wavefunction to be physically meaningful, we must reject the solution that blows up at large 
|x|. On the basis of these considerations, we can say that, if ψ contains exp(−βx²/2), 
it will have the correct asymptotic behavior if no other term is present that dominates 
at large |x|. Therefore, 

\[ \psi(x) = q(x)\exp(-\beta x^2/2) \tag{3-24} \]

and it remains to find the function q(x). 


3-4.C The Differential Equation for q(x) 
Substituting Eq. (3-24) into the Schrödinger equation (3-18) gives 

\[ \exp(-\beta x^2/2)\left[-\beta q(x) - 2\beta x\frac{dq(x)}{dx} + \frac{d^2 q(x)}{dx^2} + \alpha q(x)\right] = 0 \tag{3-25} \]

This equation is satisfied only if the term in brackets is zero: 

\[ \frac{d^2 q(x)}{dx^2} - 2\beta x\frac{dq(x)}{dx} + (\alpha - \beta)\,q(x) = 0 \tag{3-26} \]

Thus, we now have a differential equation for q(x). 
At this point it is convenient to transform variables to put the equation into a simpler 
form. Let 

\[ y = \sqrt{\beta}\,x \tag{3-27} \]
Then 
\[ d/dy = d/d(\sqrt{\beta}\,x) = (1/\sqrt{\beta})\,d/dx \tag{3-28} \]
so that 
\[ d/dx = \sqrt{\beta}\,d/dy \tag{3-29} \]
Similarly 
\[ \frac{d^2}{dx^2} = \beta\frac{d^2}{dy^2} \tag{3-30} \]
and 
\[ x = y/\sqrt{\beta} \tag{3-31} \]

Substituting Eqs. (3-29)-(3-31) into (3-26), and defining f(y) as 
\[ f(y) = f(\sqrt{\beta}\,x) \equiv q(x) \tag{3-32} \]

we obtain (after dividing by β) 

\[ \frac{d^2 f(y)}{dy^2} - 2y\frac{df(y)}{dy} + [(\alpha/\beta) - 1]\,f(y) = 0 \tag{3-33} \]


3-4.D Representing f as a Power Series 


Now f(y) is some function of y that must be single valued, continuous, and smooth 
(i.e., have a continuous first derivative), if ψ is to be properly behaved. Can we think 
of any functions that satisfy these properties? Of course, we can think of a limitless 
number of them. For example, 1, y, y², y³, y⁴, etc., are all single valued, continuous, 
and have continuous derivatives, and so is any linear combination of such functions (e.g., 
4y³ − y + 3). Other examples are sin(y) and exp(y). These functions can be expressed 
as infinite sums of powers of y, however, and so they are included, in principle, in the 
first example. Thus 

\[ \sin(y) = y - y^3/3! + y^5/5! - y^7/7! + \cdots \tag{3-34} \]
and 
\[ \exp(y) = \sum_{n=0}^{\infty} y^n/n! \tag{3-35} \]


that is, sin(y) and the set of all positive powers of y are linearly dependent. Because the 
powers of y can be combined linearly to reproduce certain other functions, the powers 
of y are called a complete set of functions. However, we must exercise some care with 
the concept of completeness. The positive powers of y cannot be used to reproduce a 
discontinuous function, or a function with discontinuous derivatives. Hence, there are 
certain restrictions on the nature of functions g(y) that satisfy the relation 

\[ g(y) = \sum_{n=0}^{\infty} c_n y^n \tag{3-36} \]

These restrictions define a class of functions, and the powers of y are a complete set 
only within this class. The positive powers of y, then, form a complete set, but if we 
remove one of the members of the set, say 1 (the zero power of y), then the set is no 
longer complete. This means that the remaining members of the set cannot compensate 
for the role played by the missing member. In other words, the missing member cannot 
be expressed as a linear combination of the remaining members. In this example 

\[ 1 \neq \sum_{n=1}^{\infty} c_n y^n \tag{3-37} \]

This is easily demonstrated to be true since the left-hand side of Eq. (3-37) is unity 
whereas the right-hand side must be zero when y = 0 for any choice of coefficients. 
Thus, the powers of y are linearly independent functions (no one of them can be 
expressed as a linear combination of all the others). 

The function f(y) involved in ψ should be a member of the class of functions for 
which the powers of y form a complete set. Therefore, we may write 

\[ f(y) = \sum_{n=0}^{\infty} c_n y^n \tag{3-38} \]

and seek an expression for the unknown multipliers $c_n$. 


3-4.E Establishing a Recursion Relation for f 


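Notice that if 

\[ f(y) = c_0 + c_1 y + c_2 y^2 + c_3 y^3 + c_4 y^4 + \cdots \tag{3-39} \]
then 
\[ df(y)/dy = c_1 + 2c_2 y + 3c_3 y^2 + 4c_4 y^3 + \cdots \tag{3-40} \]
and 
\[ d^2 f(y)/dy^2 = 2c_2 + 2\cdot 3c_3 y + 3\cdot 4c_4 y^2 + \cdots \tag{3-41} \]

Thus, substituting Eq. (3-38) into (3-33) gives 

\[ \begin{aligned}
 &1\cdot 2c_2 + 2\cdot 3c_3 y + 3\cdot 4c_4 y^2 + \cdots - 2c_1 y - 2\cdot 2c_2 y^2 - 2\cdot 3c_3 y^3 - \cdots \\
 &\quad + [(\alpha/\beta) - 1]c_0 + [(\alpha/\beta) - 1]c_1 y + [(\alpha/\beta) - 1]c_2 y^2 + \cdots = 0
\end{aligned} \tag{3-42} \]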


Now we will take advantage of the fact that the various powers of y form a linearly 
independent set. Equation (3-42) states that the expression on the LHS equals zero 
for all values of y. There are two ways this might happen. One of these is that minus 
the constant part of the expression is always exactly equal to the y-dependent part, no 
matter what the value of y. This would require a relationship like Eq. (3-37) (except 
with an equality), which we have seen is not possible for independent functions. The 
remaining possibility is that the various independent parts of Eq. (3-42) are individually 
equal to zero: the constant is zero, the coefficient for y is zero, etc. This gives us a 
whole set of equations. Setting the constant term equal to zero gives 

\[ 2c_2 + [(\alpha/\beta) - 1]c_0 = 0 \qquad (m = 0) \tag{3-43} \]
Setting the coefficient for the first power of y to zero gives 

\[ 2\cdot 3c_3 - 2c_1 + [(\alpha/\beta) - 1]c_1 = 0 \qquad (m = 1) \tag{3-44} \]

The y² term gives 

\[ 3\cdot 4c_4 - 2\cdot 2c_2 + [(\alpha/\beta) - 1]c_2 = 0 \qquad (m = 2) \tag{3-45} \]
By inspecting this series, we can arrive at the general result 

\[ (m + 1)(m + 2)\,c_{m+2} + [(\alpha/\beta) - 1 - 2m]\,c_m = 0 \tag{3-46} \]

or 

\[ c_{m+2} = \frac{-[(\alpha/\beta) - 2m - 1]}{(m + 1)(m + 2)}\,c_m \tag{3-47} \]

Equation (3-47) is called a recursion relation. If we knew $c_0$, we could produce 
$c_2$, $c_4$, $c_6$, etc., by continued application of Eq. (3-47). Similarly, knowledge of $c_1$ 
would lead to $c_3$, $c_5$, etc. Thus, it appears that the coefficients for even powers of y 
and those for odd powers of y form separate sets. Choosing $c_0$ determines one set, 
and choosing $c_1$ determines the other, and the choices for $c_0$ and $c_1$ seem independent. 
This separation into two sets is reasonable when we recall that our final solutions 
must be symmetric or antisymmetric in x, hence also in y. The asymptotic part of ψ, 
exp(−βx²/2), is symmetric about x = 0, and so we expect the remainder of ψ, f(y), to 
be either symmetric (even powers of $y = \sqrt{\beta}\,x$) or antisymmetric (odd powers). Thus, 
we can anticipate that some of our solutions will have $c_0 \neq 0$, $c_2 \neq 0$, $c_4 \neq 0$, ... and 
$c_1 = c_3 = c_5 = \cdots = 0$. This will produce symmetric solutions. The remaining solutions 
will have $c_0 = c_2 = c_4 = \cdots = 0$ and $c_1 \neq 0$, $c_3 \neq 0$, ..., and be antisymmetric. 


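EXAMPLE 3-2 Evaluate $c_3$ and $c_5$ if $c_1 = 1$, for arbitrary α and β. What ratio α/β 
will make $c_3 = 0$? What ratio α/β will make $c_5 = 0$ but $c_3 \neq 0$? 


SOLUTION » $c_1 = 1$. For $c_3$, we use m = 1: 
\[ c_3 = \frac{-[(\alpha/\beta) - 3]}{(2)(3)} \times 1 = \frac{3 - \alpha/\beta}{6} \]
For $c_5$ we use m = 3: 
\[ c_5 = \frac{-[(\alpha/\beta) - 7]}{(4)(5)}\,c_3 = \frac{(7 - \alpha/\beta)(3 - \alpha/\beta)}{120} \]
α/β = 3 makes $c_3 = 0$ (and also $c_5$). α/β = 7 makes $c_5 = 0$, but not $c_3$. 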
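
The recursion relation (3-47) is easy to apply mechanically. The following sketch uses exact rational arithmetic; the function name `series_coefficients` and its arguments are illustrative choices, not notation from the text. It starts a series at $c_0 = 1$ or $c_1 = 1$ and applies Eq. (3-47) repeatedly for a chosen ratio α/β.

```python
from fractions import Fraction

def series_coefficients(ratio, start_index, count):
    """Apply Eq. (3-47) repeatedly, starting from c_start = 1.

    ratio       -- the value of alpha/beta
    start_index -- 0 for the even series, 1 for the odd series
    count       -- number of coefficients to generate
    Returns a dict {m: c_m} of exact fractions.
    """
    ratio = Fraction(ratio)
    coeffs = {start_index: Fraction(1)}
    m = start_index
    for _ in range(count - 1):
        coeffs[m + 2] = -(ratio - 2 * m - 1) / ((m + 1) * (m + 2)) * coeffs[m]
        m += 2
    return coeffs

odd = series_coefficients(7, 1, 4)    # alpha/beta = 7, as in Example 3-2
even = series_coefficients(5, 0, 4)   # alpha/beta = 5
print(odd)
print(even)
```

With α/β = 7 the odd series keeps a nonzero $c_3$ and vanishes from $c_5$ onward, in line with Example 3-2; with α/β = 5 the even series stops after $c_2$.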


3-4.F Preventing f(y) from Dominating the Asymptotic Behavior 


We now examine the asymptotic behavior of f(y). Recall that, at very large values 
of |y|, f(y) must become insignificant compared to exp(−βx²/2) = exp(−y²/2). We 
will show that f(y) fails to have this behavior if its power series expression is infinitely 
long. That is, we will show that f(y) behaves asymptotically like exp(y²), which 
dominates exp(−y²/2). 

We know that the series expression for exp(y²) is 

\[ \exp(y^2) = 1 + y^2 + \frac{y^4}{2!} + \frac{y^6}{3!} + \cdots + \frac{y^n}{(n/2)!} + \frac{y^{n+2}}{(n/2 + 1)!} + \cdots \tag{3-48} \]
The series for f(y) has terms 
\[ \cdots + c_n y^n + c_{n+2}\,y^{n+2} + c_{n+4}\,y^{n+4} + \cdots \tag{3-49} \]

The ratio between coefficients for two adjacent terms high up in the series (large n) for 
exp(y²) is, from Eq. (3-48) 

\[ \frac{\text{coeff for } y^{n+2}}{\text{coeff for } y^n} = \frac{(n/2)!}{[(n/2) + 1]!} = \frac{1}{(n/2) + 1} \xrightarrow{\ \text{large } n\ } \frac{2}{n} \tag{3-50} \]

and for f(y) it is, from Eqs. (3-49) and (3-47), 

\[ \frac{\text{coeff for } y^{n+2}}{\text{coeff for } y^n} = \frac{c_{n+2}}{c_n} = \frac{-(\alpha/\beta) + 1 + 2n}{n^2 + 3n + 2} \xrightarrow{\ \text{large } n\ } \frac{2}{n} \tag{3-51} \]

This means that, at large values of y, when the higher-order terms in the series dominate, 
f(y) behaves like exp(y²). Then 

\[ \lim_{y\to\infty}\psi(y) = \lim_{y\to\infty} f(y)\exp(-y^2/2) = \lim_{y\to\infty}\exp(y^2/2) = \infty \tag{3-52} \]


The asymptotic behavior of ψ is ruined. We can overcome this problem by requiring 
the series for f(y) to terminate at some finite power. In other words, f(y) must be 
a polynomial. This condition is automatically fulfilled if any one of the coefficients 
in a given series (odd or even) is zero since Eq. (3-47) guarantees that all the higher 
coefficients in that series will then vanish. Therefore, we require that some coefficient 
vanish: 

\[ c_{n+2} = 0 \tag{3-53} \]
Assuming that this is the lowest zero coefficient (i.e., $c_n \neq 0$), Eq. (3-47) gives 
\[ (\alpha/\beta) - 2n - 1 = 0 \tag{3-54} \]
or 

\[ \alpha = \beta(2n + 1) \tag{3-55} \]
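
A quick numerical look at the coefficient ratio makes the two cases concrete. In this sketch (the function name `coefficient_ratio` and the sample value α/β = 4.3 are arbitrary illustrative choices), a ratio α/β that is not an odd integer gives $c_{n+2}/c_n$ approaching 2/n, as in Eq. (3-51), whereas α/β = 2n + 1 makes the ratio vanish exactly at that n, which terminates the series.

```python
def coefficient_ratio(ratio, n):
    """Return c_{n+2} / c_n from Eq. (3-47) for a given alpha/beta ratio."""
    return -(ratio - 2 * n - 1) / ((n + 1) * (n + 2))

# Non-terminating case: alpha/beta is not of the form 2n + 1.
ratio = 4.3
for n in (10, 100, 1000, 10000):
    print(n, coefficient_ratio(ratio, n), 2 / n)

# Terminating case: alpha/beta = 2n + 1 with n = 2 gives c_4 / c_2 = 0.
print(coefficient_ratio(5, 2))
```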


3-4.G The Nature of the Energy Spectrum 


Now we have a recipe for producing acceptable solutions for the Schrödinger equation 
for the harmonic oscillator. If we desire a symmetric solution, we set $c_1 = 0$ and $c_0 = 1$. 
If we want the polynomial to terminate at $y^n$, we require that α and β be related as in 
Eq. (3-55). In this way we can generate an unlimited number of symmetric solutions, 
one for each even value of n that can be chosen for the terminal value. Similarly, an 
unlimited set of antisymmetric solutions results from setting $c_0 = 0$ and $c_1 = 1$ and 
allowing the highest contributing value of n to take on various odd integer values. 
(Example 3-2 provides an example by showing that $c_5$ becomes the first 
odd-power coefficient equal to zero if α/β = 7.) 

Since we now know that an acceptable solution satisfies Eq. (3-55), we can examine 
the energy spectrum. Substituting into Eq. (3-55) the expressions for α and β 
[Eqs. (3-16) and (3-17)], we obtain 

\[ 8\pi^2 mE/h^2 = (2\pi\sqrt{mk}/h)(2n + 1) \tag{3-56} \]

or 
\[ E = h\left(n + \tfrac{1}{2}\right)\left(\frac{1}{2\pi}\right)\sqrt{\frac{k}{m}} = \left(n + \tfrac{1}{2}\right)h\nu \tag{3-57} \]

where the classical definition of ν [Eq. (3-7)] has been used. 

This result shows that, whenever n increases by unity, the energy increases by hν, so 
the energy levels are evenly spaced as shown in Fig. 3-2. At absolute zero, the system 
will lose its energy to its surroundings insofar as possible. However, since n = 0 in the 
lowest permissible state for the system, there will remain a zero-point energy of ½hν. 

3-4.H Nature of the Wavefunctions 


The lowest energy solution corresponds to n = 0. This means that $c_0$ is the highest 
nonzero coefficient in the power series expansion for f(y). Hence, we must set $c_2 = 0$. 
(The odd-powered series coefficients are all zero for this case.) Thus, for n = 0, we can 
write the unnormalized wavefunction as [from Eq. (3-24)] 

\[ \psi_0 = c_0\exp(-y^2/2) = c_0\exp(-\beta x^2/2) \tag{3-58} \]

This is just a constant times a Gauss error function or simple “gaussian-type” function. 
This wavefunction is sketched in Fig. 3-2 and is obviously symmetric. 

The next solution has n = 1, $c_1 \neq 0$ but $c_3 = c_5 = \cdots = 0$. (The even-powered series 
coefficients are all zero for this case.) The unnormalized wavefunction for n = 1 is then 
\[ \psi_1(y) = c_1 y\exp(-y^2/2) \tag{3-59} \]

The exponential is symmetric and y is antisymmetric, and so their product, $\psi_1(y)$, is 
antisymmetric (Fig. 3-2). 

To get $\psi_2$ we need to use the recursion relation (3-47). We know that odd-index 
coefficients are all zero and that only $c_0$ and $c_2$ of the even-index coefficients are 
nonzero. Assuming $c_0 = 1$, we have [using Eq. (3-47) with m = 0] 

\[ c_2 = \frac{-[(\alpha/\beta) - 2\cdot 0 - 1]}{(1)(2)}\cdot 1 = \frac{-[(\alpha/\beta) - 1]}{2} \tag{3-60} \]
But the ratio α/β is determined by the requirement that $c_4 = 0$. Referring to Eq. (3-55), 
this gives (for n = 2) α/β = 5, and so 

\[ c_2 = -4/2 = -2 \tag{3-61} \]
and the unnormalized wavefunction is 


\[ \psi_2(y) = (1 - 2y^2)\exp(-y^2/2) \tag{3-62} \]

Polynomials like (1 − 2y²), which are solutions to the differential equation (3-33), 
are known as Hermite (her-meet) polynomials, $H_n(y)$. In addition to the recursion 
relation (3-47), which we have derived, other definitions are known. One of these 
involves successive differentiation: 

\[ H_n(y) = (-1)^n\exp(y^2)\,\frac{d^n\exp(-y^2)}{dy^n} \tag{3-63} \]
Thus, if we want $H_2(y)$, we just set n = 2 in Eq. (3-63) and evaluate that expression to 
get 

\[ H_2(y) = 4y^2 - 2 \tag{3-64} \]

which differs from our earlier result by a factor of −2. Yet another means of producing 
Hermite polynomials is by using the generating function 

\[ G(y, u) \equiv \exp[y^2 - (u - y)^2] = \sum_{n=0}^{\infty} (H_n(y)/n!)\,u^n \tag{3-65} \]

We use this expression as follows: 
1. Express the exponential in terms of its power series, writing down a few of the leading 
terms. There will exist, then, various powers of u and y and factorial coefficients. 


2. Collect together all the terms containing u². 


3. The coefficient for this term will be equal to $H_2(y)/2$. 


This is a fairly clumsy procedure for producing polynomials, but Eq. (3-65) is 
useful in determining general mathematical properties of these polynomials. For 
instance, Eq. (3-65) will be used in showing that the harmonic oscillator wavefunctions 
are orthogonal. 
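
The successive-differentiation definition, Eq. (3-63), can be carried out with plain polynomial arithmetic, because differentiating p(y) exp(−y²) gives (p′ − 2yp) exp(−y²). The sketch below is one possible implementation under that observation; the function names and the list-of-coefficients representation (index = power of y) are illustrative choices.

```python
def differentiate(poly):
    """Derivative of a polynomial given as ascending coefficients [a0, a1, ...]."""
    return [power * coeff for power, coeff in enumerate(poly)][1:] or [0]

def times_minus_2y(poly):
    """Multiply a polynomial by -2y."""
    return [0] + [-2 * coeff for coeff in poly]

def add(p, q):
    """Add two polynomials of possibly different length."""
    size = max(len(p), len(q))
    p = p + [0] * (size - len(p))
    q = q + [0] * (size - len(q))
    return [a + b for a, b in zip(p, q)]

def hermite(n):
    """H_n(y) from Eq. (3-63), returned as ascending integer coefficients."""
    poly = [1]                      # polynomial multiplying exp(-y^2)
    for _ in range(n):
        # d/dy [p exp(-y^2)] = (p' - 2 y p) exp(-y^2)
        poly = add(differentiate(poly), times_minus_2y(poly))
    sign = (-1) ** n
    return [sign * coeff for coeff in poly]

for n in range(6):
    print(n, hermite(n))
```

For n = 2 the result is the coefficient list for 4y² − 2, matching Eq. (3-64).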


3-4.I Orthogonality and Normalization 


We will now show that the harmonic oscillator wavefunctions are orthogonal, i.e., that 

\[ \int_{-\infty}^{+\infty}\psi_n(y)\,\psi_m(y)\,dy = \int_{-\infty}^{+\infty} H_n(y)H_m(y)\exp(-y^2)\,dy = 0, \qquad n \neq m \tag{3-66} \]

Consider the integral involving two generating functions and the exponential of −y²: 

\[ \int_{-\infty}^{+\infty} G(y, u)\,G(y, v)\exp(-y^2)\,dy
 = \sum_n\sum_m u^n v^m \underbrace{\int_{-\infty}^{+\infty}\frac{H_n(y)H_m(y)}{n!\,m!}\exp(-y^2)\,dy}_{c_{nm}} \tag{3-67} \]

where we label the integral $c_{nm}$ for convenience. The left-hand side of Eq. (3-67) may 
also be written as [using Eq. (3-65)] 

\[ \int_{-\infty}^{+\infty}\exp[-(y - u - v)^2]\exp(2uv)\,dy = \exp(2uv)\int_{-\infty}^{+\infty}\exp[-(y - u - v)^2]\,dy \tag{3-68} \]

However, we can add constants to the differential element without affecting the integral 
value, and u and v are constants when only y varies. Therefore, (3-68) becomes (see 
Appendix 1 for a table of useful integrals) 

\[ \begin{aligned}
 \exp(2uv)\int_{-\infty}^{+\infty}\exp[-(y - u - v)^2]\,d(y - u - v) &= \exp(2uv)\sqrt{\pi} \\
 &= \sqrt{\pi}\left(1 + 2uv + 4u^2v^2/2! + 8u^3v^3/3! + \cdots + 2^n u^n v^n/n! + \cdots\right)
\end{aligned} \tag{3-69} \]
This expression is equal to the right-hand side of Eq. (3-67). Comparing Eq. (3-67) 
with (3-69), we see that $c_{11} = 2\sqrt{\pi}$ since the term $u^1v^1$ is multiplied by $2\sqrt{\pi}$ in Eq. (3-69) 
and by $c_{11}$ in Eq. (3-67). Similarly $c_{22} = 4\sqrt{\pi}/2!$. But $c_{12} = 0$. Hence, we arrive at the 
result 

\[ c_{nm} = \int_{-\infty}^{+\infty}\frac{H_n(y)H_m(y)}{n!\,m!}\exp(-y^2)\,dy = \sqrt{\pi}\,(2^n/n!)\,\delta_{n,m} \tag{3-70} \]

($\delta_{n,m}$ is the “Kronecker” delta. It is a discontinuous function having a value of unity 
when n = m but zero when n ≠ m.) So 

\[ \int_{-\infty}^{+\infty}\psi_n(y)\,\psi_m(y)\,dy = \sqrt{\pi}\,m!\,2^n\,\delta_{n,m} \tag{3-71} \]

This proves the wavefunctions to be orthogonal and also provides us with a normalizing 
factor. Normality refers to integration in x, rather than in $y = \sqrt{\beta}\,x$, so we must change 
the differential element in Eq. (3-71): 

\[ \int_{-\infty}^{+\infty}\psi_n^2(y)\,dy = \sqrt{\beta}\int_{-\infty}^{+\infty}\psi_n^2(y)\,dx = \sqrt{\pi}\,2^n n! \tag{3-72} \]

Requiring that $\int_{-\infty}^{+\infty}\psi_n^2(y)\,dx = 1$ leads to the expression for the normalized 
wavefunctions: 

\[ \psi_n(y) = \left(\sqrt{\frac{\beta}{\pi}}\,\frac{1}{2^n n!}\right)^{1/2} H_n(y)\exp(-y^2/2), \qquad n = 0, 1, 2, \ldots \tag{3-73} \]

The first members of the set of Hermite polynomials are 

\[ \begin{aligned}
 &H_0(y) = 1, \quad H_1(y) = 2y, \quad H_2(y) = 4y^2 - 2, \quad H_3(y) = 8y^3 - 12y, \\
 &H_4(y) = 16y^4 - 48y^2 + 12, \quad H_5(y) = 32y^5 - 160y^3 + 120y
\end{aligned} \tag{3-74} \]
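
The orthogonality and normalization integrals can be verified exactly for small n and m. The sketch below builds $H_n$ from Eq. (3-63) and integrates term by term, using the standard Gaussian moment $\int_{-\infty}^{+\infty} y^{2k}\exp(-y^2)\,dy = \sqrt{\pi}\,(2k-1)!!/2^k$ (odd powers integrate to zero). That moment formula is a well-known result brought in here, not one derived in the text, and the function names are illustrative. All quantities are reported in units of √π so that exact fractions can be used.

```python
import math
from fractions import Fraction

def hermite(n):
    """H_n(y) as ascending coefficients, built from Eq. (3-63)."""
    poly = [1]
    for _ in range(n):
        deriv = [power * coeff for power, coeff in enumerate(poly)][1:]
        shifted = [0] + [-2 * coeff for coeff in poly]       # -2y * p
        deriv += [0] * (len(shifted) - len(deriv))
        poly = [a + b for a, b in zip(deriv, shifted)]
    return [(-1) ** n * coeff for coeff in poly]

def gaussian_moment_over_sqrt_pi(power):
    """Integral of y**power * exp(-y**2) over all y, divided by sqrt(pi)."""
    if power % 2:
        return Fraction(0)
    k = power // 2
    double_factorial = 1
    for j in range(1, 2 * k, 2):      # (2k - 1)!!
        double_factorial *= j
    return Fraction(double_factorial, 2 ** k)

def overlap_over_sqrt_pi(n, m):
    """Integral of H_n H_m exp(-y^2) dy, divided by sqrt(pi)."""
    total = Fraction(0)
    for i, a in enumerate(hermite(n)):
        for j, b in enumerate(hermite(m)):
            total += a * b * gaussian_moment_over_sqrt_pi(i + j)
    return total

for n in range(4):
    print([overlap_over_sqrt_pi(n, m) for m in range(4)])
```

The diagonal entries reproduce $2^n n!$ and the off-diagonal entries vanish, in agreement with Eqs. (3-70) to (3-72).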




3-4.J Summary of Solution of Harmonic-Oscillator 
Schródinger Equation 


The detailed solution is so long that the reader may have lost the broad outline. 
The basic steps were: 


1. Determine the asymptotic behavior of the Schrödinger equation and of $\psi$. This
produces a gaussian factor $\exp(-y^2/2)$ times a function of $y$, $f(y)$.


2. Obtain a differential equation for the rest of the wavefunction, f(y). 


3. Represent f(y) as a power series in y, and find a recursion relation for the coefficients 
in the series. The symmetries of the wavefunctions are linked to the symmetries of 
the series. 


4. Force the power series to be finite (i.e., polynomials) so as not to spoil the asymptotic
behavior of the wavefunctions. This leads to a relation between $\alpha$ and $\beta$ that produces
uniformly spaced, quantized energy levels.


5. Recognize the polynomials as being Hermite polynomials, and utilize some of the 
known properties of these functions to establish orthogonality and normalization 
constants for the wavefunctions. 


EXAMPLE 3-3 Which of the following expressions are, by inspection, unacceptable
eigenfunctions for the Schrödinger equation for the one-dimensional harmonic
oscillator?


a) $(64y^6 - 480y^4 + 720y^2 - 120)\exp(y^2/2)$
b) $(64y^6 - 480y^5 + 720y^3 - 120)\exp(-y^2/2)$
c) $(64y^6 - 480y^4 + 720y^2 - 120)\exp(-y^2/2)$


SOLUTION » a) is unacceptable because $\exp(y^2/2)$ blows up at large $|y|$.
b) is unacceptable because the polynomial contains terms of both even and odd powers.
c) is acceptable.


3-5 Quantum-Mechanical Average Value of
the Potential Energy

We showed in Section 3-2 that the classical harmonic oscillator stores, on the average,
half of its energy as kinetic energy, and half as potential. We now make the analogous
comparison in the quantum-mechanical system for the ground ($n = 0$) state.
The wavefunction is


$$\psi_0(x) = (\beta/\pi)^{1/4}\exp(-\beta x^2/2) \tag{3-75}$$


and the probability distribution of the particle along the $x$ coordinate is given by $\psi_0^2(x)$.
The total energy is constant and equal to 


$$E_0 = \tfrac{1}{2}h\nu = (h/4\pi)\sqrt{k/m} \tag{3-76}$$

and the potential energy as a function of $x$ is

$$V(x) = \tfrac{1}{2}kx^2 \tag{3-77}$$


The probability for finding the oscillating particle in the line element $dx$ around some
point $x_1$ is $\psi_0^2(x_1)\,dx$, since $\psi_0$ of Eq. (3-75) is normalized. Hence, the average value
for the potential energy is simply the sum of all the potential energies due to all the
elements $dx$, each weighted by the probability for finding the particle there:


$$\bar{V} = \int_{-\infty}^{+\infty}[\text{prob. to be in } dx]\,[V \text{ at } dx]\,dx = \int_{-\infty}^{+\infty}\psi_0^2(x)\,V(x)\,dx \tag{3-78}$$


This is 


$$\bar{V} = (\beta/\pi)^{1/2}\,\tfrac{1}{2}k\int_{-\infty}^{+\infty}\exp(-\beta x^2)\,x^2\,dx = k/4\beta \tag{3-79}$$


where we have referred to Appendix 1 to evaluate the integral. Using the definition of
$\beta^2$ [Eq. (3-17)] we have


$$\bar{V} = k/4\beta = (k/4)\cdot h/(2\pi\sqrt{mk}) = (h/8\pi)\sqrt{k/m} \tag{3-80}$$


which is just one half of the total energy [Eq. (3-76)]. This means that the average
value of the kinetic energy must also equal half of the total energy, since $\bar{V} + \bar{T} = E$.
We thus arrive at the important result that the ratio of average potential and kinetic 
energies is the same in the classical harmonic oscillator and the ground state of the 
quantum-mechanical system. This result is also true for the higher states. For other 
kinds of potential, the storage need not be half and half, but whatever it is, it will be the 
same for the classical and quantum-mechanical treatments of the system. We discuss 
this point in more detail later when we examine the virial theorem (Chapter 11 and 
Appendix 8). 
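As a numerical cross-check of Eqs. (3-78) to (3-80), the average potential energy can be integrated directly. This is a minimal sketch under stated assumptions: $\beta$ and $k$ are treated as free positive numbers in arbitrary units, the integral is truncated at $|x| = 12/\sqrt{\beta}$ (where the gaussian is negligible), and a simple midpoint rule is used. The function name and the step count are illustrative choices.

```python
import math


def average_potential(beta, k, steps=20000):
    """Midpoint-rule estimate of the integral of psi_0^2(x) * (k x^2 / 2),
    with psi_0^2(x) = sqrt(beta/pi) * exp(-beta x^2)."""
    x_max = 12.0 / math.sqrt(beta)     # gaussian tail is negligible beyond this
    dx = 2.0 * x_max / steps
    norm = math.sqrt(beta / math.pi)
    total = 0.0
    for i in range(steps):
        x = -x_max + (i + 0.5) * dx
        total += norm * math.exp(-beta * x * x) * 0.5 * k * x * x * dx
    return total


if __name__ == "__main__":
    beta, k = 2.0, 3.0
    numeric = average_potential(beta, k)
    analytic = k / (4.0 * beta)        # Eq. (3-79)
    print(numeric, analytic)
```

Since $1/\beta = h/(2\pi\sqrt{mk})$, the total ground-state energy of Eq. (3-76) equals $k/2\beta$ in these units, so the printed value is half of it.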


3-6 Vibrations of Diatomic Molecules


Two atoms bonded together vibrate back and forth along the internuclear axis. The
standard first approximation is to treat the system as two nuclear masses, $m_1$ and $m_2$,
oscillating harmonically with respect to the center of mass. The force constant $k$ is
determined by the “tightness” of the bond, with stronger bonds usually having larger $k$.

The two-mass problem can be transformed to motion of one reduced mass, $\mu$, vibrating
harmonically with respect to the center of mass. $\mu$ is equal to $m_1m_2/(m_1 + m_2)$.
The force constant for the vibration of the reduced mass remains identical to the force
constant for the two masses, and the distance of the reduced mass from the center
of mass remains identical to the distance between $m_1$ and $m_2$. Thus we have a very
convenient simplification: We can use the harmonic oscillator solutions for a single
oscillating mass $\mu$ as solutions for the two-mass problem. All of the wavefunctions and
energy formulas are just what we have already seen except that $m$ is replaced by $\mu$. The
practical consequence of this is that we can use the spectroscopically measured energy
spacings between molecular vibrational levels to obtain the value of $k$ for a molecule.

EXAMPLE 3-4 There is a strong absorption in the infrared spectrum of H$^{35}$Cl at
2992 cm$^{-1}$, which corresponds to an energy of $5.941 \times 10^{-20}$ J. This light energy,
$E$, is absorbed in order to excite HCl from the $n = 0$ to the $n = 1$ vibrational state.
What is the value of $k$, the force constant, in HCl?


SOLUTION » The vibrational spacing $h\nu$ must be equal to $5.941 \times 10^{-20}$ J. We know that,
since $\mu$ replaces $m$, $\nu = (1/2\pi)\sqrt{k/\mu}$, which means that $k = 4\pi^2E^2\mu/h^2$. The formula for $\mu$
is $m_{\mathrm{H}}m_{\mathrm{Cl}}/(m_{\mathrm{H}} + m_{\mathrm{Cl}}) = 1.614 \times 10^{-27}$ kg. It follows that $k$ is equal to
$4\pi^2(5.941 \times 10^{-20}\ \mathrm{J})^2(1.614 \times 10^{-27}\ \mathrm{kg})/(6.626 \times 10^{-34}\ \mathrm{J\,s})^2 = 512\ \mathrm{N\,m^{-1}}$.
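The arithmetic of Example 3-4 is easy to script, which helps when the same calculation is repeated for several molecules. This is a minimal sketch: it assumes integer mass numbers 1 and 35 for H and Cl (an assumption that reproduces the quoted reduced mass of $1.614 \times 10^{-27}$ kg) and an atomic mass unit of $1.6605 \times 10^{-27}$ kg. The function names are illustrative.

```python
import math

PLANCK_H = 6.626e-34   # J s, value used in the example
AMU_KG = 1.6605e-27    # kg per atomic mass unit (assumed conversion factor)


def reduced_mass(m1, m2):
    """Reduced mass m1*m2/(m1+m2), in the same units as the inputs."""
    return m1 * m2 / (m1 + m2)


def force_constant(energy_joule, mu_kg):
    """k = 4 pi^2 E^2 mu / h^2, where E = h*nu is the level spacing."""
    return 4.0 * math.pi ** 2 * energy_joule ** 2 * mu_kg / PLANCK_H ** 2


if __name__ == "__main__":
    mu = reduced_mass(1.0, 35.0) * AMU_KG   # integer mass numbers: an assumption
    k = force_constant(5.941e-20, mu)
    print(f"mu = {mu:.4e} kg, k = {k:.0f} N/m")
```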


3-7 Summary


In this chapter we have discussed the following points: 


1. The energies for the quantum-mechanical harmonic oscillator are given by the
formula $E_n = (n + 1/2)h\nu$, $n = 0, 1, 2, \ldots$, where $\nu = (1/2\pi)\sqrt{k/m}$. This gives
nondegenerate energy levels separated by equal intervals ($h\nu$) and a zero-point
energy of $h\nu/2$.


2. The wavefunctions for this system are symmetric or antisymmetric for reflection
through $x = 0$. This symmetry alternates as $n$ increases and is related to the presence
of even or odd powers of $y$ in the Hermite polynomial in $\psi$.


3. Each wavefunction is orthogonal to all of the others, even in cases where the
symmetries are the same.


4. The harmonic oscillator wavefunctions differ from particle-in-a-box wavefunctions
in two important ways: They penetrate past the classical turning points (i.e., past
the values of $x$ where $E = V$), and they have larger distances available to them as a
result of the opening out of the parabolic potential function at higher energies. This
gives them more room in which to accomplish their increasing number of wiggles
as $n$ increases, and so the energy does not rise as quickly as it otherwise would.


5. The manner in which the total energy is partitioned into average potential and kinetic
parts is the same for classical and quantum-mechanical harmonic oscillators, namely,
half and half.


6. Vibrations in molecules are usually approximately harmonic. Mass is replaced by
reduced mass in the energy formula. Measuring the energy needed to excite a
molecular vibration allows one to calculate the harmonic force constant for that
particular vibrational mode.


3-7.A Problems 


3-1. From the equation of motion (3-5) show that the classical distribution function is
proportional to $(1 - x^2/L^2)^{-1/2}$.


3-2. A classical harmonic oscillator with mass of 1.00 kg and operating with a force
constant of 2.00 kg s$^{-2}$ = 2.00 J m$^{-2}$ is released from rest at $t = 0$ and $x =
0.100$ m.

a) What is the function $x(t)$ describing the trajectory of the oscillator?
b) Where is the oscillating mass when $t = 3$ seconds?
c) What is the total energy of the oscillator?
d) What is the potential energy when $t = 3$ seconds?
e) What is the time-averaged potential energy?
f) What is the time-averaged kinetic energy?
g) How fast is the oscillator moving when $t = 3$ seconds?
h) Where are the turning points for the oscillator?
i) What is the frequency of the oscillator?


3-3. Find the expression for the classical turning points for a one-dimensional
harmonic oscillator in terms of $n$, $m$, $h$, and $k$.


3-4. a) Equation (3-73) for $\psi_n(y)$ is a rather formidable expression. It can be broken
down into three portions, each with a certain purpose. Identify the three parts
and state the role that each plays in meeting the mathematical requirements
on $\psi$.
b) Produce expressions for the normalized harmonic oscillator wavefunctions with $n = 0, 1, 2$.


3-5. Operate explicitly on $\psi_0$ with $\hat{H}$ and show that $\psi_0$ is an eigenfunction having
eigenvalue $h\nu/2$.


3-6. a) At what values of $y$ does $\psi_2$ have a node?
b) At what values of $y$ does $\psi_2$ have its maximum value?


3-7. Give a simple reason why $(2 + y - 3y^2)\exp(-y^2/2)$ cannot be a satisfactory
wavefunction for the harmonic oscillator. What about $2y\exp(+y^2/2)$? You
should be able to answer by inspection, without calculation and without reference
to tabulations.


3-8. Consider the function $(32x^5 - 160x^3 + 120x)\exp(-x^2/2)$.

a) How does this function behave at large values of $\pm x$? Explain your answer.
b) What can you say about the symmetry of this function?
c) What are the value and slope of this function at $x = 0$?


3-9. Let $f(x) = 3\cos x + 4$. $f(x)$ is expressed as a power series in $x$: $f(x) = \sum_n c_n x^n$,
with $n = 0, 1, 2, \ldots$.

a) What is the value of $c_0$?
b) What is the value of $c_1$?


3-10. Only one of the following is $H_5(y)$, a Hermite polynomial. Which ones are not,
and why?

a) $16y^5 + 130y$
b) $24y^5 - 110y^3 + 90y - 18$
c) $32y^5 - 160y^3 + 120y$
3-11. Sketch the function $(1 - 2x^2)\exp(-x^2)$ versus $x$.


3-12. Given that $\int_0^\infty x\exp(-x^2)\,dx = 1/2$, and $\int_0^\infty x^2\exp(-x^2)\,dx = \sqrt{\pi}/4$, evaluate

a) $\int_{-\infty}^{+\infty} x\exp(-x^2)\,dx$
b) $\int_{-\infty}^{+\infty} x^2\exp(-x^2)\,dx$


3-13. Evaluate $\int_{-\infty}^{+\infty}(x + 4x^3)\exp(-5x^2)\,dx$.


3-14. For the $n = 1$ state of the harmonic oscillator:

a) Calculate the values of the classical turning points.
b) Calculate the values of the positions of maximum probability density.
c) What is the probability for finding the oscillator between $x = 0$ and $x = \infty$?
d) Estimate the probability for finding the oscillator in a line increment $\Delta x$ equal
to 1% of the distance between classical turning points and centered on one of
the positions of maximum probability density.


3-15. Calculate the probability for finding the ground state harmonic oscillator beyond
its classical turning points.


3-16. Use the differential expression (3-63) for Hermite polynomials to produce $H_2(y)$.


3-17. Use the generating function (3-65) to produce $H_2(y)$.


3-18. Write down the Schrödinger equation for a three-dimensional (isotropic)
harmonic oscillator. Separate variables. What will be the zero-point energy for
this system? What will be the degeneracy of the energy level having a value of
$(9/2)h\nu$? $(5/2)h\nu$? Sketch (roughly) each of the solutions for the latter case
and note their similarity to case (b) in Fig. 2-16.


3-19. Suppose $V(x) = (1/2)kx^2$ for $x \geq 0$, and $\infty$ for $x < 0$. What can you say about
the eigenfunctions and eigenvalues for this system?


3-20. a) Evaluate $H_3(x)$ at $x = 2$.
b) Evaluate H(sin $\theta$) at $\theta = 30°$.


3-21. What is the average potential energy for a harmonic oscillator when $n = 5$? What
is the average kinetic energy?


3-22. Each degree of translational or rotational freedom can contribute up to $\tfrac{1}{2}R$ to the
molar heat capacity of an ideal diatomic gas, whereas the vibrational degree of
freedom can contribute up to $R$. Explain.


3-23. Calculate the force constants for vibration in H$^{19}$F, H$^{35}$Cl, H$^{81}$Br, and H$^{127}$I,
given that the infrared absorptions for the $n = 0$ to $n = 1$ transitions are seen,
respectively, at 4138, 2991, 2649, and 2308 cm$^{-1}$. What do these force constants
imply about the relative bond strengths in these molecules?
Multiple Choice Questions
(Try to answer these without referring to the text.) 


1. Which one of the following statements conflicts with the quantum mechanical results
for a one-dimensional harmonic oscillator?


a) The smaller the mass of the oscillating particle, the greater will be its zero-point 
energy, for a fixed force constant. 

b) The frequency is the same as that of a classical oscillator with the same mass and 
force constant. 

c) Increasing the force constant increases the spacing between adjacent energy 
levels. 

d) The spacing between adjacent energy levels is unaffected as the vibrational quan- 
tum number increases. 

e) The vibrational potential energy is a constant of motion. 


2. A quantum-mechanical harmonic oscillator 


a) spends most of its time near its classical turning points in its lowest-energy state. 
b) has $\psi = 0$ at its classical turning points.

c) has doubly degenerate energy levels. 

d) has energy levels that increase with the square of the quantum number. 

e) None of the above is a correct statement. 


3. Light of wavelength $4.33 \times 10^{-6}$ m excites a quantum-mechanical harmonic oscillator
from its ground to its first excited state. Which one of the following wavelengths
would accomplish this same transition if i) the force constant only was doubled? ii)
the mass only was doubled?


a) $4.33 \times 10^{-6}$ m
b) $2.16 \times 10^{-6}$ m
c) $3.06 \times 10^{-6}$ m
d) $6.12 \times 10^{-6}$ m
e) $8.66 \times 10^{-6}$ m


4. For a classical harmonic oscillator, the probability for finding the oscillator in the
middle 2% of the oscillation range is


a) greater than 0.02. 

b) equal to 0.02. 

c) less than 0.02. 

d) unknown since it depends on the force constant. 
e) unknown since it depends on the amplitude. 


Chapter 4 


The Hydrogenlike Ion, Angular
Momentum, and the Rigid Rotor


4-1 The Schrödinger Equation and the Nature
of Its Solutions


4-1.A The Schrödinger Equation


Consider the two-particle system composed of an electron (charge $-e$) and a nucleus
having atomic number $Z$ and charge $Ze$. (See Appendix 12 for values of physical
constants, such as $e$.) Let $x_1, y_1, z_1$ be the coordinates of the nucleus and $x_2, y_2, z_2$
be those for the electron. The distance between the particles is, then,
$[(x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 - z_2)^2]^{1/2}$. The potential energy is given by the product of the charges
divided by the distance between them. If $e$ is expressed in coulombs, C, the potential
energy in joules is

$$V = \frac{-Ze^2}{4\pi\varepsilon_0\,[(x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 - z_2)^2]^{1/2}} \tag{4-1}$$


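where $\varepsilon_0$ is the vacuum permittivity ($8.8542 \times 10^{-12}$ J$^{-1}$ C$^2$ m$^{-1}$). The
time-independent Schrödinger equation for this system is


$$\left[\frac{-h^2}{8\pi^2 M}\left(\frac{\partial^2}{\partial x_1^2} + \frac{\partial^2}{\partial y_1^2} + \frac{\partial^2}{\partial z_1^2}\right) - \frac{h^2}{8\pi^2 m_e}\left(\frac{\partial^2}{\partial x_2^2} + \frac{\partial^2}{\partial y_2^2} + \frac{\partial^2}{\partial z_2^2}\right) - \frac{Ze^2}{4\pi\varepsilon_0\,[(x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 - z_2)^2]^{1/2}}\right]\psi(x_1, y_1, z_1, x_2, y_2, z_2)$$
$$= E\,\psi(x_1, y_1, z_1, x_2, y_2, z_2) \tag{4-2}$$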


where $M$ and $m_e$ are the masses of the nucleus and electron, respectively. The
hamiltonian operator in brackets in Eq. (4-2) has three terms, corresponding to a
kinetic energy operator for the nucleus, a kinetic energy operator for the electron, and
a potential term for the pair of particles.

Equation (4-2) has eigenfunctions $\psi$ that are dependent on the positions of both the
electron and the nucleus. It is possible to convert to center-of-mass coordinates and
then to separate Eq. (4-2) into two equations, one for the motion of the center of mass
and one for a particle of reduced mass moving around a fixed center to which it is
attracted in exactly the same way the electron is attracted to the nucleus. Because this
conversion is rather tedious, we will not perform it in this book,¹ but merely discuss the
results. The first of the two resulting equations treats the center of mass as a free particle
moving through field-free space; its eigenvalues are simply translational energies of
the ion. For us, the interesting equation is the second one, which is


$$\left[\frac{-h^2}{8\pi^2\mu}\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}\right) - \frac{Ze^2}{4\pi\varepsilon_0\,(x^2 + y^2 + z^2)^{1/2}}\right]\psi(x, y, z) = E\,\psi(x, y, z) \tag{4-3}$$


The quantity $\mu$ is the reduced mass for the particle in our center-of-mass system, and
is given by


$$\mu = m_e M/(m_e + M) \tag{4-4}$$


The coordinates x, y, and z are the coordinates of the reduced-mass particle with respect 
to the center of mass of the system. 

Even without going through the detailed procedure of converting to center-of-mass
coordinates, we can show that Eq. (4-3) makes sense. In the idealized case in which $M$
is infinitely greater than $m_e$, $\mu$ equals $m_e$, and Eq. (4-3) becomes just the Schrödinger
equation for the motion of an electron about a fixed nucleus at the coordinate origin.
For real atoms or ions this would not be a bad approximation because, even in the
case of the lightest nucleus (i.e., the hydrogen atom), $M$ is nearly 2000 times $m_e$, and
so $\mu$ is very close to $m_e$, and the center of mass is very near the nucleus. Therefore,
the result of using center-of-mass coordinates to separate the Schrödinger equation is
almost identical to making the assumption at the outset that the nucleus is fixed, and
simply writing down the one-particle Schrödinger equation:


$$\left[\frac{-h^2}{8\pi^2 m_e}\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}\right) - \frac{Ze^2}{4\pi\varepsilon_0\,(x^2 + y^2 + z^2)^{1/2}}\right]\psi(x, y, z) = E\,\psi(x, y, z) \tag{4-5}$$


The use of $m_e$ instead of $\mu$ [i.e., Eq. (4-5) instead of (4-3)] has no effect on the
qualitative nature of the solutions. However, it does produce small errors in eigenvalues,
errors that are significant in the very precise measurements and calculations of atomic
spectroscopy (Problem 4-1). In what follows we shall use $\mu$, but for purposes of
discussion we will pretend that the nucleus and center of mass coincide.

Equation (4-3) can be transformed into spherical polar coordinates. (Some important 
relationships between spherical polar and Cartesian coordinates are given in Fig. 4-1). 
The result is 


$$[-(h^2/8\pi^2\mu)\nabla^2 - (Ze^2/4\pi\varepsilon_0 r)]\,\psi(r, \theta, \phi) = E\,\psi(r, \theta, \phi) \tag{4-6}$$


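Here $\nabla^2$ is understood to be in spherical polar coordinates. In these coordinates it
looks quite complicated:²


$$\nabla^2 = \frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial}{\partial r}\right) + \frac{1}{r^2\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\,\frac{\partial}{\partial\theta}\right) + \frac{1}{r^2\sin^2\theta}\frac{\partial^2}{\partial\phi^2} \tag{4-7}$$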


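1 See, for example, Eyring et al. [1, Chapter VI] or Levine [2, pp. 127-130].
2 See, e.g., Eyring et al. [1, Appendix III].


[Figure 4-1: Cartesian axes $x$, $y$, $z$ with a point located by $r$, $\theta$, $\phi$, annotated with the relations below.]

$$x = r\sin\theta\cos\phi, \qquad y = r\sin\theta\sin\phi, \qquad z = r\cos\theta$$
$$-\infty < x, y, z < \infty; \qquad 0 \le r < \infty; \qquad 0 \le \theta \le \pi; \qquad 0 \le \phi \le 2\pi$$
$$dv = r^2\sin\theta\,dr\,d\theta\,d\phi$$

Figure 4-1 » The spherical polar coordinate system. The angle $\phi$ is called the azimuthal angle.
Notice that the differential volume element is not equal to $dr\,d\theta\,d\phi$ and that the ranges of values for
$r$, $\theta$, $\phi$ are not $-\infty$ to $+\infty$.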


However, this coordinate system is the natural one for this system and leads to the
easiest solution despite this rather formidable looking operator.

Notice that the potential term, $-Ze^2/4\pi\varepsilon_0 r$, has no $\theta$ or $\phi$ dependence. The potential
is spherically symmetric. However, $\theta$ and $\phi$ dependence does enter the hamiltonian
through $\nabla^2$, so the eigenfunctions $\psi$ may be expected to show angular dependence.

Next we describe the solutions of the Schrödinger equation (4-6), relegating to later
sections the mathematical details of how the solutions are obtained.


EXAMPLE 4-1 Using the spherical polar coordinate system of Fig. 4-1, calculate
the volume occupied by the skin of a spherical shell, where the inside radius of the
skin is 100.0 mm and the thickness of the skin is 1.000 mm.


SOLUTION » One way to solve this problem is to calculate the volume inside the entire sphere,
including the skin, and then to subtract the volume of the sphere occupying the space inside the
skin. The formula for the volume of a sphere of radius $r$ can be calculated from $dv$ by integrating
$r$ from 0 to $r$, $\theta$ from 0 to $\pi$, and $\phi$ from 0 to $2\pi$:


$$V = \int_0^r r^2\,dr\int_0^{\pi}\sin\theta\,d\theta\int_0^{2\pi}d\phi = \frac{r^3}{3}\bigg|_0^r\cdot(-\cos\theta)\Big|_0^{\pi}\cdot\phi\Big|_0^{2\pi} = \frac{r^3}{3}\,\bigl(-(-1 - 1)\bigr)\cdot 2\pi = \frac{4}{3}\pi r^3$$


(You presumably already knew this formula, but it is useful to review how it comes out of this
integration.) Proceeding,

$$V_{\text{skin}} = V(r = 101\ \text{mm}) - V(r = 100\ \text{mm}) = \frac{4}{3}\pi\left[(101.0\ \text{mm})^3 - (100.0\ \text{mm})^3\right] = 1.269 \times 10^5\ \text{mm}^3$$


Another way (slightly less exact) to approximate this volume is to calculate the area of the spherical
shell ($4\pi r^2$) and multiply by its thickness:


$$V \approx 4\pi r^2\,\Delta r = 4\pi(100\ \text{mm})^2\cdot 1.00\ \text{mm} = 1.257 \times 10^5\ \text{mm}^3$$


For a skin whose thickness is small compared to its radius, we see that this approximation is
very good.
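The two estimates in Example 4-1 can be compared for any radius and thickness with a few lines of code. This is a minimal sketch; the function names are illustrative and the inputs are simply lengths in consistent units.

```python
import math


def sphere_volume(r):
    """Volume of a sphere of radius r, (4/3) pi r^3."""
    return 4.0 / 3.0 * math.pi * r ** 3


def shell_volume_exact(r_inner, thickness):
    """Difference of two sphere volumes."""
    return sphere_volume(r_inner + thickness) - sphere_volume(r_inner)


def shell_volume_thin(r_inner, thickness):
    """Thin-shell approximation: surface area times thickness."""
    return 4.0 * math.pi * r_inner ** 2 * thickness


if __name__ == "__main__":
    exact = shell_volume_exact(100.0, 1.0)   # mm^3
    thin = shell_volume_thin(100.0, 1.0)     # mm^3
    print(f"exact = {exact:.4e}, thin-shell = {thin:.4e}, "
          f"relative difference = {(exact - thin) / exact:.3%}")
```

The relative difference shrinks roughly in proportion to thickness divided by radius, which is why the approximation is reused for thin shells of electronic charge.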
4-1.B The Nature of the Eigenvalues


The potential energy, $-Ze^2/4\pi\varepsilon_0 r$, becomes negatively infinite when $r = 0$ and
approaches zero as $r$ becomes very large. This potential is sketched in Fig. 4-2 for the
case in which $Z = 1$. We expect the energy levels to diverge less rapidly here than was
the case for the harmonic oscillator since the “effective box size” increases more rapidly
with increasing energy in this case than in the harmonic oscillator case. (See Fig. 2-3
for the one-dimensional analogs.) Since the harmonic oscillator levels are separated by
a constant ($h\nu$, for one- or three-dimensional cases), the hydrogenlike ion levels should
converge at higher energies. Figure 4-2 shows that this is indeed the case. Furthermore,
by analogy with the case of the particle in a box with one finite wall, we expect the
allowed energies to form a discrete set for the classically trapped electron ($E < 0$)
and a continuum for the unbound cases ($E > 0$). Thus, the spectrum of eigenvalues
sketched in Fig. 4-2 is in qualitative accord with understandings developed earlier.
The lowest allowed energy for the system is far above the low-energy limit ($-\infty$) of
the potential well. This corresponds to the finite zero point energy which we have seen
in other systems where particle motion is constrained. Here it means that, at absolute
zero, the electron does not come to rest on the nucleus (which would give $T = 0$,
$V = -\infty$, $E = -\infty$), but rather continues to move about with a finite total energy.

Figure 4-2 » The potential function $V = -e^2/4\pi\varepsilon_0 r$ with eigenvalues superimposed (dashed
lines). Degeneracies for the first few levels are noted on the right.

All of the energy levels of Fig. 4-2 are degenerate except for the lowest one. The
order of the degeneracy is listed next to each of the lowest few levels in Fig. 4-2.
This degeneracy is not surprising since we are dealing here with a three-dimensional
system, and we have earlier seen that, in such cases (e.g., the cubic box), the physical
equivalence of different directions in space can produce degeneracies (called “spatial
degeneracies”). We shall see later that some of the degeneracies in this system do
indeed result from directional equivalence (here, spherical symmetry), whereas others
do not.

The discrete, negative eigenvalues are given by the formula 


$$E_n = \frac{-\mu Z^2 e^4}{8\varepsilon_0^2 h^2 n^2} = (-13.6058\ \text{eV})\,\frac{Z^2}{n^2} \tag{4-8}$$

EXAMPLE 4-2 Calculate the ionization energy ($IE$) of C$^{5+}$ in its ground state, in
electron volts.


SOLUTION » The ionization energy equals the negative of the ground state energy. $Z = 6$ and
$n = 1$, so $IE = (13.6058\ \text{eV})\,\frac{36}{1} = 489.808\ \text{eV}$.
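The scaling used in Example 4-2, $(13.6058\ \text{eV})\,Z^2/n^2$, is simple enough to tabulate. The following is a minimal sketch that takes the 13.6058 eV figure from the example as given and ignores any reduced-mass correction between different nuclei; the function names are illustrative.

```python
RYDBERG_EV = 13.6058   # eV, the value quoted in Example 4-2


def level_energy_ev(Z, n):
    """Energy of level n of a hydrogenlike ion with nuclear charge Z, in eV."""
    return -RYDBERG_EV * Z ** 2 / n ** 2


def ionization_energy_ev(Z, n=1):
    """Energy needed to remove the electron from level n (negative of E_n)."""
    return -level_energy_ev(Z, n)


if __name__ == "__main__":
    print(f"IE of the Z = 6 ion: {ionization_energy_ev(6):.3f} eV")
    # Spacings between adjacent levels shrink as n grows (the levels converge).
    for n in range(1, 5):
        gap = level_energy_ev(1, n + 1) - level_energy_ev(1, n)
        print(n, n + 1, f"{gap:.4f} eV")
```

The shrinking gaps printed by the loop illustrate the convergence of levels toward $E = 0$ described in Section 4-1.B.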


4-1.C The Lowest-Energy Wavefunction 


We will now discuss the lowest-energy eigenfunction of Eq. (4-6) in some detail, since
an understanding of atomic wavefunctions is crucial in quantum chemistry. The
derivation of formulas for this and other wavefunctions will be discussed in later sections,
but it is not necessary to labor through the mathematical details of the exact solution of
Eq. (4-6) to be able to understand most of the essential features of the eigenfunctions.
The formula for the normalized, lowest-energy solution of Eq. (4-6) is


$$\psi(r) = (1/\sqrt{\pi})\,(Z/a_0)^{3/2}\exp(-Zr/a_0) \tag{4-9}$$


where $a_0 = 5.2917706 \times 10^{-11}$ m (called the Bohr radius) and $Ze$ is the nuclear charge.
A sketch of $\psi$ versus $r$ for $Z = 1$ is superimposed on the potential function in Fig. 4-3a.
It is apparent that the electron penetrates the potential barrier (Problem 4-3).

The square of the wavefunction (4-9) tells us how the electron is distributed about
the nucleus. In Fig. 4-3b is plotted $\psi^2(r)$ as a function of $r$. We refer to $\psi^2$ as the
electron probability density function. In this case, the probability density is greatest at
the nucleus ($r = 0$) and decays to zero as $r$ becomes infinite.

It is important for the chemist to be able to visualize the electron distributions, or 
charge “clouds,” in atoms and molecules, and various methods of depicting electron 
distributions have been devised. In Fig. 4-3 a few of these are presented for the lowest- 
energy wavefunction. The dot picture (Fig. 4-3c) represents what one would expect 
if one took a multiflash photograph of a magnified, slowed-down hydrogenlike ion 
(assuming no disturbance of the ion by the photographing process). Each dot represents 
an instantaneous electron position, and the density of these dots is greatest at the nucleus.




Figure 4-3 » (a) H-atom wavefunction superposed on $-e^2/r$ potential curve. (b) Wavefunction
squared for H and He$^+$. (c) Dot picture of electron distribution. (d) Contour diagram of electron
distribution. (e) Computer-generated graphic version of (d). (f) Single-contour representation of
electron distribution.


An alternative way of picturing the charge is to draw a contour diagram, each contour
indicating that the density has increased or decreased by a certain amount (Fig. 4-3d).
A more striking version of the contour plot for $\psi^2$ is shown in Fig. 4-3e. Perhaps the
simplest representation is a sketch of the single contour that encloses a certain amount
(say 90%) of the electronic charge (Fig. 4-3f). (We have been describing the electron
as a point charge moving rapidly about the nucleus. However, for most purposes it is
just as convenient to picture the electron as being smeared out into a charge cloud like
those sketched in Fig. 4-3. Thus, the statements “the electron spends 90% of its time
inside this surface” and “90% of the electronic charge is contained within this surface”
are equivalent.)

The multiflash “photograph” sketched in Fig. 4-3c shows the electron probability
density to be greatest at the nucleus. Suppose that we were just to take a single flash
photograph. Then the electron would appear as a single dot. At what distance from the
nucleus would this dot be most likely to occur? Surprisingly, the answer is not zero.
Despite the fact that the probability density is a maximum at $r = 0$, the probability for
finding the electron in a volume element at the nucleus approaches zero. This is because
the probability density, $\psi^2(r)$, is the measure of the probability per unit volume for the
electron being at various distances from the nucleus. When we compare a tiny volume
element near the nucleus with an identical one farther out, we see from Fig. 4-3 that
there is indeed more likelihood for the electron being in the volume element nearer the
nucleus. But there are more volume elements associated with the larger distance. (The
number of identical volume elements varies as the area of the surface of the sphere,
$4\pi r^2$.) Hence, the probability for the electron being in a radial element $dr$ at a given
distance $r$ from the nucleus is given by the number of volume elements at $r$ times the
probability density per unit volume element. The reason for the near-zero probability
for finding the electron in a volume element at the nucleus is that the number of volume
elements associated with $r = 0$ is vanishingly small compared to the number associated
with larger $r$ values. Figure 4-4 is a graph of $4\pi r^2\psi^2$, the volume-weighted probability
density. It is apparent that the most probable value of $r$, $r_{mp}$, occurs at a nonzero
distance from the nucleus. [The reader is familiar with analogous distinctions. Rhode
Island has a higher population density than Texas, but the population of Texas (density
times area) is greater. Again, matter in the universe has a much higher mass density
in stars and planets than in intergalactic gas or dust, but the total mass of the latter far
exceeds that of the former due to the much greater volume of “empty” space.]

Figure 4-4 » The volume-weighted probability density for the lowest-energy eigenfunction of the
hydrogenlike ion. The most probable value of $r$ occurs at $r_{mp}$.


EXAMPLE 4-3 Estimate, for the hydrogen atom in the 1s state, the amount of
electronic charge located in a spherical shell that is 1.00 pm thick and which has a
radius of 60.0 pm.


SOLUTION » Recall from Example 4-1 that, for relatively thin shells like this, we can
estimate the volume of the shell by taking its area times its thickness. Also, we expect
the charge density, $\psi^2$, to change very little over the 1.00 pm thickness of the shell, so
we can take its value at 60.0 pm as constant. Then charge density $= \psi^2(r = 60.0\ \text{pm}) =
(1/\pi a_0^3)\exp(-2(60.0\ \text{pm})/a_0)$. Recalling that $a_0 = 52.9$ pm, this gives $0.0329\,a_0^{-3}$. This gives
density per cubic bohr. To proceed, we can either convert this to density per cubic picometer by
dividing by $(52.9\ \text{pm/bohr})^3$ or by converting $r$ to bohr, by dividing by 52.9 pm/bohr. We choose the
latter. Then $r = 60.0\ \text{pm}/(52.9\ \text{pm/bohr}) = 1.13\,a_0$, and $\Delta r = 1.00\ \text{pm}/(52.9\ \text{pm/bohr}) = 0.0189\,a_0$,
so volume of shell $= 4\pi r^2\,\Delta r = 4\pi(1.13\,a_0)^2(0.0189\,a_0) = 0.3028\,a_0^3$. Volume times density
$= (0.3028\,a_0^3)(0.0329\,a_0^{-3}) = 0.010$. So 1% of the electronic charge resides in this shell.
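The thin-shell estimate of Example 4-3 can be compared with an exact integration of $4\pi r^2\psi^2$. This is a minimal sketch under stated assumptions: lengths are in bohr with $a_0 = 52.9$ pm as in the example, $Z = 1$, the exact calculation takes 60.0 pm as the inner radius of the shell (the example does not say whether 60.0 pm is the inner or central radius), and the closed form $1 - e^{-2r}(1 + 2r + 2r^2)$ for the enclosed charge comes from integrating $4r^2e^{-2r}$ by parts. The function names are illustrative.

```python
import math

A0_PM = 52.9   # Bohr radius in picometers, as used in the example


def density_1s(r):
    """psi^2 for hydrogen 1s; r in bohr, result per cubic bohr."""
    return math.exp(-2.0 * r) / math.pi


def charge_within(r):
    """Fraction of the 1s charge inside radius r (bohr):
    integral of 4 r^2 exp(-2r) from 0 to r."""
    return 1.0 - math.exp(-2.0 * r) * (1.0 + 2.0 * r + 2.0 * r * r)


def shell_charge_thin(r, dr):
    """Thin-shell estimate: (area * thickness) * density at r."""
    return 4.0 * math.pi * r * r * dr * density_1s(r)


def shell_charge_exact(r, dr):
    """Exact charge between r and r + dr."""
    return charge_within(r + dr) - charge_within(r)


if __name__ == "__main__":
    r = 60.0 / A0_PM     # shell radius in bohr
    dr = 1.00 / A0_PM    # shell thickness in bohr
    print(f"thin-shell: {shell_charge_thin(r, dr):.5f}")
    print(f"exact:      {shell_charge_exact(r, dr):.5f}")
```

Both values come out close to 0.010, in line with the 1% found in the example.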


We can calculate the value of $r_{mp}$ by finding which $r$ value gives the maximum value
of $4\pi r^2\psi^2$. Recall that we can do this by finding the value of $r$ that causes the first
derivative of $4\pi r^2\psi^2$ to vanish, that is, we require


$$(d/dr)\,[4\pi r^2\,\pi^{-1}(Z/a_0)^3\exp(-2Zr/a_0)] = 0 \tag{4-10}$$

This gives

$$\text{constants}\cdot[2r - (2Zr^2/a_0)]\exp(-2Zr/a_0) = 0 \tag{4-11}$$


The term in brackets vanishes when $r = a_0/Z$, so this is the value of $r_{mp}$. For
$Z = 1$, $r_{mp} = a_0$; $a_0$ is the most probable distance of the electron from the nucleus in
the hydrogen atom. For the He$^+$ ion ($Z = 2$), the most probable distance is only half
as great, consistent with a more contracted charge cloud.

Of more interest, often, is the average value of the distance of the electron from
the nucleus. If we could sample the instantaneous distance of the electron from the
nucleus a large number of times and calculate the average value, what sort of result
would we obtain? The probability for finding the electron at any given distance $r$ is
given by the volume-weighted probability density of Fig. 4-4. Inspection of that figure
suggests that the average value of the position of the electron, $\bar{r}$, is greater than $r_{mp}$, the
most probable value. But exactly how much bigger is $\bar{r}$ than $r_{mp}$? How should we
compute the average value? The reader is familiar with the way an average test score
is calculated from a collection of scores. For example, suppose the series of scores to
be averaged is 0, 2, 6, 6, 7, 7, 7, 10, and that 0 and 10 are the minimum and maximum
possible scores. The average is given by


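$$\text{average} = \frac{\text{sum of scores}}{\text{number of scores}} = \frac{0 + 2 + 6 + 6 + 7 + 7 + 7 + 10}{8} = \frac{45}{8} \tag{4-12}$$


Another way to write this is


$$\text{average} = \frac{\sum(\text{frequency of score} \times \text{score})}{\text{sum of frequencies}} = \frac{1\cdot 0 + 0\cdot 1 + 1\cdot 2 + 0\cdot 3 + 0\cdot 4 + 0\cdot 5 + 2\cdot 6 + 3\cdot 7 + 0\cdot 8 + 0\cdot 9 + 1\cdot 10}{1 + 0 + 1 + 0 + 0 + 0 + 2 + 3 + 0 + 0 + 1}$$


or


$$\text{average} = \frac{\sum_{i=0}^{10}(\text{frequency of } i)\cdot i}{\sum_{i=0}^{10}(\text{frequency of } i)} \tag{4-13}$$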
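The equivalence of Eqs. (4-12) and (4-13) can be seen by computing the average both ways for the scores listed above. This is a minimal sketch using only the standard library; the variable names are illustrative.

```python
from collections import Counter

scores = [0, 2, 6, 6, 7, 7, 7, 10]

# Eq. (4-12): sum of scores divided by the number of scores.
plain_average = sum(scores) / len(scores)

# Eq. (4-13): each distinct score weighted by how often it occurs,
# divided by the sum of the frequencies.
frequency = Counter(scores)
weighted_average = (sum(count * score for score, count in frequency.items())
                    / sum(frequency.values()))

if __name__ == "__main__":
    print(plain_average, weighted_average)
```

The frequency-weighted form is the one that carries over to a continuous variable, where the frequencies become a probability density and the sums become integrals.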


The same idea is used to compute a quantum-mechanical average. For the average value
of $r$ we take each possible value of $r$ times its frequency (given by $\psi^2\,dv$) and sum
over all these values.³ For a continuous variable like $r$, we must resort to integration to
accomplish this. We divide by the “sum” of frequencies by dividing by $\int\psi^2\,dv$. Thus


$$\bar{r} = \frac{\int_{\text{all space}}\psi^2\,r\,dv}{\int_{\text{all space}}\psi^2\,dv} \tag{4-14}$$
The denominator is unity since y, is normalized. The integrals over 6 and $ involve 


parts of the volume element dv, and not y?, because this wavefunction (4-9) does not 
depend on 0 or $. Continuing, 


$$\bar{r} = \phi\Big|_0^{2\pi}\cdot(-\cos\theta)\Big|_0^{\pi}\cdot\pi^{-1}(Z/a_0)^3\int_0^\infty r^3\exp(-2Zr/a_0)\,dr \tag{4-15}$$


Utilizing the information in Appendix 1 for the integral over r, this becomes 


$$\bar{r} = 2\pi\,[-(-1)+1]\,(1/\pi)(Z/a_0)^3\,\frac{3!}{(2Z/a_0)^4} \tag{4-16}$$

$$= 4\pi\cdot(1/\pi)(Z/a_0)^3\cdot\frac{6a_0^4}{16Z^4} = \frac{3a_0}{2Z} \tag{4-17}$$

(It is useful to remember that integration over the $\phi$ and $\theta$ parts of $dv$ gives $4\pi$ as the
result if no other angle-dependent functions occur in the integral.) Comparing (4-17)
with our expression for $r_{mp}$ indicates that $\bar{r}$ is 1.5 times greater than $r_{mp}$.
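The ratio of 1.5 can be checked numerically. The sketch below is an illustration under stated assumptions, not part of the original derivation: lengths are measured in units of $a_0$ (so $a_0 = 1$), the integrals are estimated with a simple trapezoid rule, and the function names, grid size, and cutoff `r_max` are arbitrary choices.

```python
import math

def radial_density(r, Z=1.0):
    """Volume-weighted density 4*pi*r^2*psi_1s^2, with psi_1s^2 = (Z^3/pi) exp(-2Zr); r in units of a0."""
    return 4.0 * math.pi * r**2 * (Z**3 / math.pi) * math.exp(-2.0 * Z * r)

def trapezoid(f, a, b, steps):
    """Trapezoid-rule estimate of the integral of f from a to b."""
    h = (b - a) / steps
    total = 0.5 * (f(a) + f(b))
    for k in range(1, steps):
        total += f(a + k * h)
    return total * h

def mean_and_most_probable_r(Z=1.0, r_max=40.0, steps=40000):
    """Return (average r, most probable r) for the 1s state, in units of a0."""
    norm = trapezoid(lambda r: radial_density(r, Z), 0.0, r_max, steps)
    mean_r = trapezoid(lambda r: r * radial_density(r, Z), 0.0, r_max, steps) / norm
    grid = [k * r_max / steps for k in range(steps + 1)]
    r_mp = max(grid, key=lambda r: radial_density(r, Z))  # grid point of maximum density
    return mean_r, r_mp

mean_r, r_mp = mean_and_most_probable_r(Z=1.0)
print(mean_r, r_mp, mean_r / r_mp)
```

For $Z = 1$ the two values come out close to 1.5 and 1.0 (in units of $a_0$), and for $Z = 2$ both are halved, in line with Eq. (4-17).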

Notice that the lowest-energy eigenfunction is finite at $r = 0$ even though $V$ is infinite
there. This is allowed by our arguments in Chapter 2 because the infinity in $V$ occurs
at only one point, so it can be cancelled by a discontinuity in the derivative of $\psi$. This
is possible only if $\psi$ has a corner or cusp at $r = 0$ (see Fig. 4-3a and e).


4-1.D Quantum Numbers and Nomenclature 


There are three quantum numbers, $n$, $l$, and $m$ (all integers), characterizing each solution
of the Schrödinger equation (4-6). Of these, only $n$ enters the energy formula (4-8),
so all solutions having the same value of $n$ but different values of $l$ and $m$ will be
energetically degenerate. As is shown in following sections, these quantum numbers
are related in their allowed values. The $l$ quantum number must be nonnegative and
smaller than $n$. The $m$ number may be positive, negative, or zero, but its absolute value
cannot exceed $l$. Thus,

$$l = 0, 1, 2, \ldots, n-1 \tag{4-18}$$

$$|m| \le l \tag{4-19}$$

For the lowest-energy wavefunction we have already described, $n = 1$, so $l = m = 0$.
No other choices are possible, so this level is nondegenerate. The convention (from
atomic spectroscopy) is to refer to an $l = 0$ solution as an “s function,” or “s orbital.”
(For $l = 0, 1, 2, 3, 4, 5$, the spectroscopic designation goes s, p, d, f, g, h.) Because $n$
equals unity, the wavefunction is labeled 1s.


³The $4\pi r^2$ part of $4\pi r^2\psi^2$ in Fig. 4-4 is included in $dv$, as will be seen shortly.

When $n = 2$, there are four sets of $l$ and $m$ quantum numbers satisfying rules (4-18)
and (4-19). They are listed below with their spectroscopic labels:

$$\begin{array}{llll} l = 0,\ m = 0 & 2s & \qquad l = 1,\ m = 0 & 2p_0 \\ l = 1,\ m = -1 & 2p_{-1} & \qquad l = 1,\ m = +1 & 2p_{+1} \end{array} \tag{4-20}$$

Extending these rules to the $n = 3$ energy level produces nine functions designated
3s, $3p_{-1}$, $3p_0$, $3p_{+1}$, $3d_{-2}$, $3d_{-1}$, $3d_0$, $3d_{+1}$, $3d_{+2}$. In general, the degeneracy of the
energy level characterized by $n$ is $n^2$.


EXAMPLE 4-4 Explain how it comes about from the quantum number rules that
the degeneracy equals $n^2$.

SOLUTION » The rules indicate that, for each value of $n$, there are $n$ values of $l$ (e.g.,
$n = 1$, $l = 0$; $n = 2$, $l = 0$ and 1, ...), and each value of $l$ is associated with $2l + 1$ values of $m_l$ (e.g.,
$l = 2$, $m_l = -2, -1, 0, 1, 2$, which is five values). Note that $2l + 1$ is always an odd number of
member states, and this odd number keeps increasing as $l$ increases. Thus, for
$n = 4$, $l = 0, 1, 2, 3$, leading to degenerate sets of states having, respectively, 1, 3, 5, and 7 members.
But the sum of the first $n$ odd numbers, starting from 1, is always equal to $n^2$. QED
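The counting argument can also be carried out by brute force. The short Python sketch below (the function name `allowed_states` is illustrative) enumerates every $(l, m)$ pair allowed by rules (4-18) and (4-19) and compares the count with $n^2$.

```python
def allowed_states(n):
    """Return every (l, m) pair permitted by rules (4-18) and (4-19) for a given n."""
    return [(l, m) for l in range(n) for m in range(-l, l + 1)]

for n in range(1, 5):
    states = allowed_states(n)
    set_sizes = [2 * l + 1 for l in range(n)]  # members per l value: 1, 3, 5, ...
    print(n, set_sizes, len(states), n**2)
```

For each $n$ the list of set sizes is the first $n$ odd numbers, and the total number of states equals $n^2$.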


4-1.E Nature of the Higher-Energy Solutions 


The second energy level is associated with the 2s, $2p_{-1}$, $2p_0$, and $2p_{+1}$ orbitals. The
wavefunction for the 2s state is

$$\psi_{2s} = \frac{1}{4\sqrt{2\pi}}\left(\frac{Z}{a_0}\right)^{3/2}\left(2 - \frac{Zr}{a_0}\right)\exp\left(\frac{-Zr}{2a_0}\right) \tag{4-21}$$

Since this is a function of $r$ only, it is a spherically symmetric function. (In fact, all
s orbitals are spherically symmetric.) The 2s orbital is more “spread out” than the 1s
orbital because the exponential in $\psi_{2s}$ decays more slowly and because the exponential
is multiplied by $Zr/a_0$ (the 2 becomes negligible compared to $Zr/a_0$ at large $r$). As a
result, the charge cloud associated with the 2s orbital is more diffuse. (For this reason,
when we approximate a polyelectronic atom like beryllium by putting electrons in
1s and 2s orbitals, the 2s electrons are referred to as “outer” and the 1s electrons are
called “inner.”)

At small values of $r$, $(2 - Zr/a_0)$ is positive, and at large distances it is negative,
so $\psi_{2s}$ has a spherical radial node (a zero in the $r$ coordinate). In Fig. 4-5 the first
three s orbitals are plotted. We see that the $n$th s orbital has $(n - 1)$ spherical nodal
surfaces dividing regions where the wavefunctions have different sign. The appearance
of more and more nodes in the radial coordinate as the energy increases is certainly 
familiar from previous examples. Notice how the wavefunctions oscillate most rapidly 
and nodes are most closely spaced in the regions near the nucleus where the electron 
classically would have its greatest kinetic energy. 

The 2s orbital is orthogonal to the 1s orbital, and also to all higher s orbitals. This
would not be possible if there were no radial nodes. The product $\psi_{1s}\psi_{2s}$ will vanish
upon integration only if it either vanishes everywhere or else has positive and negative
regions that cancel on integration. Since $\psi_{1s}$ and $\psi_{2s}$ are almost everywhere finite (that is, nonzero), the
former condition does not occur. Since $\psi_{1s}$ has the same sign everywhere, their product
can have positive and negative regions only if $\psi_{2s}$ has positive and negative regions,
and hence, a node.
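The cancellation of positive and negative regions can be confirmed numerically. The following Python sketch is illustrative only: it assumes lengths in units of $a_0$, uses the 1s and 2s formulas for the hydrogenlike ion, and estimates the integral over all space with a trapezoid rule in $r$ (the angular integration simply contributes $4\pi$). The helper names and grid parameters are assumptions.

```python
import math

def psi_1s(r, Z=1.0):
    """1s wavefunction with a0 = 1."""
    return math.sqrt(Z**3 / math.pi) * math.exp(-Z * r)

def psi_2s(r, Z=1.0):
    """2s wavefunction of Eq. (4-21) with a0 = 1."""
    return (Z**1.5 / (4.0 * math.sqrt(2.0 * math.pi))) * (2.0 - Z * r) * math.exp(-Z * r / 2.0)

def radial_integral(f, r_max=60.0, steps=60000):
    """Trapezoid estimate of the integral of f(r) * 4*pi*r^2 for 0 <= r <= r_max."""
    h = r_max / steps
    total = 0.5 * f(r_max) * 4.0 * math.pi * r_max**2  # endpoint at r_max; integrand is zero at r = 0
    for k in range(1, steps):
        r = k * h
        total += f(r) * 4.0 * math.pi * r * r
    return total * h

overlap = radial_integral(lambda r: psi_1s(r) * psi_2s(r))
norm_2s = radial_integral(lambda r: psi_2s(r) ** 2)
print(overlap, norm_2s)
```

The overlap comes out essentially zero while the 2s normalization integral is essentially one: the region inside the node ($r < 2a_0/Z$), where the product is positive, is exactly balanced by the region outside it.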


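Let us now consider the 2p functions. They are⁴

$$\psi_{2p_0} = \frac{1}{4\sqrt{2\pi}}\left(\frac{Z}{a_0}\right)^{3/2}\frac{Zr}{a_0}\exp\left(\frac{-Zr}{2a_0}\right)\cos\theta \tag{4-22}$$

$$\psi_{2p_{\pm1}} = \mp\frac{1}{8\sqrt{\pi}}\left(\frac{Z}{a_0}\right)^{3/2}\frac{Zr}{a_0}\exp\left(\frac{-Zr}{2a_0}\right)\sin\theta\,\exp(\pm i\phi) \tag{4-23}$$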


All of these functions have the same radial exponential decay as the 2s orbital, so we
can say that the 2s and 2p orbitals are about equal in size. However, since the 2p orbitals
contain the factor $Zr/a_0$ where the 2s contains $(2 - Zr/a_0)$, the 2p orbitals vanish at
the nucleus and not at any intermediate $r$ value; they have no radial nodes. The 2p
orbitals are endowed with directional properties by their angular dependences. The
$2p_0$ orbital is particularly easy to understand because the factors $r\cos\theta$ behave exactly
like the $z$ Cartesian coordinate. Hence, we can rewrite $2p_0$ (also called $2p_z$) in mixed
coordinates as


$$2p_z = \frac{1}{4\sqrt{2\pi}}\left(\frac{Z}{a_0}\right)^{5/2} z\,\exp\left(\frac{-Zr}{2a_0}\right) \tag{4-24}$$
Figure 4-5 » s wavefunctions ($\psi_{1s}$, $\psi_{2s}$, $\psi_{3s}$) versus $r$ and volume-weighted electron densities
($4\pi r^2\psi^2$) versus $r$ for the hydrogenlike ion.


⁴The $\mp$ factor in $\psi_{2p_{\pm1}}$ results from a phase factor that is omitted from many textbooks. It has no effect on
our discussion here, but is consistent with an implicit choice of sign for certain integrals appearing in Appendix 4.
See, e.g., Zare [4, Chapter 1].

(Be careful not to confuse the atomic number $Z$ with the coordinate $z$.) The exponential
term in Eq. (4-24) is spherically symmetric, resembling a diffuse 1s orbital. The
function $z$ vanishes in the $xy$ plane and becomes increasingly positive or negative as
we move away from the plane in either direction. As a result, $\psi_{2p_z}$ looks as sketched in
Fig. 4-6. It has nearly spherical lobes, one with positive phase and one with negative
phase. When $\psi_{2p_z}$ is squared, the contour lines remain unchanged in relative position,
but the function becomes everywhere positive in sign.

The $2p_{\pm1}$ orbitals are more difficult to visualize since they are complex functions.
The charge distributions associated with these orbitals must be real, however. These
are given by $\psi^*\psi$, where $\psi^*$ is the complex conjugate of $\psi$. (Recall that, for complex
wavefunctions, $\psi^*\psi$ must be used for probability distributions, rather than $\psi^2$.) One
obtains the complex conjugate of a function by merely reversing the signs of all the $i$'s
in the function. It is evident from Eq. (4-23) that $\psi_{2p_{+1}}^* = -\psi_{2p_{-1}}$ and $\psi_{2p_{-1}}^* = -\psi_{2p_{+1}}$.
Hence $\psi_{2p_{+1}}^*\psi_{2p_{+1}} = \psi_{2p_{-1}}^*\psi_{2p_{-1}} = -\psi_{2p_{+1}}\psi_{2p_{-1}}$: both the $2p_{+1}$ and the $2p_{-1}$ orbitals
give the same charge distribution. This distribution is the same as that for the $2p_0$
orbital except that the angle dependence is $\frac{1}{2}\sin^2\theta$ instead of $\cos^2\theta$. However, since
$\sin^2\theta + \cos^2\theta = 1$, it follows that the sum of $2p_{+1}$ and $2p_{-1}$ charge clouds must be such
that, when added to the charge cloud for $2p_0$, a spherical charge cloud results (since the
angular dependence is removed). We already know that the $2p_z$ distribution is dumbbell
shaped, so it follows that $2p_{+1}$ and $2p_{-1}$ produce doughnut-shaped distributions.
(A sphere minus a dumbbell equals a doughnut. See Fig. 4-6.) The shape can also be
inferred from the $\sin^2\theta$-dependence, which is a maximum in the $xy$ plane.

Figure 4-6 » (a) Drawing of the $2p_z$ orbital. (b) Drawing of the square of the $2p_z$ orbital. (c) Drawing
of $\psi_{2p_z}^2 + \psi_{2p_{+1}}^*\psi_{2p_{+1}} + \psi_{2p_{-1}}^*\psi_{2p_{-1}}$ = spherically symmetric distribution. The curved lines in
(c) are a visualization aid and are not mathematically significant.

When solving the particle-in-a-ring problem, we saw that we could arrive at either a
set of real trigonometric solutions or a set of complex exponential solutions. Since the $\phi$
dependence of the $2p_{\pm1}$ orbitals is identical to that for $m = \pm1$ solutions of the particle
in the ring, the same situation holds here. Because $\psi_{2p_{+1}}$ and $\psi_{2p_{-1}}$ are energetically
degenerate eigenfunctions, any linear combination of them is also an eigenfunction of
the hamiltonian (Problem 2-11). Therefore, let us find linear combinations that are
entirely real. The complex part of $\psi_{2p_{\pm1}}$, $\exp(\pm i\phi)$, satisfies the relation

$$\exp(\pm i\phi) = \cos\phi \pm i\sin\phi \tag{4-25}$$

so that

$$\exp(+i\phi) + \exp(-i\phi) = 2\cos\phi \tag{4-26}$$

and

$$i^{-1}[\exp(+i\phi) - \exp(-i\phi)] = 2\sin\phi \tag{4-27}$$


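Thus, we have two linear combinations of $\exp(\pm i\phi)$ that are real. It follows immediately
that

$$\psi_{2p_x} = \frac{1}{\sqrt{2}}\left[\psi_{2p_{-1}} - \psi_{2p_{+1}}\right] = \frac{1}{4\sqrt{2\pi}}\left(\frac{Z}{a_0}\right)^{3/2}\frac{Zr}{a_0}\exp\left(\frac{-Zr}{2a_0}\right)\sin\theta\cos\phi \tag{4-28}$$

$$\psi_{2p_y} = \frac{i}{\sqrt{2}}\left[\psi_{2p_{-1}} + \psi_{2p_{+1}}\right] = \frac{1}{4\sqrt{2\pi}}\left(\frac{Z}{a_0}\right)^{3/2}\frac{Zr}{a_0}\exp\left(\frac{-Zr}{2a_0}\right)\sin\theta\sin\phi \tag{4-29}$$

where the factor $2^{-1/2}$ is used to maintain normality. Since $r\sin\theta\cos\phi$ and $r\sin\theta\sin\phi$
are equivalent to the Cartesian coordinates $x$ and $y$, respectively, Eqs. (4-28) and (4-29)
are commonly referred to as the $2p_x$ and $2p_y$ orbitals. They are exactly like the $2p_z$
orbital except that they are oriented along the $x$ and $y$ axes (merely replace the $z$ in
Eq. (4-24) with $x$ or $y$).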
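A quick numerical check of these combinations is sketched below in Python. It is an illustration under stated assumptions: only the angular part and numerical prefactor of the complex 2p functions are used (the common radial factor is omitted), the sign convention is that of Eq. (4-23), and the function names are hypothetical.

```python
import cmath
import math

def angular_2p(m, theta, phi):
    """Angular part (with prefactor 1/(8 sqrt(pi))) of psi_2p(m), m = +1 or -1, signs as in Eq. (4-23)."""
    sign = -1.0 if m == +1 else +1.0
    return sign * math.sin(theta) * cmath.exp(1j * m * phi) / (8.0 * math.sqrt(math.pi))

def real_combinations(theta, phi):
    """Return the combinations of Eqs. (4-28) and (4-29) at one point on the sphere."""
    plus = angular_2p(+1, theta, phi)
    minus = angular_2p(-1, theta, phi)
    p_x = (minus - plus) / math.sqrt(2.0)        # Eq. (4-28)
    p_y = 1j * (minus + plus) / math.sqrt(2.0)   # Eq. (4-29)
    return p_x, p_y

p_x, p_y = real_combinations(theta=0.7, phi=2.1)
print(p_x, p_y)
```

At any test point the imaginary parts vanish to rounding error, and the real parts equal $\sin\theta\cos\phi$ and $\sin\theta\sin\phi$ times $1/4\sqrt{2\pi}$, which is the prefactor appearing in Eqs. (4-28) and (4-29).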

The 2s, $2p_x$, $2p_y$, and $2p_z$ orbitals are all orthogonal to one another. This is easily
shown from symmetry considerations. Each 2p orbital is antisymmetric for reflection
in its nodal plane, whereas 2s is symmetric for all reflections. Hence, the product
$\psi_{2s}\psi_{2p}$ is always antisymmetric with respect to some reflection, so its integral vanishes.
The 2p functions are mutually orthogonal because, if one 2p orbital is antisymmetric
for some reflection, the others are always symmetric for that reflection. Hence, the
product is antisymmetric for that reflection. Another way to show that the 2p orbitals
are mutually orthogonal is to note that they behave like $x$, $y$, and $z$ vectors and that these
vectors are orthogonal (i.e., perpendicular; orthogonality in functions is equivalent to
perpendicularity in vectors). Sometimes the orthogonality of functions is most clearly
seen if we sketch out the product and note whether the positive and negative regions
cancel by symmetry (Fig. 4-7).

The $n = 3$ level has nine solutions associated with it. The 3s orbital, plotted in
Fig. 4-5, has one more node than the 2s orbital and is more diffuse. The 3p orbitals have
the same angular terms as did the 2p orbitals, so they can be written as real functions
having the same directional properties as $x$, $y$, and $z$ vectors. The 3p orbitals differ from
the 2p orbitals in that they possess a radial node and are more diffuse (see Fig. 4-8).
The remaining five levels, 3d levels, may also be written in either complex or real form.
The real orbitals are given by the formulas

$$\left.\begin{array}{l} 3d_{z^2} \\ 3d_{x^2-y^2} \\ 3d_{xy} \\ 3d_{xz} \\ 3d_{yz} \end{array}\right\} = \frac{1}{81\sqrt{2\pi}}\left(\frac{Z}{a_0}\right)^{3/2}\left(\frac{Zr}{a_0}\right)^2\exp\left(\frac{-Zr}{3a_0}\right)\times\left\{\begin{array}{l} (1/\sqrt{3})(3\cos^2\theta - 1) \\ \sin^2\theta\cos2\phi \\ \sin^2\theta\sin2\phi \\ \sin2\theta\cos\phi \\ \sin2\theta\sin\phi \end{array}\right. \tag{4-30}$$


These angular factors, times $r^2$, have directional properties identical to the Cartesian
subscripts on the left, except that $3d_{z^2}$ is a shorthand for $3d_{3z^2-r^2}$. These orbitals are
sketched in Fig. 4-8. It is obvious from these figures that $3d_{x^2-y^2}$ has the same symmetry
and orientation as the sum of the two vectors $x^2$ and $-y^2$, and that the other 3d orbitals
have a similar connection with the notation (except for $3d_{z^2}$). The 3d functions are
about the same size as the 3p and 3s functions, but have no radial nodes at intermediate
$r$ values.

Figure 4-7 » Drawings of orbitals and their products to demonstrate orthogonality. (Panels: an s orbital
times a 2p orbital, and one 2p orbital times another; in each product, equal and opposite regions cancel by symmetry.)

Figure 4-8 » Some hydrogenlike orbitals at the $n = 3$ level.

A general pattern emerges when we examine the nodal properties of the orbitals at
various energies. At the lowest energy we have no nodes and the level is nondegenerate.
At the next level, we find that each function possesses a single node. There is one way to
put in a radial node and so we get one 2s orbital. Or we can put in a planar node. But we
have three choices for orthogonal orientations of this plane, leading to three independent
p orbitals. At the $n = 3$ level we find orbitals containing two nodes. The possibilities
are: two radial nodes (3s), a radial node and a planar node ($3p_x$, $3p_y$, $3p_z$), two planar
nodes ($3d_{xy}$, $3d_{xz}$, $3d_{yz}$, $3d_{x^2-y^2}$, $3d_{z^2-x^2}$, $3d_{z^2-y^2}$). (But, since $(z^2 - y^2) - (z^2 - x^2) =
x^2 - y^2$, the last three 3d orbitals are not linearly independent. Hence the last two
are combined to form $3d_{z^2}$: $(z^2 - x^2) + (z^2 - y^2) = 3z^2 - (x^2 + y^2 + z^2) = 3z^2 - r^2$.
This function can be seen to correspond to a positive dumbbell encircled by a small
negative doughnut, or “belly band.” The two nodes in this orbital arising from nodes in
$\theta$ are conical surfaces rather than planes.) It is apparent that the degeneracies between
various 2p orbitals, or 3d orbitals, are spatial degeneracies, due only to the physical
equivalence of various directions in space. The degeneracy between 2s and 2p, or 3s,
3p, and 3d is not due to spatial symmetry. The fact that an angular node is energetically
equivalent to a radial node is a peculiarity of the particular potential ($-Ze^2/4\pi\varepsilon_0 r$) for
this problem. This degeneracy is removed for noncoulombic central-field potentials.
The eigenfunctions corresponding to states in the energy continuum, like the bound
states, can be separated into radial and angular parts. The radial parts of the spherically
symmetric eigenfunctions at two nonnegative energies are given in Fig. 4-9. Note that
the rate of oscillation of these functions is greatest at the nucleus, where the local kinetic
energy is largest, in accord with the ideas presented in Chapters 1 and 2. Unbound-state
wavefunctions are not used in most quantum chemical applications, so we will not
discuss them further in this book.

Figure 4-9 » Radial part of unbound H-atom states (times $r$) versus $r$ at two energies: (a) $E =
13.6$ eV; (b) $E = 0$.


EXAMPLE 4-5 A nodal plane is one through which a wavefunction is antisymmetric
for reflection. Consider the $3d_{xy}$ and $3d_{yz}$ orbitals of Fig. 4-8. Through which
planes do these two orbitals have different reflection symmetries?

SOLUTION » $3d_{xy}$ is antisymmetric for reflection only through the $x,z$ and $y,z$ planes. $3d_{yz}$
is antisymmetric for reflection only through the $x,z$ and $x,y$ planes. Hence, these orbitals differ
in their symmetries for reflection through the $y,z$ and $x,y$ planes.
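The same bookkeeping can be done mechanically. In the Python sketch below, each orbital is represented only by the Cartesian product that carries its symmetry ($xy$ and $yz$); the radial and exponential factors are unaffected by reflections and are left out. This representation, the test point, and all names are assumptions made for illustration.

```python
def d_xy(x, y, z):
    """Cartesian factor carrying the symmetry of the 3d_xy orbital."""
    return x * y

def d_yz(x, y, z):
    """Cartesian factor carrying the symmetry of the 3d_yz orbital."""
    return y * z

# Reflection through a coordinate plane reverses the coordinate perpendicular to it.
REFLECTIONS = {
    "xy": lambda x, y, z: (x, y, -z),
    "xz": lambda x, y, z: (x, -y, z),
    "yz": lambda x, y, z: (-x, y, z),
}

def reflection_character(f, plane, point=(0.3, -0.7, 1.1)):
    """Return +1 if f is symmetric, -1 if antisymmetric, for reflection through the named plane."""
    original = f(*point)
    reflected = f(*REFLECTIONS[plane](*point))
    if reflected == original:
        return 1
    if reflected == -original:
        return -1
    raise ValueError("function is neither symmetric nor antisymmetric for this reflection")

def differing_planes():
    """Planes through which the two orbitals have different reflection symmetry."""
    return [p for p in sorted(REFLECTIONS)
            if reflection_character(d_xy, p) != reflection_character(d_yz, p)]

print(differing_planes())
```

The two functions share the same (antisymmetric) behavior only for the $x,z$ plane, so the list returned contains the other two planes, as in the solution above.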


4-2 Separation of Variables

We shall indicate in some detail the way in which the Schrödinger equation (4-6) is
solved. Recall the strategy of separating variables which we used in Section 2-7:

1. Express $\psi$ as a product of functions, each depending on only one variable.

2. Substitute this product into the Schrödinger equation and try to manipulate it so that
the equation becomes a sum of terms, each depending on a single variable. These
terms must sum to a constant.

3. Since terms for different variables are independent of each other, the terms for each
variable must equal a constant. This enables one to set up an equation in each
variable. If this can be done, the initial assumption (1) is justified.


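In this case we begin by assuming that

$$\psi(r, \theta, \phi) = R(r)\Theta(\theta)\Phi(\phi) \tag{4-31}$$

Substituting into Eq. (4-6) gives

$$\left[\frac{-h^2}{8\pi^2\mu r^2}\left(\frac{d}{dr}\left(r^2\frac{d}{dr}\right) + \frac{1}{\sin\theta}\frac{d}{d\theta}\left(\sin\theta\frac{d}{d\theta}\right) + \frac{1}{\sin^2\theta}\frac{d^2}{d\phi^2}\right) - \frac{Ze^2}{4\pi\varepsilon_0 r}\right]R\Theta\Phi = ER\Theta\Phi \tag{4-32}$$

Since each derivative operator now acts on a function of a single coordinate, we use
total, rather than partial, derivative notation.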

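Let us first see if we can isolate the $\phi$ dependence. Multiplying Eq. (4-32) by
$(-8\pi^2\mu r^2\sin^2\theta / h^2 R\Theta\Phi)$ and rearranging gives

$$\frac{\sin^2\theta}{R}\frac{d}{dr}\left(r^2\frac{dR}{dr}\right) + \frac{8\pi^2\mu r^2\sin^2\theta}{h^2}\left(E + \frac{Ze^2}{4\pi\varepsilon_0 r}\right) + \frac{\sin\theta}{\Theta}\frac{d}{d\theta}\left(\sin\theta\frac{d\Theta}{d\theta}\right) + \frac{1}{\Phi}\frac{d^2\Phi}{d\phi^2} = 0 \tag{4-33}$$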

The $r$ and $\theta$ dependence is still mixed in the first two terms, but we now have a rather
simple term in the coordinate $\phi$. Now we can argue, as in Section 2-7, that, as $\phi$ alone
changes, the first three terms in Eq. (4-33) do not change. That is, if only $\phi$ changes,
Eq. (4-33) may be written

$$\text{constant} + \text{constant} + \text{constant} + (1/\Phi)(d^2\Phi/d\phi^2) = 0 \tag{4-34}$$

so that

$$(1/\Phi)(d^2\Phi/d\phi^2) = -m^2 \quad \text{(a constant)} \tag{4-35}$$


We call the constant $-m^2$ for future mathematical convenience. We can rearrange
Eq. (4-35) into the more familiar form for an eigenvalue equation:

$$d^2\Phi/d\phi^2 = -m^2\Phi \tag{4-36}$$

We arrived at Eq. (4-36) by assuming that only $\phi$ changes while $r$ and $\theta$ are constant.
However, it should be obvious that the behavior of the term in $\phi$ is uninfluenced by
changes in $r$ and $\theta$ since it has no dependence on these coordinates. Thus, by establishing
that this term is constant under certain circumstances, we have actually shown
that it must be constant under all circumstances, and we have produced an eigenvalue
equation for $\Phi$.
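As a concrete check on Eq. (4-36), the Python sketch below takes the exponential form $\exp(im\phi)$, familiar from the particle in a ring, as an assumed trial solution and tests it numerically. The second derivative is estimated by a central finite difference; the function names and step size are illustrative choices, not part of the text.

```python
import cmath
import math

def phi_function(m, angle):
    """Unnormalized trial solution exp(i*m*angle)."""
    return cmath.exp(1j * m * angle)

def second_derivative(f, x, h=1e-4):
    """Central finite-difference estimate of f''(x)."""
    return (f(x + h) - 2.0 * f(x) + f(x - h)) / (h * h)

def residual(m, angle):
    """|d^2(Phi)/d(phi)^2 + m^2 * Phi| at one angle; near zero when Eq. (4-36) is satisfied."""
    f = lambda a: phi_function(m, a)
    return abs(second_derivative(f, angle) + m * m * f(angle))

def is_single_valued(m, angle=0.4):
    """True if Phi returns to the same value after a full turn of 2*pi."""
    return abs(phi_function(m, angle + 2.0 * math.pi) - phi_function(m, angle)) < 1e-9

print(residual(2, 0.9), is_single_valued(2), is_single_valued(0.5))
```

The residual is zero to within the accuracy of the finite difference for any $m$, but only integer $m$ gives a function that repeats after a full turn, which is the single-valuedness condition met in Section 2-6.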

We can now proceed with further separation of variables. Since we know that the
last term in Eq. (4-33) is a constant, we can write

$$\frac{1}{R}\frac{d}{dr}\left(r^2\frac{dR}{dr}\right) + \frac{8\pi^2\mu r^2}{h^2}\left(E + \frac{Ze^2}{4\pi\varepsilon_0 r}\right) + \frac{1}{\Theta\sin\theta}\frac{d}{d\theta}\left(\sin\theta\frac{d\Theta}{d\theta}\right) - \frac{m^2}{\sin^2\theta} = 0 \tag{4-37}$$

Note that we have separated the $\theta$ and $r$ dependences by dividing through by $\sin^2\theta$.
We now have two terms wholly dependent on $r$ and two wholly dependent on $\theta$, their
sum being zero. Hence, as before, the sum of the two $r$-dependent terms must equal a
constant, $\beta$, and the sum of the $\theta$-dependent terms must equal $-\beta$. Thus


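$$\frac{d}{dr}\left(r^2\frac{dR}{dr}\right) + \frac{8\pi^2\mu r^2}{h^2}\left(E + \frac{Ze^2}{4\pi\varepsilon_0 r}\right)R = \beta R \tag{4-38}$$

$$\frac{1}{\sin\theta}\frac{d}{d\theta}\left(\sin\theta\frac{d\Theta}{d\theta}\right) - \frac{m^2\Theta}{\sin^2\theta} = -\beta\Theta \tag{4-39}$$

where we have multiplied through by $R$ in the first equation and by $\Theta$ in the second.

The assumption that $\psi = R\Theta\Phi$ has led to separate equations for $R$, $\Theta$, and $\Phi$. This
indicates that the assumption of separability was valid. However, there is some linkage
between $R$ and $\Theta$ via $\beta$, and between $\Theta$ and $\Phi$ via $m$.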


4-3 Solution of the R, Θ, and Φ Equations

4-3.A The Φ Equation

The solution of Eq. (4-36) is similar to that of the particle in a ring problem of
Section 2-6. The normalized solutions are

$$\Phi = (1/\sqrt{2\pi})\exp(im\phi), \qquad m = 0, \pm1, \pm2, \ldots \tag{4-40}$$

As shown in Section 2-6, the constant $m$ must be an integer if $\Phi$ is to be a single-valued
function.

4-3.B The Θ Equation


There is great similarity between the mathematical techniques used in solving the $R$ and
$\Theta$ equations and those used to solve the one-dimensional harmonic oscillator problem
of Chapter 3. Hence, we will only summarize the steps involved in these solutions and
make a few remarks about the results. More detailed treatments are presented in many
texts.⁵

The $\Theta$ equation can be solved as follows:

1. Change the variable to obtain a more convenient form for the differential equation.

2. Express the solution as a power series and obtain a recursion relation.

3. Observe that the series diverges for certain values of the variables, producing
nonsquare-integrable wavefunctions. Correct this by requiring that the series terminate.
This requires that the truncated series be either symmetric or antisymmetric
in the variable and also that $\beta$ of Eqs. (4-38) and (4-39) be equal to $l(l + 1)$ with $l$
an integer.

4. Recognize these truncated series as being the associated Legendre functions.

5. Return to the original variable to obtain an expression for $\Theta$ in terms of the starting
coordinate.

Reference to the end of Section 3-4 will illustrate the similarity between this and the
harmonic oscillator case.
The final result for $m \ge 0$ is

$$\Theta_{l,m}(\theta) = (-1)^m\left[\frac{(2l+1)}{2}\frac{(l-|m|)!}{(l+|m|)!}\right]^{1/2}P_l^{|m|}(\cos\theta) \tag{4-41}$$

For $m < 0$ the phase factor $(-1)^m$ should be omitted.⁶ The term in square brackets
is a normalizing function, and $P_l^{|m|}(\cos\theta)$ represents some member of the series of
associated Legendre functions. When $m = 0$, these become the ordinary Legendre
polynomials. The first few ordinary Legendre polynomials are

$$P_0(x) = 1, \quad P_1(x) = x, \quad P_2(x) = \tfrac{1}{2}(3x^2 - 1), \quad P_3(x) = \tfrac{1}{2}(5x^3 - 3x) \tag{4-42}$$

The first few associated Legendre functions are

$$\begin{array}{ll} P_1^1(x) = (1 - x^2)^{1/2}, & P_2^1(x) = 3(1 - x^2)^{1/2}x, \\ P_2^2(x) = 3(1 - x^2), & P_3^1(x) = \tfrac{3}{2}(1 - x^2)^{1/2}(5x^2 - 1), \\ P_3^2(x) = 15(1 - x^2)x, & P_3^3(x) = 15(1 - x^2)^{3/2} \end{array} \tag{4-43}$$


It is also true that

$$P_l^{|m|} = 0 \quad \text{if } |m| > l \tag{4-44}$$

Thus, $\Theta(\theta)$, and hence $\psi(r, \theta, \phi)$, vanishes unless $|m| \le l$, giving us one of our quantum
number rules [Eq. (4-19)].
The associated Legendre functions satisfy an orthogonality relation:

$$\int_{-1}^{+1} P_l^{|m|}(x)\,P_{l'}^{|m|}(x)\,dx = \frac{2}{2l+1}\,\frac{(l+|m|)!}{(l-|m|)!}\,\delta_{l,l'} \tag{4-45}$$

For a further discussion of these functions, the reader should consult a more advanced
text on quantum mechanics.

⁵See, e.g., Pauling and Wilson [3, Chapter 5].
⁶This is the same phase factor that we saw earlier for $\psi_{2p_{\pm1}}$.
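The orthogonality relation can be tested against the explicit functions listed in Eqs. (4-42) and (4-43). The Python sketch below is illustrative: the dictionary keys $(l, |m|)$, the function names, and the midpoint-rule integration with its step count are all assumptions made for this example.

```python
import math

# Functions copied from Eqs. (4-42) and (4-43); keys are (l, |m|).
LEGENDRE = {
    (0, 0): lambda x: 1.0,
    (1, 0): lambda x: x,
    (2, 0): lambda x: 0.5 * (3 * x**2 - 1),
    (3, 0): lambda x: 0.5 * (5 * x**3 - 3 * x),
    (1, 1): lambda x: math.sqrt(1 - x**2),
    (2, 1): lambda x: 3 * math.sqrt(1 - x**2) * x,
    (3, 1): lambda x: 1.5 * math.sqrt(1 - x**2) * (5 * x**2 - 1),
    (2, 2): lambda x: 3 * (1 - x**2),
    (3, 2): lambda x: 15 * (1 - x**2) * x,
    (3, 3): lambda x: 15 * (1 - x**2) ** 1.5,
}

def overlap(l1, l2, m, steps=20000):
    """Midpoint-rule estimate of the integral of P_l1^|m| * P_l2^|m| over -1 <= x <= 1."""
    h = 2.0 / steps
    f, g = LEGENDRE[(l1, m)], LEGENDRE[(l2, m)]
    # Midpoints lie strictly inside (-1, 1), so 1 - x^2 stays positive.
    return h * sum(f(x) * g(x) for x in (-1.0 + (k + 0.5) * h for k in range(steps)))

def expected(l, m):
    """Right-hand side of Eq. (4-45) for l = l'."""
    return 2.0 / (2 * l + 1) * math.factorial(l + m) / math.factorial(l - m)

print(overlap(1, 3, 1), overlap(2, 2, 1), expected(2, 1))
```

Functions with the same $|m|$ but different $l$ integrate to zero, while the $l = l'$ integrals reproduce the factor on the right of Eq. (4-45); for example, the case $l = 2$, $|m| = 1$ gives $\tfrac{2}{5}\cdot\tfrac{3!}{1!} = 2.4$.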


4-3.C The R Equation

The $R$ equation can be solved as follows:

1. Assume that $E$ is negative (this restricts us to bound states), and note that $\beta = l(l + 1)$
from the previous solving of the $\Theta$ equation.

2. Change variables for mathematical convenience.

3. Find the asymptotic solution pertaining to the large $r$ limit, where the $R$ equation
becomes simplified.

4. Express the wavefunction as a product of the asymptotic solution and an unknown
function. Express this unknown function as a power series and (after dealing with
some singularities) obtain a recursion relation.

5. Note that the power series overpowers the asymptotic part of the solution unless the
series is truncated. This leads to the requirement that $n$ be an integer and hence that
$E$ be quantized. It also requires that $n > l$.

6. Recognize the truncated series to be associated Laguerre polynomials times $\rho^l$,
where $\rho$ is defined below.


The resulting solution is, if $\mu = m_e$,

$$R_{n,l}(r) = -\left[\left(\frac{2Z}{na_0}\right)^3\frac{(n-l-1)!}{2n[(n+l)!]^3}\right]^{1/2}\exp(-\rho/2)\,\rho^l\,L_{n+l}^{2l+1}(\rho) \tag{4-46}$$

where $\rho = 2Zr/na_0$ and $a_0 = \varepsilon_0h^2/\pi m_ee^2 = 5.2917706 \times 10^{-11}$ m. The term in brackets
is a normalizing function. The exponential term is the asymptotic solution and it
guarantees that $R(r)$ will approach zero as $r$ approaches infinity. The third term, $\rho^l$, is
produced in the course of removing singularities (i.e., places where parts of a differential
equation become infinite). The last term, $L_{n+l}^{2l+1}(\rho)$, symbolizes the various members
of the set of associated Laguerre polynomials. Like the Legendre functions, these
are mathematically well characterized. A few of the low-index associated Laguerre
polynomials are

$$L_1^1(\rho) = -1, \quad L_2^1(\rho) = 2\rho - 4, \quad L_3^1(\rho) = -3\rho^2 + 18\rho - 18, \quad L_3^3(\rho) = -6 \tag{4-47}$$
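To see how the pieces of the radial solution fit together, the Python sketch below assembles $R_{n,l}(r)$ from the normalizing factor, the exponential, $\rho^l$, and the Laguerre polynomials listed in Eq. (4-47). It is an illustration under stated assumptions: lengths are in units of $a_0$, the sign and normalization conventions are those used in this section (other sources and software libraries may define associated Laguerre polynomials differently), and the names `LAGUERRE` and `radial` are hypothetical.

```python
import math

# Associated Laguerre polynomials from Eq. (4-47), keyed by (n + l, 2l + 1).
LAGUERRE = {
    (1, 1): lambda rho: -1.0,
    (2, 1): lambda rho: 2.0 * rho - 4.0,
    (3, 1): lambda rho: -3.0 * rho**2 + 18.0 * rho - 18.0,
    (3, 3): lambda rho: -6.0,
}

def radial(n, l, r, Z=1.0):
    """R_{n,l}(r) assembled as in Eq. (4-46), with a0 = 1 (r in bohr)."""
    rho = 2.0 * Z * r / n
    norm = math.sqrt((2.0 * Z / n) ** 3 * math.factorial(n - l - 1)
                     / (2.0 * n * math.factorial(n + l) ** 3))
    return -norm * math.exp(-rho / 2.0) * rho**l * LAGUERRE[(n + l, 2 * l + 1)](rho)

# For s states the angular factor is the constant 1/sqrt(4*pi).
r = 1.3
psi_1s = radial(1, 0, r) / math.sqrt(4.0 * math.pi)
psi_2s = radial(2, 0, r) / math.sqrt(4.0 * math.pi)
print(psi_1s, psi_2s)
```

With $Z = 1$ these values agree with the 1s and 2s formulas $(1/\sqrt{\pi})\exp(-r)$ and $(1/4\sqrt{2\pi})(2 - r)\exp(-r/2)$, which shows why the leading minus sign in the radial formula is needed to make the 1s function positive.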


4-4 Atomic Units

It is convenient to define a system of units that is more natural for working with atoms
and molecules. The commonly accepted system of atomic units for some important
quantities is summarized in Table 4-1. [Note: the symbol $\hbar$ (“h-cross” or “h-bar”) is
often used in place of $h/2\pi$.] Additional data on values of physical quantities, units,
and conversion factors can be found in Appendix 10.

In terms of these units, Schrödinger's equation and its resulting eigenfunctions and
eigenvalues for the hydrogenlike ion become much simpler to write down. Thus, the 


TABLE 4-1 » Atomic Units

Quantity | Atomic unit in cgs or other units | Values of some atomic properties in atomic units (a.u.)
Mass | $m_e = 9.109534 \times 10^{-28}$ g | Mass of electron = 1 a.u.
Length | $a_0 = 4\pi\varepsilon_0\hbar^2/m_ee^2 = 0.52917706 \times 10^{-10}$ m (= 1 bohr) | Most probable distance of 1s electron from nucleus of H atom = 1 a.u.
Time | $\tau_0 = a_0\hbar/e^2 = 2.4189 \times 10^{-17}$ s | Time for 1s electron in H atom to travel one bohr = 1 a.u.
Charge | $e = 4.803242 \times 10^{-10}$ esu $= 1.6021892 \times 10^{-19}$ coulomb | Charge of electron = −1 a.u.
Energy | $e^2/4\pi\varepsilon_0a_0 = 4.359814 \times 10^{-18}$ J (= 27.21161 eV = 1 hartree) | Total energy of 1s electron in H atom = −1/2 a.u.
Angular momentum | $\hbar = h/2\pi = 1.0545887 \times 10^{-34}$ J s | Angular momentum for particle in ring = 0, 1, 2, ... a.u.
Electric field strength | $e/a_0^2 = 5.1423 \times 10^{9}$ V/cm | Electric field strength at distance of 1 bohr from proton = 1 a.u.

TABLE 4-2 » Eigenfunctions for the Hydrogenlike Ion in Atomic Units

Spectroscopic symbol | Formula
1s | $(1/\sqrt{\pi})\,Z^{3/2}\exp(-Zr)$
2s | $(1/4\sqrt{2\pi})\,Z^{3/2}(2 - Zr)\exp(-Zr/2)$
$2p_x$ | $(1/4\sqrt{2\pi})\,Z^{5/2}\,r\exp(-Zr/2)\sin\theta\cos\phi$
$2p_y$ | $(1/4\sqrt{2\pi})\,Z^{5/2}\,r\exp(-Zr/2)\sin\theta\sin\phi$
$2p_z$ | $(1/4\sqrt{2\pi})\,Z^{5/2}\,r\exp(-Zr/2)\cos\theta$
3s | $(1/81\sqrt{3\pi})\,Z^{3/2}(27 - 18Zr + 2Z^2r^2)\exp(-Zr/3)$
$3p_x$ | $(\sqrt{2}/81\sqrt{\pi})\,Z^{5/2}\,r(6 - Zr)\exp(-Zr/3)\sin\theta\cos\phi$
$3p_y$ | $(\sqrt{2}/81\sqrt{\pi})\,Z^{5/2}\,r(6 - Zr)\exp(-Zr/3)\sin\theta\sin\phi$
$3p_z$ | $(\sqrt{2}/81\sqrt{\pi})\,Z^{5/2}\,r(6 - Zr)\exp(-Zr/3)\cos\theta$
$3d_{z^2}$ ($= 3d_{3z^2-r^2}$) | $(1/81\sqrt{6\pi})\,Z^{7/2}\,r^2\exp(-Zr/3)(3\cos^2\theta - 1)$
$3d_{x^2-y^2}$ | $(1/81\sqrt{2\pi})\,Z^{7/2}\,r^2\exp(-Zr/3)\sin^2\theta\cos2\phi$
$3d_{xy}$ | $(1/81\sqrt{2\pi})\,Z^{7/2}\,r^2\exp(-Zr/3)\sin^2\theta\sin2\phi$
$3d_{xz}$ | $(1/81\sqrt{2\pi})\,Z^{7/2}\,r^2\exp(-Zr/3)\sin2\theta\cos\phi$
$3d_{yz}$ | $(1/81\sqrt{2\pi})\,Z^{7/2}\,r^2\exp(-Zr/3)\sin2\theta\sin\phi$


Schrödinger equation in atomic units is (assuming infinite nuclear mass, so that $\mu = m_e$)

$$\left(-\frac{1}{2}\nabla^2 - \frac{Z}{r}\right)\psi = E\psi \tag{4-48}$$

The energies are

$$E_n = -\frac{Z^2}{2n^2} \tag{4-49}$$

The lowest-energy solution is

$$\psi_{1s} = \sqrt{Z^3/\pi}\,\exp(-Zr) \tag{4-50}$$

The formulas for the hydrogenlike ion solutions (in atomic units) of most interest
in quantum chemistry are listed in Table 4-2. The tabulated functions are all in real,
rather than complex, form. Problems involving atomic orbitals are generally far easier
to solve in atomic units.
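As a small illustration of how compact the atomic-unit expressions are, the Python sketch below evaluates the energy formula $E_n = -Z^2/2n^2$ in hartrees and converts to electron volts using the hartree value quoted in Table 4-1. The function names and the constant name are illustrative.

```python
HARTREE_IN_EV = 27.21161  # 1 hartree in eV, as quoted in Table 4-1

def energy_hartree(n, Z=1):
    """Hydrogenlike-ion energy E_n = -Z^2 / (2 n^2), in hartrees."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    return -Z**2 / (2.0 * n**2)

def energy_ev(n, Z=1):
    """The same energy converted to electron volts."""
    return energy_hartree(n, Z) * HARTREE_IN_EV

for n in (1, 2, 3):
    print(n, energy_hartree(n), energy_ev(n))
```

The $n = 1$ level of hydrogen is $-1/2$ hartree, as listed in Table 4-1, and the $n = 2$ level of He$^+$ ($Z = 2$) has the same energy because $Z^2/n^2$ is unchanged.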


4-5 Angular Momentum and Spherical Harmonics

We have now discussed three problems in which a particle is free to move over the
entire range of one or more coordinates with no change in potential. The first case
was the free particle in one dimension. Here we found the eigenfunctions to be simple
trigonometric or exponential functions of $x$. The trigonometric form is identical to the
harmonic amplitude function of a standing wave in an infinitely long string. We might
refer to such functions as “linear harmonics.” The second case was the particle-in-a-ring
problem, which again has solutions that may be expressed either as sine-cosine or
exponential functions of the angle $\phi$. By analogy with linear motion, we could refer
to these as “circular harmonics.” Finally, we have described the hydrogenlike ion,
where the particle can move over the full ranges of $\theta$ and $\phi$ (i.e., over the surface of a
sphere) with no change in potential. The solutions we have just described (the products
$\Theta_{l,m}(\theta)\Phi_m(\phi)$) are called spherical harmonics and are commonly symbolized
$Y_{l,m}(\theta, \phi)$. Thus for $m \ge 0$

$$Y_{l,m}(\theta, \phi) = (-1)^m\left[\frac{(2l+1)}{4\pi}\frac{(l-|m|)!}{(l+|m|)!}\right]^{1/2}P_l^{|m|}(\cos\theta)\exp(im\phi) \tag{4-51}$$

and for $m < 0$ the factor $(-1)^m$ is omitted. Because so many physical systems have
spherical symmetry, spherical harmonics are very important in classical and quantum
mechanics.

Closely linked with spherical harmonics is angular momentum. Angular momentum
is an important physical property because it is conserved in an isolated dynamical
system; it is a constant of motion for the system. Angular momentum is described by
magnitude and direction, so it is a vector quantity.⁷ The classical system, in the absence
of external forces, is constrained to move in such a way as to preserve both the direction
and the magnitude of this vector. For a mass of $m$ kg moving in a circular orbit of
radius $r$ m with an angular velocity of $\omega$ radians per second, the angular momentum
has magnitude $mr^2\omega$ kg m²/s (or, alternatively, joule seconds). The direction of the
vector is given by the right-hand rule: the index finger of the right hand points along the
particle trajectory and the extended thumb points along the angular momentum vector
(see Fig. 4-10). (Alternatively, in a right-handed coordinate system, motion of a mass
in the $xy$ plane from $+x$ toward $+y$ produces angular momentum in the $+z$ direction.)
In vector notation, $\mathbf{L} = \mathbf{r} \times \mathbf{p}$, where $\mathbf{L}$ is angular momentum, $\mathbf{r}$ is the position vector,
and $\mathbf{p}$ is the linear momentum.

Some of the more interesting properties of angular momentum relate to the situation 
where circular motion occurs in the presence of an external field. A familiar example 
is a gyroscope mounted on a pivot and experiencing the gravitational field of the earth. 
The gyroscope flywheel is usually started with the gyroscope in an almost vertical 
position. After release, the gyroscope precesses about the axis of field direction. As 
time passes, the tilt of the gyroscope away from the field direction (which we take to be 
the z direction) increases (see Fig. 4-11). If there were no friction in the bearings, the 
angle of tilt would not change, and the gyroscope would precess about z indefinitely, 
maintaining whatever angle of tilt it found itself with initially. Notice that, in such 


mr? c 


Figure 4-10 » The angular momentum vector L for a particle of mass m moving with angular 
velocity w about a circular orbit of radius r in the direction indicated. 


"Strictly speaking, angular momentum is a pseudovector—it is dual to a second order antisymmetric tensor. 
However, for the remainder of this book, we can and shall ignore this distinction. 
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a case, the angular momentum due to the flywheel, Lr, is conserved in magnitude 
only. Its direction is constantly changing. Thus, Lr is not a constant of motion in the 
presence of a z-directed field. Neither are the components Ly, and Ly,, which change 
in magnitude as the gyroscope precesses. However, Lr, is a constant of motion if the 
angle of tilt does not change. If we add on to Lẹ the angular momentum Lg due to the 
precession of the gyroscope as a whole (including the center of mass of the flywheel 
but ignoring its rotation), we find that the total angular momentum for the gyroscope 
(including flywheel motion), L and its components Lx, Ly, and Lz behave similarly 
to Lr and its components (see Fig. 4-12). We may summarize these observations from 
classical physics as follows: a rotating rigid body conserves L (hence, Ly, Ly, Lz) 
in the absence of external forces. In the presence of a z-directed, time-independent 
external force, Lz and |L], the magnitude of L (but not its direction) are conserved. 
Furthermore, in a system comprising several moving parts, the total angular momentum 
is the sum of the individual angular momenta, and the z component is the sum of the 
individual z components: 


L-'L (4-52) 
boe X s (4-53) 


Many characteristics of the classical situation are maintained in quantum mechanics.
In particular, it can be shown that a hydrogenlike ion eigenfunction can always
be associated with “sharp” values for $L_z$, but not for $L_x$ or $L_y$, and that the
magnitude of L is sharp, but not its direction. We have indicated several times in this
book that a sharp value (i.e., a constant of motion) exists when a state function is an
eigenfunction for an operator associated with the property. For example, all of our
hydrogenlike ion wavefunctions are eigenfunctions for the hamiltonian operator, so all
are associated with sharp energies. We introduced the operator for the z-component of
angular momentum, in Section 2-6, as $(h/2\pi i)\,d/d\phi$. Generalizing to situations with
several variables requires switching to partial derivative notation. Then our operator,


Figure 4-11 » A gyroscope with the angular momentum of the flywheel, $L_F$, together with x, y,
and z components of $L_F$, at a given instant. The path of the gyroscope is also indicated.

Figure 4-12 » The total angular momentum of the gyroscope L is shown as the sum of $L_F$, the
angular momentum of the flywheel, and $L_G$, the angular momentum of the gyroscope. L precesses,
so only $L_z$ and the magnitude of L are constants of motion.


symbolized $\hat{L}_z$, is $(\hbar/i)\,\partial/\partial\phi$. In atomic units, $\hat{L}_z$ is $(1/i)\,\partial/\partial\phi$. This operator was
introduced in Section 2-6, where it was given the symbol $\hat{p}_\phi$. (A general discussion on
operators will be given in Chapter 6. The caret symbol is frequently used to denote an
operator.) Our statement that hydrogenlike eigenfunctions have sharp $L_z$ means that
we expect $\hat{L}_z \psi_{n,l,m}(r,\theta,\phi) = \text{constant} \cdot \psi_{n,l,m}(r,\theta,\phi)$. Since all these eigenfunctions
have $\exp(im\phi)$ as their only $\phi$-dependent term, it follows immediately that


$\hat{L}_z \psi_{n,l,m} = m\hbar\,\psi_{n,l,m}$ (4-54)
or, equivalently,
$\hat{L}_z Y_{l,m}(\theta,\phi) = m\hbar\,Y_{l,m}(\theta,\phi)$ (4-55)


Hence, the quantum number m is equal to the z component of angular momentum in
units of $\hbar$ for the state in question. This means that the angular momentum associated
with an s state ($l = 0$, so $m = 0$) has a zero z component, while a p state ($l = 1$, so
$m = -1, 0, +1$) can have a z component of $-\hbar$, 0, or $+\hbar$.

The other quantity that we have stated is conserved in these systems is the magnitude
of L. In quantum mechanics, it is convenient to deal with the square of this magnitude,
$L^2$. The quantum-mechanical operator associated with this quantity is


$\hat{L}^2 = -\hbar^2\left[(\partial^2/\partial\theta^2) + \cot\theta\,(\partial/\partial\theta) + (1/\sin^2\theta)(\partial^2/\partial\phi^2)\right]$
$\quad\;\; = -\hbar^2\left[(1/\sin\theta)(\partial/\partial\theta)\sin\theta\,(\partial/\partial\theta) + (1/\sin^2\theta)(\partial^2/\partial\phi^2)\right]$ (4-56)


The result of operating on $Y_{l,m}(\theta,\phi)$ with this operator is
$\hat{L}^2 Y_{l,m}(\theta,\phi) = l(l+1)\hbar^2\,Y_{l,m}(\theta,\phi)$ (4-57)


This means that the square of the magnitude of the total angular momentum equals
$l(l+1)\hbar^2$. Hence, for an s state it is zero, for a p state it is $2\hbar^2$, for a d state it is $6\hbar^2$, etc.
One can construct vector diagrams to parallel these relationships. A few of these are
sketched in Fig. 4-13.
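
Equation (4-57) can be spot-checked numerically. The sketch below is only an illustration, not part of the formal development: it applies the operator of Eq. (4-56), in atomic units, to an unnormalized $Y_{2,1}$ by central finite differences. The function names, the step size h, and the test point are arbitrary choices made here.

```python
import cmath
import math

def y21(theta, phi):
    """Unnormalized spherical harmonic Y_{2,1}: sin(theta) cos(theta) exp(i phi)."""
    return math.sin(theta) * math.cos(theta) * cmath.exp(1j * phi)

def l_squared(f, theta, phi, h=1e-4):
    """Apply the L^2 operator of Eq. (4-56), in atomic units, by central differences."""
    d_theta = (f(theta + h, phi) - f(theta - h, phi)) / (2 * h)
    d2_theta = (f(theta + h, phi) - 2 * f(theta, phi) + f(theta - h, phi)) / h**2
    d2_phi = (f(theta, phi + h) - 2 * f(theta, phi) + f(theta, phi - h)) / h**2
    cot = math.cos(theta) / math.sin(theta)
    return -(d2_theta + cot * d_theta + d2_phi / math.sin(theta) ** 2)

theta0, phi0 = 0.7, 1.3  # arbitrary test point away from the poles
ratio = l_squared(y21, theta0, phi0) / y21(theta0, phi0)
print(ratio)  # should be close to l(l+1) = 6 for a d-type function
```

The ratio is the same (to within the finite-difference error) at any point where the function is nonzero, which is what it means for the function to be an eigenfunction.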
Operators for $L_x$ and $L_y$ can also be constructed. They are


$\hat{L}_x = i\hbar\left[\sin\phi\,(\partial/\partial\theta) + \cot\theta\cos\phi\,(\partial/\partial\phi)\right]$ (4-58)


$\hat{L}_y = -i\hbar\left[\cos\phi\,(\partial/\partial\theta) - \cot\theta\sin\phi\,(\partial/\partial\phi)\right]$ (4-59)


Figure 4-13 » Vector relationships that satisfy the following rules: $L^2 = l(l+1)$, $L_z = m$,
$m = -l, -l+1, \ldots, 0, \ldots, l-1, l$. The quantum rules correspond to a classical analog where the
gyroscope can have only certain discrete angles of tilt. (a) s; $l = 0$; $l(l+1) = 0$; $m = 0$. (b) p;
$l = 1$; $l(l+1) = 2$; $m = -1, 0, +1$. (c) d; $l = 2$; $l(l+1) = 6$; $m = -2, -1, 0, +1, +2$. (All in atomic
units.)


The hydrogenlike eigenfunctions are not necessarily eigenfunctions for either of these 
operators (Problem 4-23). 
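
The vector relationships of Fig. 4-13 can be tabulated directly. The following sketch, in atomic units, lists the vector length $\sqrt{l(l+1)}$ and the discrete tilt angles from the z axis, using $\cos\theta = m/\sqrt{l(l+1)}$. The helper name `vector_model` is a hypothetical label chosen for this illustration.

```python
import math

def vector_model(l):
    """Return (length, [(m, tilt angle in degrees), ...]) in atomic units."""
    length = math.sqrt(l * (l + 1))
    rows = []
    for m in range(-l, l + 1):
        # For l = 0 the vector has zero length, so no tilt angle is defined.
        angle = math.degrees(math.acos(m / length)) if length > 0 else None
        rows.append((m, angle))
    return length, rows

for l, label in enumerate("spd"):
    length, rows = vector_model(l)
    print(label, round(length, 4), rows)
```

Note that the angle never reaches zero: even for $m = l$ the vector is longer than its z component, which is the vector-model picture of $L_x$ and $L_y$ not being sharp.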

It is interesting to consider the physical meaning of these results. If a quantity has a
sharp value, it means that we will always get that value no matter when we measure that
property for systems in the state being considered. Thus, the z component of angular
momentum for hydrogen atoms in the $2p_{+1}$ state will always be measured to be $+1\hbar$,
or one atomic unit. For the x or y component, however, repeated measurements (on an
ensemble of $2p_{+1}$ atoms) will yield a spread of values. We can measure (or compute)
an average value of $L_x$ or $L_y$ but not a sharp value. In terms of our mental model (a
gyroscope) this seems sensible enough except for one thing. Our hydrogenlike
eigenfunctions are solutions for a central field potential with no external field. Under such
conditions, L, $L_x$, $L_y$, and $L_z$ are classically all constants of motion. Why, then, are
they not all sharp quantum mechanically? The answer is that quantum-mechanical state
functions never contain more information than is, in principle, extractable by
measurement. To measure a component of angular momentum in a system always means, in
practice, subjecting the system to some sort of external force. Furthermore, this system
must obey the limitations decreed by the uncertainty principle. The hydrogenlike ion
wavefunctions cannot simultaneously be eigenfunctions for $\hat{L}_x$, $\hat{L}_y$, and $\hat{L}_z$ because
that would give simultaneous sharp values (i.e., no uncertainty) for the conjugate
variables angular momentum vector length and angular momentum vector orientation. This
would violate the uncertainty principle, which is in turn a reflection of limitations on
our ability to measure one variable without affecting another (see Section 1-8).

It is possible, working only with the quantum-mechanical operators, to generate
the eigenvalues of $\hat{L}_z$ and $\hat{L}^2$. This approach is a deviation from our main line of
development and is contained in Appendix 4. (It is recommended that Chapter 6 be
completed before reading Appendix 4.) We give only the results. They are


$\hat{L}_z f_{l,m} = m\hbar\,f_{l,m}, \quad m = -l, -l+1, \ldots, l-1, l$ (4-60)
$\hat{L}^2 f_{l,m} = l(l+1)\hbar^2\,f_{l,m}$ (4-61)


These look like the results already given in Eqs. (4-55) and (4-57). There is a difference,
however. Here there is no indication that m is an integer, whereas in Eq. (4-55) m
must be an integer, as indicated by the presence of zero in its value list. There are two
ways in which we can have a sequence of the form $-l, -l+1, \ldots, l-1, l$. One way
is to have an integer series, for example, $-2, -1, 0, +1, +2$, which must contain zero.
The other way is to have a half-integer series, for example, $-\tfrac{3}{2}, -\tfrac{1}{2}, +\tfrac{1}{2}, +\tfrac{3}{2}$, which
skips zero. If we work only with the properties of the operators, we find that either
possibility is allowed. But if we assume that the as-yet-unspecified eigenfunctions $f_{l,m}$
are separable into $\theta$- and $\phi$-dependent parts, we find ourselves restricted to the integer
series. For orbital angular momentum (due to motion of the electron in the atomic
orbital), the z component must be (in atomic units) an integer, for we have seen that
the state functions $\psi$ contain the spherical harmonics $Y_{l,m}$, which are indeed separable.
Spin angular momentum for an electron (to be discussed in more detail in the next
chapter) has half-integer z components of angular momentum, and the eigenfunctions
corresponding to spin cannot be expressed with spherical harmonics.
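
The two kinds of series allowed by Eq. (4-60) can be generated with exact rational arithmetic. This is a minimal sketch; the function name `m_series` is hypothetical, and the check on l simply encodes the statement above that only integer and half-integer values give a ladder running from $-l$ to $+l$ in unit steps.

```python
from fractions import Fraction

def m_series(l):
    """List m = -l, -l+1, ..., l for integer or half-integer l."""
    l = Fraction(l)
    if l < 0 or (2 * l).denominator != 1:
        raise ValueError("l must be a nonnegative integer or half-integer")
    return [-l + k for k in range(int(2 * l) + 1)]

print(m_series(2))               # integer series, contains zero
print(m_series(Fraction(3, 2)))  # half-integer series, skips zero
```

In both cases the list has $2l + 1$ members, so a half-integer l always gives an even number of m values.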


4-6 Angular Momentum and Magnetic Moment


If a charged particle is accelerated, a magnetic field is produced. Since circular motion
at constant speed requires acceleration (classically), it follows that a charged particle
having angular momentum will also have a magnetic moment. The magnetic moment
is proportional to the angular momentum, colinear with it, and oriented in the same
direction if the charge is positive. For an electron, the magnetic moment is given by


$\boldsymbol{\mu} = -\beta_e \mathbf{L}$ (4-62)


where $\beta_e$, the Bohr magneton, has a value of $9.274078 \times 10^{-24}$ J T$^{-1}$ (equal to 1/2 a.u.),
where T is magnetic field strength in tesla. ($\beta_e$ is defined to contain the $\hbar$ that belongs
to L, so it is only the $\sqrt{l(l+1)}$ part of L that is used in the calculation.)


EXAMPLE 4-6 What is the magnitude of the orbital magnetic moment for an electron
in a 3d state of a hydrogen atom? In a 4d state of He$^+$?


SOLUTION » For any d state, $l = 2$, so $|\mu| = \beta_e L = \beta_e\sqrt{l(l+1)} = \sqrt{6}\,\beta_e =
2.27 \times 10^{-23}$ J T$^{-1}$. (Since we want magnitude, we can ignore the minus sign.) The value does
not depend on the quantum number n nor on atomic number Z, so it is the same for He$^+$.
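
The arithmetic of Example 4-6 can be reproduced in a few lines. This sketch takes $\beta_e = 9.274078 \times 10^{-24}$ J T$^{-1}$ as the Bohr magneton; the function name is a hypothetical label for this illustration.

```python
import math

BOHR_MAGNETON = 9.274078e-24  # J/T, assumed value of the Bohr magneton

def orbital_moment_magnitude(l):
    """|mu| = beta_e * sqrt(l(l+1)), in J/T."""
    return BOHR_MAGNETON * math.sqrt(l * (l + 1))

mu_d = orbital_moment_magnitude(2)  # any d state, regardless of n or Z
print(f"{mu_d:.3e} J/T")
```

Because only l enters the function, the same call covers both the 3d state of hydrogen and the 4d state of He$^+$.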


Applying a magnetic field of strength B defines a z-direction about which the
magnetic moment vector precesses. The z-component, $\mu_z$, of the precessing vector interacts
with the applied field B. The interaction energy is


$E = -\mu_z B = \beta_e L_z B = \beta_e m B$ (4-63)

This means that some of the degeneracies among the energy levels of the hydrogenlike
ion will be removed by imposing an external magnetic field. For instance, the $2p_{+1}$
and $2p_{-1}$ energy levels will be raised and lowered in energy, respectively, while 2s and
$2p_0$ will be unaffected (see Fig. 4-14). This, in turn, will affect the atomic spectrum
for absorption or emission. The splitting of spectral lines due to the imposition of an
external magnetic field is known as Zeeman splitting. Because the splitting of levels
depicted in Fig. 4-14 is proportional to the z component of orbital angular momentum,
given by m, it is conventional to refer to m as the magnetic quantum number.
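
As a rough numerical illustration of Eq. (4-63), the sketch below lists the shifts of the $m = +1, 0, -1$ levels in an illustrative 1 T field, in joules and as equivalent frequencies. The field strength and the function name are assumptions made for this example, and only the orbital contribution discussed here is included.

```python
BOHR_MAGNETON = 9.274078e-24  # J/T
PLANCK = 6.62607015e-34       # J s

def zeeman_shift(m, b_field):
    """Energy shift beta_e * m * B of Eq. (4-63), in joules."""
    return BOHR_MAGNETON * m * b_field

for m in (+1, 0, -1):
    shift = zeeman_shift(m, 1.0)  # illustrative 1 T field
    print(m, shift, shift / PLANCK)  # joules and the equivalent frequency in Hz
```

The shifts are symmetric about the unshifted $m = 0$ level and scale linearly with both m and B.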

In the absence of external fields, eigenfunctions having the same n but different l
and m are degenerate. We have seen that this allows us to take linear combinations
of eigenfunctions, thereby arriving at completely real eigenfunctions like $2p_x$ and $2p_y$,
instead of $2p_{+1}$ and $2p_{-1}$. When a magnetic field is imposed, the degeneracy no longer
exists, and we are unable to perform such mixing. Under these conditions, $2p_x$, $2p_y$,
$3d_{xy}$, etc. are not eigenfunctions, and we are restricted to the pure $m = 0, \pm 1, \pm 2, \ldots$
type solutions.

Thus far we have indicated that the stationary state functions for the hydrogenlike
ions are eigenfunctions for $\hat{L}^2$ and $\hat{L}_z$, and we have compared this to the fact that $|L|$
and $L_z$ are constants of motion for a frictionless gyroscope precessing about an external
field axis. But how about atoms with several electrons? And how about molecules? Are
their stationary state functions also eigenfunctions for $\hat{L}^2$ and $\hat{L}_z$? A general approach
to this kind of question is discussed in Chapter 6. For now we simply note that the
spherical harmonics are eigenfunctions of $\hat{L}^2$ and $\hat{L}_z$ [Eqs. (4-55) and (4-57)] and
that any state function of the form $\psi(r,\theta,\phi) = R(r)Y_{l,m}(\theta,\phi)$ will necessarily be an
eigenfunction of these operators. But the spherical harmonics are solutions associated
with spherically symmetric potentials. Therefore, it turns out that eigenfunctions of
the time-independent hamiltonian operator are also eigenfunctions for $\hat{L}^2$ and $\hat{L}_z$ only
if the potential is spherically symmetric. In the more restricted case in which $\psi$ has
the form $\psi(r,\theta,\phi) = f(r,\theta)\exp(im\phi)$, $\psi$ will still be an eigenfunction of $\hat{L}_z$ but not
of $\hat{L}^2$. This situation applies in systems having cylindrically symmetric potentials,
dependent on r and $\theta$ but not $\phi$ (e.g., H$_2$, N$_2$). We discuss such cases in more detail in
Chapter 7.

Figure 4-14 » Energy levels of a hydrogenlike ion in the absence and presence of a z-directed
magnetic field.


4-7 Angular Momentum in Molecular Rotation: The Rigid Rotor


We have seen that the two-particle system of an electron and a nucleus rotating about a 
center of mass (COM) can be transformed to the one-particle system of a reduced mass 
rotating about a fixed point. However, this transformation can be made for any two-mass 
system, and so it applies also to the case of the nuclei of a rotating diatomic molecule. 
As we now show, the mathematical outcome for the rotating diatomic molecule is 
strikingly similar to that for the hydrogenlike ion. 

The simplest treatment of molecular rotation ignores vibrational motion by assuming
that the distance between the nuclei is fixed. The resulting model is therefore called
the rigid-rotor model. Let there be two nuclear masses, $m_1$ and $m_2$, separated from the
COM by distances $r_1$ and $r_2$, respectively. Then, because of the way that the COM is
defined, we have that


$m_1 r_1 = m_2 r_2$ (4-64)
The moment of inertia, I, is
$I = m_1 r_1^2 + m_2 r_2^2$ (4-65)


It is not difficult to show (Problem 4-35) that the same moment of inertia results from
a reduced mass $\mu$ rotating about a fixed point at a distance $r = r_1 + r_2$. That is, if

$\mu = \dfrac{m_1 m_2}{m_1 + m_2}$ (4-66)

then
$I = \mu r^2$ (4-67)


Therefore, solving the problem of a reduced mass $\mu$ rotating about a fixed point at the
fixed distance $r = r_1 + r_2$ is equivalent to solving the two-mass rigid-rotor problem. In
effect, the rotating-diatomic problem is transformed to a particle-on-the-surface-of-a-
sphere problem.
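
The equivalence of Eqs. (4-65) and (4-67) is easy to check numerically before proving it algebraically. The sketch below uses arbitrary illustrative masses and separation; the function names are hypothetical. It places the center of mass using Eq. (4-64) together with $r_1 + r_2 = r$, then compares the two expressions for I.

```python
def moment_two_masses(m1, m2, r):
    """I = m1*r1^2 + m2*r2^2 with r1, r2 fixed by m1*r1 = m2*r2 and r1 + r2 = r."""
    r1 = m2 * r / (m1 + m2)
    r2 = m1 * r / (m1 + m2)
    return m1 * r1**2 + m2 * r2**2

def moment_reduced_mass(m1, m2, r):
    """I = mu * r^2 with mu = m1*m2/(m1 + m2)."""
    mu = m1 * m2 / (m1 + m2)
    return mu * r**2

# Illustrative inputs in arbitrary units.
i_two = moment_two_masses(1.0, 35.0, 1.3)
i_red = moment_reduced_mass(1.0, 35.0, 1.3)
print(i_two, i_red)
```

A numerical agreement of this kind is not a proof, but it is a quick guard against sign or index slips in the algebra.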

As usual, we write the Schrödinger equation by starting with the general prescription
$[(-\hbar^2/2\mu)\nabla^2 + V]\psi = E\psi$. Then we recognize that V is constant over the spherical
surface (corresponding to the diatomic molecule having no preferred orientation), so
we can set $V = 0$. Since r is a constant, the first term in $\nabla^2$ [Eq. (4-7)] vanishes due to
the $\partial/\partial r$ operators. The resulting Schrödinger equation is [using Eq. (4-67)]


$\left[(1/\sin\theta)(\partial/\partial\theta)\sin\theta\,(\partial/\partial\theta) + (1/\sin^2\theta)(\partial^2/\partial\phi^2)\right]\psi(\theta,\phi) = (-2IE/\hbar^2)\,\psi(\theta,\phi)$ (4-68)

Equation (4-68) is the same as the $\Theta$, $\Phi$ equations seen earlier [(4-36) and (4-39)]
with $\beta = 2IE/\hbar^2$, and so we already know the eigenvalues and eigenfunctions. We
noted earlier that $\beta = l(l+1)$. For molecular rotation it is conventional to symbolize
this quantum number as J, rather than l. This leads to


$J(J+1) = 2IE/\hbar^2$ (4-69)
or
$E = J(J+1)\hbar^2/2I, \quad J = 0, 1, 2, \ldots$ (4-70)


Since V =0, E is entirely kinetic energy. Because r is not a variable, there is no analog 
to the principal quantum number n, and J is not limited in its highest value. (Note also 
that atomic units, which are designed to simplify electronic problems, are not normally 
used for molecular rotation or vibration.) 

The eigenfunctions are, as before, the spherical harmonics $Y_{J,m_J}(\theta,\phi)$, with $m_J =
0, \pm 1, \pm 2, \ldots, \pm J$. Thus we have an s-type solution ($J = 0$, $m_J = 0$) that has constant
value over the spherical surface, three p-type solutions ($J = 1$, $m_J = +1, 0, -1$), five
d-type solutions, etc. For each value of J, there are $2J + 1$ eigenfunctions.
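
The level pattern implied by Eq. (4-70) can be listed with a short script. This is a sketch in SI units; the moment of inertia used is a hypothetical round number, not a value for any particular molecule, and the function names are illustrative.

```python
HBAR = 1.054571817e-34  # J s

def rotor_energy(j, inertia):
    """E_J = J(J+1) hbar^2 / (2I), in joules, Eq. (4-70)."""
    return j * (j + 1) * HBAR**2 / (2 * inertia)

def rotor_degeneracy(j):
    """Number of m_J values, 2J + 1."""
    return 2 * j + 1

INERTIA = 1.0e-46  # kg m^2, a hypothetical moment of inertia
for j in range(4):
    print(j, rotor_energy(j, INERTIA), rotor_degeneracy(j))
```

The energies grow as $J(J+1)$, so the gaps between successive levels widen linearly with J, unlike the hydrogenlike levels, which converge.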

The s-type solution has zero energy. One can imagine that the reduced mass is 
motionless on the surface of the sphere and has equal probability for being found 
anywhere. This transforms back to a picture where the diatomic molecule is not 
rotating and where there is no preferred orientation. Since $E = 0$ when $J = 0$, we
conclude that there is no zero-point energy for free rotation. (However, if rotation is 
restricted so that some orientations become preferred, the zero-point energy becomes 
finite.) 

The three p-type solutions are degenerate, hence can be mixed to give real functions
analogous to $p_x$, $p_y$, $p_z$. We could represent the $p_z$ function by taking a globe and
marking a circular region of positive phase around the northern polar region, with a
matching region of negative phase around the southern pole. Clearly, these rigid-rotor
wavefunctions have the same symmetries as their hydrogenlike counterparts. A $p_z$
rotational state corresponds to a molecule (or ensemble of molecules) rotating with
kinetic energy $2\hbar^2/2I$ (since $J = 1$) with little likelihood of finding the reduced mass
near the equator (i.e., with little probability of finding the diatomic molecule oriented
nearly perpendicular to the z axis).

Angular momentum for the rigid rotor also follows the hydrogenlike system rules.
The square of the total angular momentum equals $J(J+1)\hbar^2$, and the z component
equals $m_J\hbar$.

Transitions between rotational energy levels can be detected spectroscopically. Once 
such energy differences are identified with specific changes in J, it becomes a simple 
matter to solve for r, the internuclear distance in the rigid rotor. 

For example, suppose an absorption peak for H$^{81}$Br seen at 101.58 cm$^{-1}$ is
assigned to the $J = 6 \leftarrow 5$ transition. Since the energies of these levels are
$42\hbar^2/2I$ and $30\hbar^2/2I$, respectively, their difference is $12\hbar^2/2I$. This energy is
equal to that of the photons of light at 101.58 cm$^{-1}$. Solving this relation gives
$I = 3.3069 \times 10^{-47}$ kg m$^2$. We know that this equals $\mu r^2$, and we know how to get
$\mu$ from $m_H$ and $m_{Br}$ [Eq. (4-66)]. So we can solve for r, finding $r = 141.44$ pm
(Problem 4-37).
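
The chain of steps in this example (photon energy, then I, then r) can be sketched as follows. The physical constants are standard SI values supplied here, the atomic masses are those quoted in Problem 4-37, and the function name is a hypothetical label for this illustration.

```python
import math

H = 6.62607015e-34        # Planck constant, J s
C_CM = 2.99792458e10      # speed of light, cm/s
AMU = 1.66053907e-27      # kg per atomic mass unit

def inertia_from_line(wavenumber_cm, j_lower):
    """Moment of inertia from a J+1 <- J absorption at the given wavenumber."""
    hbar = H / (2 * math.pi)
    delta_e = H * C_CM * wavenumber_cm           # photon energy, J
    # E(J+1) - E(J) = 2(J+1) hbar^2 / (2I)  =>  I = (J+1) hbar^2 / delta_e
    return (j_lower + 1) * hbar**2 / delta_e

inertia = inertia_from_line(101.58, 5)
mu = 1.0078 * 80.9163 / (1.0078 + 80.9163) * AMU  # reduced mass, kg
r = math.sqrt(inertia / mu)
print(inertia, r * 1e12)  # kg m^2 and pm
```

For the $6 \leftarrow 5$ transition the level difference is $12\hbar^2/2I$, which is the special case $J = 5$ of the general spacing $2(J+1)\hbar^2/2I$ used in the code.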
4-8 Summary


1. The motion of two masses moving about a center of mass can be transformed to
   motion of a single reduced mass moving about a fixed point. The radius of rotation
   for the reduced mass is identical to the distance of separation of the original two
   masses. For hydrogenlike ions, the nuclear mass is so much greater than the electron
   mass that the reduced mass is almost identical to the electron mass.

2. The bound-state energies for time-independent states of the hydrogenlike ion depend
   on only the atomic number Z and the quantum number n (a positive integer) and
   vary as $-Z^2/2n^2$. This means that the energies get closer together as n increases
   and that there is an infinite number of such negative energy levels. Each such energy
   level has degeneracy $n^2$. A continuum of energies exists for unbound ($E > 0$) states.

3. Each stationary state wavefunction is characterized by three quantum numbers, n,
   l, and m, all integers, with l ranging from 0 to $n - 1$ and m ranging from $-l$ to
   $+l$. If $l = 0$, we have an s state and $\psi$ is spherically symmetric with a cusp at the
   nucleus. If $l = 1, 2, \ldots$ we have a p, d, ... state, and $\psi$ vanishes at the nucleus and
   is not spherically symmetric. In all states there is a finite probability for finding the
   electron beyond the classical turning point.

4. Eigenfunctions $R_{n,l}(r)\Theta_{l,m}(\theta)\Phi_m(\phi)$ with $m \neq 0$ are complex but can be mixed to
   form real eigenfunctions. However, if an external field causes states of different m
   to be nondegenerate, such mixing will produce noneigenfunctions.

5. All the stationary state eigenfunctions are orthogonal to each other, and radial and/or
   angular (usually planar) nodes are instrumental in this. The effect of a radial node
   on energy is the same as that of an angular node, so that all eigenfunctions with,
   say, three nodes (all radial, all angular, or a combination) are degenerate. This is
   peculiar to the $-r^{-1}$ potential.

6. Separation of variables is not "perfectly clean" since the differential equations for R
   and $\Theta$ [Eqs. (4-38) and (4-39)] are linked through $\beta$ and those for $\Theta$ and $\Phi$ [Eq. (4-40)]
   are linked through m. This leads to interdependencies in the values of n, l, and m.

7. Spherical harmonics are the angular parts of solutions to Schrödinger equations for
   systems having spherically symmetric potentials. These functions are eigenfunctions
   of $\hat{L}_z$ and $\hat{L}^2$ as well as $\hat{H}$, so such states have sharp values of $L_z$, $L^2$, and E.
   The value of $L_z$ is $m\hbar$, and for $L^2$ it is $l(l+1)\hbar^2$, where l and m must be integers.
   In atomic units the quantity $\hbar$ does not appear in these formulas.

8. The z component of the magnetic moment due to orbital motion of a charged particle
   is proportional to $m\beta_e$, and so m is called the magnetic quantum number. This
   magnetic moment contributes to the Zeeman splitting seen in spectra of hydrogenlike
   ions in magnetic fields.

9. Eigenfunctions other than spherical harmonics exist for $\hat{L}^2$ and $\hat{L}_z$, but these are not
   separable into $\theta$- and $\phi$-dependent functions. In these cases, l and m can be
   half-integers. These cases do not arise in orbital motion, but do arise in spin problems.

10. If V is cylindrically symmetric [i.e., $V = V(r,\theta)$], the eigenfunctions of the
    hamiltonian are still eigenfunctions for $\hat{L}_z$ but not for $\hat{L}^2$. Hence, m still equals the z
    component of angular momentum for such a system. Here, z is the direction of the
    axis of rotational symmetry, i.e., the internuclear axis of the molecule.

11. The rules for allowed angular momentum magnitudes and orientations are the
    same for the rigid rotor as for the hydrogenlike ion. This leads to the following
    relationships in terms of the rotational quantum numbers $J = 0, 1, 2, \ldots$, $m_J =
    0, \pm 1, \ldots, \pm J$:

    Length of angular momentum vector = $\sqrt{J(J+1)}\,\hbar$.
    Component of angular momentum perpendicular to internuclear axis = $m_J\hbar$.
    Kinetic energy of rotation: $T_J = J(J+1)\hbar^2/2I$.
    Degeneracy of level: $g_J = 2J + 1$.


4-8.A Problems 


4-1. An observed spectroscopic transition in the hydrogen atom involves the $2p \leftarrow 1s$
transition. Using Eq. (4-8), evaluate this energy difference in units of hertz (Hz).
(1 Hz = 1 s$^{-1}$.) Do the calculation using both $m_e$ and $\mu$. (See Appendix 10 for
constants and conversion factors.) How much error in this calculation, in parts
per million, is introduced by ignoring the finite mass of the nucleus (i.e., using
$m_e$ instead of $\mu$)?

4-2. Sketch qualitatively, on the same r axis, $\psi^2(r)$ for a 1s state and $4\pi r^2$ for the
variation of dv with r. Sketch the radial distribution function, which is the product
of these two functions, and explain why it vanishes at $r = 0, \infty$.

4-3. For a hydrogen atom in the 1s state, $\psi = (1/\sqrt{\pi})\exp(-r)$, in atomic units.

a) Calculate the value of r (in a.u.) at the classical turning point.
b) Calculate the percentage of the electronic charge that is predicted to be beyond
the classical turning point. (See Appendix 1 for useful integrals.)

4-4. Using atomic units, compute for a 1s electron of the hydrogenlike ion [$\psi =
\sqrt{Z^3/\pi}\,\exp(-Zr)$]. (See Appendix 1 for useful integrals.)

a) the most probable distance of the electron from the nucleus,
b) the average distance of the electron from the nucleus,
c) the distance from the nucleus of maximum probability density,
d) the average value of the potential energy ($V = -Z/r$).

Note how these quantities depend on atomic number Z. Also, why do you think that,
when $Z = 1$, (d) is not minus the reciprocal of (b)? Why is (d) lower than minus the
reciprocal of (b)?

4-5. Demonstrate by integration that the 1s and 2s orbitals of the hydrogen atom are
orthogonal. (See Appendix 1 for useful integrals.)

4-6. Normalize the function $r\exp(-r)\cos\theta$.

4-7. Obtain the average value of position x, for a particle moving in a one-dimensional
harmonic oscillator potential in a state with the normalized wavefunction

$\psi = (\beta/48^2\pi)^{1/4}\left[(2\sqrt{\beta}\,x)^3 - 12\sqrt{\beta}\,x\right]\exp(-\beta x^2/2)$

[There is an easy way to do this problem.]

4-8. For a particle in a one-dimensional box with boundaries at $x = 0$ and L and for
any quantum number n:

a) Show how you would set up the calculation for the mean square deviation of
the particle from its average position.
b) Explain qualitatively how you would expect the value of (a) to vary with
quantum number n.
c) Evaluate the expression from part (a) in terms of n and L. Calculate the value
for $n = 1, 2$. Discuss the relative values of these numbers for reasonableness.

4-9. Sketch the 2p, and the 3d,, wavefunctions. Demonstrate, without explicitly
integrating, that these are orthogonal.

4-10. Repeat Problem 4-4, but for the $2p_0$ wavefunction.

4-11. Find an expression for the classical turning radius for a hydrogenlike ion in terms
of n, l, m, and Z.

4-12. Show that the sum of the charge distributions of all five 3d orbitals is spherically
symmetric.

4-13. Try to answer the following questions without looking up formulas or using
pencil and paper. Use atomic units.

a) What is the energy of the hydrogen atom in the 1s state?
b) What is the energy of He$^+$ when $n = 1$? $n = 2$?
c) What is the degeneracy of the $n = 5$ energy level of hydrogen?
d) How many planar nodes does the 4d, orbital have? How many radial nodes?
e) What is the potential energy in a hydrogen atom when the electron is 0.50 a.u.
from the nucleus?

4-14. You should be able to answer the following questions (use a.u.) in your head or
with trivial calculations.
An unnormalized eigenfunction for the hydrogen atom is

$\psi = (27 - 18r + 2r^2)\exp(-r/3)$

a) What are the l and m quantum numbers for this state?
b) How many radial nodes does this function possess?
c) What is the energy of this state?
d) What is the classical turning radius for this state?
4-15. $\psi = N(6r - r^2)\exp(-r/3)\sin\theta\sin\phi$ is an eigenfunction, in a.u., for the
hydrogen atom hamiltonian. N is the normalizing constant. Without looking at
formulas in the text, answer the following questions by inspection:

a) Is there a node in the r coordinate? If so, where?
b) Which state is this? (Give orbital symbol, e.g., 1s, $2p_z$, etc.)

4-16. Without comparing to tabulated formulas, state whether each of the following
could reasonably be expected to be an eigenfunction (unnormalized) of the
hamiltonian for a hydrogenlike ion. Explain why.

a) $(27 - 18Zr - 2Z^2r^2)\exp(-2Zr/3)$
b) $r\exp(Zr/2)\sin\theta\cos\phi$
c) $r\sin\theta\exp(-i\phi)$.

4-17. Calculate the average value of x for the 1s state of the hydrogen atom. Explain
why your result is physically reasonable.

4-18. a) Calculate the most probable value of $\theta$ in the 2p, state of the hydrogen atom.
b) Calculate the values of $\theta$ corresponding to nodal cones in the $3d_{z^2}$ orbital.

4-19. Demonstrate that Eq. (4-30) for $3d_{xy}$ has the same angular dependence as the
function xy.

4-20. Verify Eq. (4-45) using Pi(x) with Pi(x) and with P3(x) [from Eq. (4-42)].
What does Eq. (4-45) imply for the integral over all space of Y3 po vaa. ,?

4-21. Using Eqs. (4-51) and (4-43), write the normalized spherical harmonic function
$Y_{3,-2}(\theta,\phi)$. For which type of hydrogenlike AO does this function give the
angular dependence?

4-22. Verify Eq. (4-55).

4-23. Test the $2p_0$ eigenfunction to see if it is an eigenfunction for $\hat{L}_x$ or $\hat{L}_y$ [Eqs.
(4-58) and (4-59)]. Show that the 1s function is an eigenfunction of $\hat{L}_x$, $\hat{L}_y$,
$\hat{L}_z$, and $\hat{L}^2$. Explain, in terms of the vector model, this seeming violation of the
discussion in the text.

4-24. Work out the value of ÊZ Yop, by brute force and show that the result agrees with
Eq. (4-57).

4-25. Sketch the vector diagram (as in Fig. 4-13) for the 4f orbitals of hydrogen. How
does this compare to the diagram for the 6f orbitals of He$^+$?

4-26. Sometimes eigenfunctions for an operator can be mixed together to produce
new functions that are still eigenfunctions. Listed below are some operators
with pairs of their eigenfunctions. Indicate in each case whether mixtures of
these pairs will or will not continue to be eigenfunctions for the operator shown.
(You should be able to do this by inspection, using your knowledge of these
systems.)
      Operator                      Eigenfunctions

a) $\hat{H}$ (1 dim. box)           "m V3
b) $\hat{H}$ (ring)                 $\sin 3\phi$, $\cos 3\phi$
c) $\hat{L}_z$ (ang. mom.)          $\exp(3i\phi)$, $\exp(-3i\phi)$
d) $\hat{H}$ (cubical box)          $\psi_{1,2,3}$, $\psi_{2,2,2}$
e) $\hat{H}$ (H atom)               Vos Wp»
f) $\hat{H}$ (H atom)               Vas Vas

4-27. A hydrogenlike ion is in a state having a z-component of angular momentum
equal to $-2$ a.u.

a) What is the smallest possible value of the length of the angular momentum
vector for this state?
b) What symbol describes the state corresponding to your answer to part (a)?

4-28. Calculate the average angular momentum, $L_z$, for a particle in a ring of constant
potential having wavefunction

a) $(1/\sqrt{\pi})\sin 3\phi$
b) $(1/\sqrt{2\pi})\exp(-3i\phi)$

4-29. Evaluate each of the following integrals. Look for labor-saving approaches.
Integrals are over all space unless otherwise indicated.

a) f Wop, -Wap dv
b) f Wop, L? yop, dv
c) S Wop, Lx Wop,dv
d) f Va; S Vp, dv
e) J)” exp Qi) exp —3i9) do

4-30. Evaluate each of the following in a.u. (You should be able to answer these by
inspection.) Note: $\psi_{n,l,m}$ stands for a hydrogen atom eigenfunction of $\hat{H}$.

a) E5354
b) L’ Wap,
c) Ap,
d) (1/7) (0/06) Vp ,

4-31. For each of the following operators, indicate by “yes” or “no” whether 3p,
(with $Z = 1$) is an eigenfunction. If it is, then also give the eigenvalue in a.u.

a) -1v? — Ar
b) -iv?-3jr
c) bs
d) —5V?
e) D.
f) 4
g) r
h) 1/r

4-32. For the 3s state of the hydrogen atom, estimate the amount of electronic charge
between 211 pm and 213 pm.

4-33. Calculate in hertz the splitting between $2p_0$ and 2p, of a hydrogen atom by a
magnetic field of 2 tesla. Compare this with the $2p \leftarrow 1s$ transition energy (in
parts per million).

4-34. Try to answer (by inspection) the following questions about the $n = 4$ states of
the hydrogen atom.

a) What is the energy of the level, in a.u.?
b) What is the degeneracy of the level?
c) What values for the length of the orbital angular momentum vector (in a.u.)
are possible?
d) Into how many sublevels (of energy) is the $n = 4$ level split by imposition of
a magnetic field?
e) What is the degeneracy of the unshifted portion of the sublevels referred to
in (d)?

4-35. Show that the moment of inertia for two masses, $m_1$ and $m_2$, moving on a rigid
massless bar about the center of mass at distances $r_1$ and $r_2$, respectively, is
identical to that of a reduced mass $\mu = m_1 m_2/(m_1 + m_2)$ moving about a point
at a distance of $r = r_1 + r_2$.

4-36. Show that Eq. (4-68) can be written, using $\hat{L}^2$, in a form that is analogous to the
classical relation for a freely rotating mass, $L^2/2I = T$.

4-37. Using that $m_H = 1.0078$ a.m.u. and $m_{Br} = 80.9163$ a.m.u., calculate $\mu$ and verify
the value of r given at the end of Section 4-7.

4-38. Assuming an internuclear distance of 127.5 pm in D$^{35}$Cl, compute the expected
positions in cm$^{-1}$ for the absorption peaks corresponding to $J = 1 \leftarrow 0$, $2 \leftarrow 1$,
$3 \leftarrow 2$ (D = 2.0141 a.m.u., $^{35}$Cl = 34.9688 a.m.u.).

4-39. The $J = 1 \leftarrow 0$ transition in $^{12}$C$^{16}$O occurs at 3.86 cm$^{-1}$. Calculate the
internuclear distance. ($^{12}$C = 12 a.m.u. by definition, $^{16}$O = 15.9949 a.m.u.)

4-40. HCl has a permanent electric dipole moment, which means that the reduced
mass has a partial electric charge in the transformed version of the system.
This in turn means that there is a magnetic moment vector parallel to the total
angular momentum vector. Describe qualitatively what happens to the
energies of the $J = 3$ rotational states when HCl is subjected to a uniform magnetic
field.

4-41. Uncertainty in position in one dimension, $\Delta x$, is defined as
$\left[\langle x^2\rangle - \langle x\rangle^2\right]^{1/2}$. That is,
it is the square root of the difference between the average squared position and
the square of the average position. Calculate $\Delta r$ for the 1s state of the hydrogen
atom.
Multiple Choice Questions
(Try to answer these without referring to the text or using pencil and paper.) 


1. A particle on the surface of a sphere has quantum number $J = 7$. The energy level
to which this state belongs has a degeneracy of 


a) 56 
b) 49 
c) 42 
d) 14 
e) None of these. 


2. A particle on the surface of a sphere in the state having $J = 4$, $m_J = 4$


a) has $E = 16\hbar^2/2I$.

b) has a z-component of angular momentum of $4\hbar$.

c) doesn't exist because this state violates quantum number rules.
d) has a degeneracy of 20. 

e) None of the above is a true statement. 


3. HI and DI are made to undergo the same transition (say $J = 11 \leftarrow J = 10$, but the
particular J values are not important). The light frequency inducing the transition
for HI is equal to $\nu$. Approximately which frequency would you expect to induce
the same transition for DI?


a) $2\nu$
b) $\nu/2$
c) $\sqrt{2}\,\nu$
d) $\nu/\sqrt{2}$


e) None of these. 


4. The electronic energy of Li$^{2+}$ in the 2s state is


a) the same as that of H in the 1s state.
b) nine times that of H in the 1s state.
c) one-fourth that of H in the 1s state.
d) four-ninths that of H in the 1s state.
e) nine-fourths that of H in the 1s state.


5. Consider the following expressions, where ψ is a hydrogen-atom wavefunction. 


1) ∫ ψ* r ψ dv                 2) (d/dr)(ψ*ψ r² sin θ) = 0 
3) (d/dr)(ψ*ψ) = 0             4) dψ/dr = 0 


Which one of the following is a true statement? 


a) Expression 1 is equal to unity if ψ is normalized. 
b) Expression 4 is true when r is the position of a radial node. 
c) Expression 3 is true everywhere because ψ*ψ is a constant. 
d) Expression 2 is true when r is at its most probable value. 
e) Expression 3 is true when r is at its most probable value. 
6. A 5d_xy atomic orbital has 


a) 2 planar and 1 radial nodes. 
b) 2 planar and 3 radial nodes. 
c) 3 planar and 1 radial nodes. 
d) 5 nodes all together. 

e) None of the above is correct. 


7. For a hydrogen atom in an n = 4 state, the maximum possible z-component of orbital 
angular momentum is 


a) 2ħ 
b) 3ħ 
c) √12 ħ 
d) √6 ħ 
e) None of the above is correct. 


8. The following is an eigenfunction for the hydrogen atom: 

$$\psi = \frac{1}{\sqrt{32\pi a_0^3}}\,(r/a_0)\exp(-r/2a_0)\cos\theta$$

Which one of the following statements about ψ is true? 


a) The term on the left, up to the exponential, is the normalizing constant. 
b) This ψ has a nonzero value at the nucleus. 
c) For this state, l = 1 and m_l = 0. 
d) This state has a spherical electron cloud distribution. 
e) None of the above statements is true. 


9. The radial distribution function for a 1s state, 4πr²ψ², indicates that 


a) the most probable value of the distance from the nucleus is zero. 

b) the average value of r is zero. 

c) the average value of r is greater than the most probable value. 

d) the average value of r is equal to the most probable value. 

e) the electron cloud density per cubic picometer is greatest at a radius other 
than zero. 
Chapter 5 


Many-Electron Atoms 


5-1 The Independent Electron Approximation 


In previous chapters we have dealt with the motion of a single particle in various 
potential fields. When we deal with more than one particle, new problems arise and 
new techniques are needed. Some of these are discussed in this chapter. 

In constructing the hamiltonian operator for a many-electron atom, we shall assume 
a fixed nucleus and ignore the minor error introduced by using electron mass rather 
than reduced mass. There will be a kinetic energy operator for each electron and 
potential terms for the various electrostatic attractions and repulsions in the system. 
Assuming n electrons and an atomic number of Z, the hamiltonian operator is (in atomic 
units) 

$$H(1,2,3,\ldots,n) = -\frac{1}{2}\sum_{i=1}^{n}\nabla_i^2 - \sum_{i=1}^{n}\frac{Z}{r_i} + \sum_{i=1}^{n-1}\sum_{j=i+1}^{n}\frac{1}{r_{ij}} \tag{5-1}$$

The numbers in parentheses on the left-hand side of Eq. (5-1) symbolize the spatial 
coordinates of each of the n electrons. Thus, 1 stands for x₁, y₁, z₁, or r₁, θ₁, φ₁, 
etc. We shall use this notation frequently throughout this book. Since we are not here 
concerned with the quantum-mechanical description of the translational motion of the 
atom, there is no kinetic energy operator for the nucleus in Eq. (5-1). The index i refers 
to the electrons, so we see that Eq. (5-1) provides us with the desired kinetic energy 
operator for each electron, a nuclear-electronic attraction term for each electron, and an 
interelectronic repulsion term for each distinct electron pair. (The summation indices 
guarantee that 1/r₁₂ and 1/r₂₁ will not both appear in H. This prevents counting the 
same physical interaction twice. The indices also prevent nonphysical self-repulsion 
terms, such as 1/r₂₂, from occurring.) Frequently used alternative notations for the 
double summation in Eq. (5-1) are (1/2) Σ_{i≠j} 1/r_ij, which counts each interaction twice 
and divides by two, and Σ_{i<j} 1/r_ij, which is merely a shorthand symbol for the 
expression in Eq. (5-1). In each of these alternative notations, the summation is still 
over two indices, but the second Σ symbol is “understood.” 
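The index bookkeeping in the repulsion term can be checked with a short Python sketch. The 1/r_ij values below are made-up numbers chosen only for illustration, and the function names are hypothetical; the point is simply that the i < j double sum of Eq. (5-1) and the "count twice, divide by two" form give the same result.

```python
def repulsion_pairs(n):
    """Index pairs (i, j) with i < j, as generated by the double sum in Eq. (5-1)."""
    return [(i, j) for i in range(1, n) for j in range(i + 1, n + 1)]

def repulsion_sum_ordered(inv_r, n):
    """Sum of 1/r_ij over i < j: each distinct electron pair appears once."""
    return sum(inv_r[i][j] for i, j in repulsion_pairs(n))

def repulsion_sum_halved(inv_r, n):
    """Sum of 1/r_ij over all i != j, divided by two."""
    total = sum(inv_r[i][j]
                for i in range(1, n + 1)
                for j in range(1, n + 1)
                if i != j)
    return 0.5 * total

# Hypothetical symmetric table of 1/r_ij values for n = 3 electrons.
n = 3
inv_r = {1: {2: 0.50, 3: 0.25},
         2: {1: 0.50, 3: 0.40},
         3: {1: 0.25, 2: 0.40}}

pairs = repulsion_pairs(n)
print(pairs)                              # [(1, 2), (1, 3), (2, 3)]
print(len(pairs) == n * (n - 1) // 2)     # True: n(n-1)/2 distinct pairs
print(repulsion_sum_ordered(inv_r, n), repulsion_sum_halved(inv_r, n))
```

No self-repulsion pair such as (2, 2) and no repeated pair such as (2, 1) is generated, which is exactly what the summation limits are designed to ensure.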

For the helium atom, Eq. (5-1) becomes (see Figure 5-1) 


$$H(1,2) = -\frac{1}{2}\nabla_1^2 - \frac{1}{2}\nabla_2^2 - \frac{2}{r_1} - \frac{2}{r_2} + \frac{1}{r_{12}} \tag{5-2}$$

The helium hamiltonian (5-2) can be rewritten as 

$$H(1,2) = h(1) + h(2) + 1/r_{12} \tag{5-3}$$

where 

$$h(i) = -\frac{1}{2}\nabla_i^2 - 2/r_i \tag{5-4}$$

Figure 5-1 Interparticle coordinates for a three-particle system consisting of two electrons and 
a nucleus. 


In Eq. (5-3) we have merely grouped H into two one-electron operators and one two- 
electron operator. There is no way to separate this hamiltonian completely into a sum 
of one-electron operators without loss of rigor. However, if we wish to approximate the 
hamiltonian for helium in such a way that it becomes separable, we might try simply 
ignoring the interelectronic repulsion term: 


$$H_{\mathrm{approx}} = h(1) + h(2) \tag{5-5}$$


If we do this, our approximate hamiltonian Happrox treats the kinetic and potential 
energies of each electron completely independently of the motion or position of the 
other. For this reason, such a treatment falls within the category of "independent 
electron approximations." 

Notice that each individual one-electron hamiltonian (5-4) is just the hamiltonian for 
a hydrogenlike ion, so it has as eigenfunctions the 1s, 2s, 2p, etc., functions of Chapter 4 
with Z = 2. Such one-electron functions are referred to as atomic orbitals.¹ Representing 
them with the symbol φᵢ (e.g., φ₁ = 1s, φ₂ = 2s, φ₃ = 2p_x, φ₄ = 2p_y, etc.) we have, 
then, 

$$h(1)\phi_i(1) = \epsilon_i\phi_i(1) \tag{5-6}$$

where εᵢ is referred to as the orbital energy, or one-electron energy, for atomic orbital 
φᵢ. As we saw in Chapter 4, εᵢ is given in atomic units by 

$$\epsilon_i = -\frac{1}{2}\frac{Z^2}{n^2} \tag{5-7}$$

where n is the principal quantum number for φᵢ, and Z is the nuclear charge. The “1” in 
Eq. (5-6) indicates that φᵢ(1) is a function whose variable is the position of electron 1. 

We will now show that products of the atomic orbitals φ are eigenfunctions of 
Happrox. Let the general product of atomic orbitals for helium be written φᵢ(1)φⱼ(2). 
Then 

$$H_{\mathrm{approx}}\phi_i(1)\phi_j(2) = [h(1) + h(2)]\phi_i(1)\phi_j(2) \tag{5-8}$$

$$= h(1)\phi_i(1)\phi_j(2) + h(2)\phi_i(1)\phi_j(2) \tag{5-9}$$

But h(1) does not contain any of the variables in φⱼ(2), and so they commute. Similarly, 
h(2) and φᵢ(1) commute, and 

$$\begin{aligned}
H_{\mathrm{approx}}\phi_i(1)\phi_j(2) &= \phi_j(2)h(1)\phi_i(1) + \phi_i(1)h(2)\phi_j(2)\\
&= \phi_j(2)\epsilon_i\phi_i(1) + \phi_i(1)\epsilon_j\phi_j(2) \qquad \text{[from Eq. (5-6)]}\\
&= (\epsilon_i + \epsilon_j)\phi_i(1)\phi_j(2) = E\,\phi_i(1)\phi_j(2)
\end{aligned} \tag{5-10}$$

Thus, φᵢ(1)φⱼ(2) is an eigenfunction of Happrox, and the eigenvalue E is equal to 
the sum of the orbital energies. These results are yet another example of the general 
rules stated in Section 2-7 for separable hamiltonians. Indeed, once we recognized that 
Happrox is separable, we could have written these results down at once. 

¹The term “atomic orbital” is used for any one-electron function used to describe the electronic distribution 
about an atom. 


Since the above terminology and results are so important for understanding many 
quantum-chemical calculations, we will summarize them here: 


1. The hamiltonian for a multielectron system cannot be separated into one-electron 
parts without making some approximation. 


2. Ignoring interelectron repulsion operators is one way to allow separability. 


3. The one-electron operators in the resulting approximate hamiltonian for an atom are 
hydrogenlike ion hamiltonians. Their eigenfunctions are called atomic orbitals. 


4. Simple products of atomic orbitals are eigenfunctions for the approximate hamilto- 
nian. 


5. In this approximation the total energy is equal to the sum of the one-electron energies. 


EXAMPLE 5-1 What electronic energy is predicted by the above approximation for 
the lithium atom in its ground state? What is the experimental value for the total 
electronic energy, given that the first and second ionization energies are 0.198 a.u. 
and 2.778 a.u.? 


SOLUTION » The ground state configuration for lithium is 1s²2s, so 

$$E_{\mathrm{approx}} = 2\epsilon_{1s} + \epsilon_{2s} = 2\left(-\frac{1}{2}\cdot\frac{3^2}{1^2}\ \text{a.u.}\right) + \left(-\frac{1}{2}\cdot\frac{3^2}{2^2}\ \text{a.u.}\right) = -10.125\ \text{a.u.}$$

The experimental value of E equals minus the sum of all three ionization energies. The first 
two values are given, and the third can be calculated using the formula for one-electron ions: 

$$IE_3 = -E_{\mathrm{Li}^{2+}} = -\left(-\frac{1}{2}\cdot\frac{3^2}{1^2}\right)\ \text{a.u.} = 4.500\ \text{a.u.}$$

Therefore, Eexp = −(0.198 + 2.778 + 4.500) a.u. = −7.476 a.u. Clearly, the approximate 
hamiltonian predicts an electronic energy that is much lower than the experimental value. 
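The arithmetic of Example 5-1 can be reproduced with a few lines of Python. This is only a sketch of the independent electron energy sum; the function names are hypothetical, and the ionization energies are the values quoted in the example.

```python
def orbital_energy(Z, n):
    """Hydrogenlike orbital energy of Eq. (5-7), in atomic units."""
    return -0.5 * Z**2 / n**2

def independent_electron_energy(Z, occupied_n):
    """Sum of orbital energies; occupied_n holds one principal quantum number per electron."""
    return sum(orbital_energy(Z, n) for n in occupied_n)

# Lithium ground state 1s^2 2s: Z = 3, electrons with n = 1, 1, 2.
e_approx = independent_electron_energy(3, [1, 1, 2])

# Experimental total energy = -(IE1 + IE2 + IE3).
ie1, ie2 = 0.198, 2.778        # a.u., as given in Example 5-1
ie3 = -orbital_energy(3, 1)    # exact for the one-electron ion Li2+
e_exp = -(ie1 + ie2 + ie3)

print(e_approx)          # -10.125
print(round(e_exp, 3))   # -7.476
```

The gap of roughly 2.6 a.u. between the two numbers is the price of dropping every interelectronic repulsion term, all of which are positive.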


5-2 Simple Products and Electron Exchange Symmetry 


In the independent particle model just described, the wavefunction for the lowest-energy 
state for helium is 1s(1)1s(2) since this has the lowest possible sum of one-electron 
energies. The electronic configuration for this state is symbolized 1s², the superscript 
telling us how many electrons are in 1s orbitals. What might we expect for the electronic 
configuration of the lowest excited state? The answer is 1s2s (superscript “ones” are 
implicit). (At this point there is no reason for preferring this configuration to, say, 
1s2p, but we shall show later that, in multielectronic systems, the 2s orbital has a 
lower energy than does a 2p orbital, even though they have the same principal quantum 
number.) Thus, we might write 

$$\psi(1,2) = 1s(1)2s(2) = \underbrace{\sqrt{8/\pi}\,\exp(-2r_1)}_{\mathrm{He}^+\ 1s}\;\underbrace{\sqrt{1/\pi}\,(1 - r_2)\exp(-r_2)}_{\mathrm{He}^+\ 2s} \tag{5-11}$$


If one were to calculate r̄₁, the average distance from the nucleus for electron 1, using this 
wavefunction, one would obtain a value of 3/4 a.u., consistent with the 1s state of a helium 
ion. For electron 2 one would find an average value, r̄₂, of 3 a.u., characteristic of the 2s 
state (Problem 5-2). How does this correspond to what we would find experimentally? 
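The two average distances quoted above can be checked with the standard integral ∫₀^∞ rᵏ exp(−ar) dr = k!/aᵏ⁺¹. The Python sketch below assumes the He⁺ 1s and 2s functions exactly as written in Eq. (5-11); the helper names are hypothetical.

```python
from math import factorial

def radial_integral(k, a):
    """Integral of r**k * exp(-a*r) for r from 0 to infinity: k! / a**(k+1)."""
    return factorial(k) / a ** (k + 1)

def average_r_power(poly, a, prefactor, power):
    """Average of r**power over the density prefactor * poly(r) * exp(-a*r).

    poly maps each exponent of r to its coefficient. The volume element
    4*pi*r**2 dr contributes two extra powers of r; the factor 4*pi is
    assumed to be folded into prefactor already.
    """
    return prefactor * sum(c * radial_integral(k + 2 + power, a)
                           for k, c in poly.items())

# Squared orbitals from Eq. (5-11), with 4*pi from the angular integration included:
# |1s|^2 = (8/pi) exp(-4r)                -> prefactor 4*pi*(8/pi) = 32
# |2s|^2 = (1/pi)(1 - 2r + r^2) exp(-2r)  -> prefactor 4*pi*(1/pi) = 4
density_1s = dict(poly={0: 1}, a=4, prefactor=32)
density_2s = dict(poly={0: 1, 1: -2, 2: 1}, a=2, prefactor=4)

for name, d in (("1s", density_1s), ("2s", density_2s)):
    norm = average_r_power(power=0, **d)     # should equal 1 (normalization)
    mean_r = average_r_power(power=1, **d)   # 3/4 a.u. for 1s, 3 a.u. for 2s
    print(name, norm, mean_r)
```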

Before answering this question, we must recall that there are special problems asso- 
ciated with measuring the properties of an atomic system. The process of "seeing" 
electrons in atoms well enough to pinpoint their positions perturbs an atom so strongly 
that it cannot be assumed to be in the same state after the measurement. To get around 
this problem, we can assume that we have a very large number of identically prepared 
helium atoms, and that a single measurement of electronic positions will be made on 
each atom. It is assumed that the average of the instantaneous r values for a billion 
systems is identical to the average r value for a billion instants in a single undisturbed 
system. 

When we consider the measurement of average values for r₁ and r₂ in helium, we 
immediately encounter another problem. Say we can effect a simultaneous measurement 
of the two electronic distances in the first He atom. We call these r₁ and r₂ and 
tabulate them for future averaging. Then we move on to a new helium atom and measure 
r₁ and r₂ for it. But we clearly have no way of identifying a particular one of these 
electrons with a particular one of the earlier pair. There is no connection between r₁ 
for one atom and r₁ for the next since all electrons are identical. If we want to know r̄, 
we can only average them all together and leave it at that. 

Thus, the wavefunction (5-11) does not seem to be entirely satisfactory since it 
enables us to calculate r̄₁ ≠ r̄₂, something that is in principle impossible to measure. 
We need to modify the wavefunction so that it yields an average value for r₁ and r₂ 
(or for any quantity) that is independent of our choice of electron labels. This means 
that the electron density itself, given by ψ²(1, 2), must be independent of our electron 
labeling scheme. 

In a two-electron system like helium, there are only two ways to arrange the labels 
“1” and “2” in a single product function. For example, the product 1s2s can be written 


$$1s(1)2s(2) \quad\text{or}\quad 2s(1)1s(2) \tag{5-12}$$

Squaring these gives two different functions, namely, 

$$\begin{aligned}
1s^2(1)2s^2(2) &= (8/\pi)\exp(-4r_1)\,(1/\pi)(1 - 2r_2 + r_2^2)\exp(-2r_2)\\
2s^2(1)1s^2(2) &= (8/\pi)\exp(-4r_2)\,(1/\pi)(1 - 2r_1 + r_1^2)\exp(-2r_1)
\end{aligned} \tag{5-13}$$


These are different since they predict, for instance, different distributions for electron 1. 
The functions (5-12) are said to differ by an interchange of electron indices, or coor- 
dinates. (Since electron labels denote position coordinates, interchange of labels in 
the mathematical formula corresponds to interchanging positions of electrons in the 
physical model.) For ψ² to be invariant under such an interchange, it is necessary that 
ψ itself be either symmetric or antisymmetric under the interchange. That is, if P is 
an interchange operator such that Pf(1, 2) = f(2, 1), then we need a ψ such that 

$$P\psi = \pm\psi \tag{5-14}$$

since then 

$$P(\psi^2) = (P\psi)^2 = (\pm\psi)^2 = \psi^2$$

One such wavefunction is given by the sum of eigenfunctions (5-12), 

$$\psi_s = (1/\sqrt{2})[1s(1)2s(2) + 2s(1)1s(2)] \tag{5-15}$$

since 

$$P\psi_s = (1/\sqrt{2})[1s(2)2s(1) + 2s(2)1s(1)] = \psi_s$$

(the factor 1/√2 keeps the wavefunction normalized). Wavefunction (5-15) is thus 
symmetric under electron interchange. Is Eq. (5-15) still an eigenfunction for Happrox? 
Yes, because the eigenfunctions (5-12) are degenerate (both have E = ε₁ₛ + ε₂ₛ) and 
can therefore be mixed together in any way we choose and still be eigenfunctions. The 
antisymmetric combination is 

$$\psi_a = (1/\sqrt{2})[1s(1)2s(2) - 2s(1)1s(2)] \tag{5-16}$$


Thus far we have shown that simple products of atomic orbitals give us two degen- 
erate eigenfunctions of Happrox for the configuration 1s2s and that these eigenfunctions 
fail to have the required symmetry properties for interchange of electron coordinates. 
But we have shown that, by taking the sum and difference of these simple products, we 
can form new eigenfunctions of Happrox that are respectively symmetric and antisymmetric 
with respect to the interchange of electron coordinates, so that ψ² is invariant 
to electron interchange. 
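A numerical spot check makes the same point. The sketch below evaluates the simple product and the two combinations (5-15) and (5-16) at one arbitrarily chosen pair of distances, using the He⁺ radial functions of Eq. (5-11). The sample values r₁ = 0.5 and r₂ = 2.0 a.u. are assumptions made purely for illustration.

```python
from math import exp, pi, sqrt

def s1(r):
    """He+ 1s orbital from Eq. (5-11)."""
    return sqrt(8 / pi) * exp(-2 * r)

def s2(r):
    """He+ 2s orbital from Eq. (5-11)."""
    return sqrt(1 / pi) * (1 - r) * exp(-r)

def simple(r1, r2):
    """Simple product 1s(1)2s(2)."""
    return s1(r1) * s2(r2)

def psi_s(r1, r2):
    """Symmetric combination, Eq. (5-15)."""
    return (s1(r1) * s2(r2) + s2(r1) * s1(r2)) / sqrt(2)

def psi_a(r1, r2):
    """Antisymmetric combination, Eq. (5-16)."""
    return (s1(r1) * s2(r2) - s2(r1) * s1(r2)) / sqrt(2)

r1, r2 = 0.5, 2.0   # arbitrary sample distances in a.u.

for name, f in (("simple product", simple), ("psi_s", psi_s), ("psi_a", psi_a)):
    # Compare the squared function before and after interchanging the labels.
    print(name, f(r1, r2) ** 2, f(r2, r1) ** 2)
```

Only the simple product gives two different squared values; the squares of both combinations are unchanged by the interchange, even though ψₐ itself changes sign.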

There is another way we can demonstrate that the helium atom eigenfunctions ought 
to be symmetric or antisymmetric for electron exchange: We can examine the hamilto- 
nian operator. We have shown in Chapter 2 that nondegenerate eigenfunctions must be 
symmetric or antisymmetric for any operation that leaves the hamiltonian unchanged 
and that degenerate eigenfunctions may always be mixed together in some combina- 
tion so that they too are symmetric or antisymmetric. This suggests that, for the case 
under discussion (the helium atom), the hamiltonian operator might be unchanged by 
an exchange of electrons. First we examine Happrox: 


$$PH_{\mathrm{approx}} = P[h(1) + h(2)] = h(2) + h(1) = H_{\mathrm{approx}} \tag{5-17}$$


Our approximate hamiltonian is invariant to electron exchange, so any nondegenerate 
eigenfunctions must be symmetric or antisymmetric for interchange of electron labels 
(or positions). Only because the 1s2s configuration leads to degenerate eigenfunctions 
were we able to find unsymmetric² eigenfunctions like Eq. (5-12). This situation is 
reminiscent of the particle-in-a-ring system discussed in Chapter 2, where degenerate, 
symmetric exponential eigenfunctions could be mixed to form degenerate unsymmetric 
trigonometric eigenfunctions. 

²A function is unsymmetric for any operation that produces neither plus nor minus that function; i.e., if Pf = g 
and g ≠ ±f, then f is unsymmetric under the operation P. 

Let us now examine the full hamiltonian H: 

$$PH(1,2) = P[h(1) + h(2) + 1/r_{12}] = h(2) + h(1) + 1/r_{21} = H(1,2) \tag{5-18}$$


Since r12 and r21 are the same distance, it is evident that the exact H is likewise invariant 
to interchange of electron labels. Thus, we see that appeal either to physical argument 
or to the invariance of H and of Happrox to exchange of electrons leads us to recognize 
the need to impose symmetry conditions on the wavefunctions. 

We now summarize the points we have tried to convey in this section. 


1. A simple product function of the type 1s(1)2s(2) enables one to calculate different 
values of r for electrons 1 and 2. This makes no physical sense since the electrons 
are identical particles and hence are not physically distinguishable. 


2. Wavefunctions that overcome this difficulty must be either symmetric or antisym- 
metric with respect to exchange of electron labels (coordinates). 


3. The fact that this kind of “exchange symmetry” must be present is also (or alter- 
natively) seen from the fact that H (and also Happrox) is invariant under such an 
exchange operation. 


EXAMPLE 5-2 Given the functions f(x₁) = x₁² and g(x₂) = exp(x₂), show that, for 
x₁ = 1, x₂ = 2, f(x₁)g(x₂) is unsymmetric for exchange of the two x positions, 
f(x₁)g(x₂) + g(x₁)f(x₂) is symmetric, and f(x₁)g(x₂) − g(x₁)f(x₂) is antisymmetric. 


SOLUTION » For fg, we are examining what happens when x₁² exp(x₂) turns into x₂² exp(x₁). 
That is, we are comparing 1² exp(2) to 2² exp(1). The resulting values are 7.389 and 10.873, 
obviously neither plus nor minus one times each other. fg + gf equals 1² exp(2) + 2² exp(1), or 
7.389 + 10.873. After switching positions, we get 10.873 + 7.389, which is obviously the same 
thing. fg − gf gives 7.389 − 10.873. After switching, it gives 10.873 − 7.389, which is obviously 
minus one times the first value. 
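The comparison in Example 5-2 can be scripted directly. This is a minimal sketch with hypothetical function names, and the classification is made at the single pair of points x₁ = 1, x₂ = 2, as in the example.

```python
from math import exp

def f(x):
    return x ** 2

def g(x):
    return exp(x)

def simple_product(x1, x2):
    return f(x1) * g(x2)

def sum_combination(x1, x2):
    return f(x1) * g(x2) + g(x1) * f(x2)

def difference_combination(x1, x2):
    return f(x1) * g(x2) - g(x1) * f(x2)

def classify(func, x1, x2, tol=1e-12):
    """Compare func before and after exchanging its two arguments."""
    before, after = func(x1, x2), func(x2, x1)
    if abs(after - before) < tol:
        return "symmetric"
    if abs(after + before) < tol:
        return "antisymmetric"
    return "unsymmetric"

print(round(simple_product(1, 2), 3), round(simple_product(2, 1), 3))   # 7.389 10.873
for func in (simple_product, sum_combination, difference_combination):
    print(func.__name__, classify(func, 1, 2))
```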


5-3 Electron Spin and the Exclusion Principle 


Chemical and spectral evidence indicates that metals in Groups IA and IB of the periodic 
table are reasonably well represented by an electron configuration wherein one loosely 
held “valence” electron occupies an s orbital and all other electrons occur in pairs in 
orbitals of lower principal quantum number. Thus, lithium has a ground-state electronic 
structure approximated by the configuration 1s²2s, sodium by 1s²2s²2p⁶3s, copper by 
1s²2s²2p⁶3s²3p⁶3d¹⁰4s, etc. (A configuration indicating that all orbitals of given n 
and l are doubly occupied, leaving no other electrons, is often called a closed shell. Thus, 
the above-cited examples each consist of a closed shell plus one s valence electron.) 
The observation that each atomic orbital in such configurations is occupied by no more 
than two electrons was without a theoretical explanation for some time. 
When an atom like sodium is placed in an external magnetic field, what should 
be the magnetic moment of the atom due to orbital motion of the electrons? The 
s electrons should contribute nothing since, by definition of s, l = 0 and hence the 
magnetic quantum number m = 0 for such electrons. An electron in a p orbital may 
have an orbital magnetic moment, but if all p levels (l = 1, m = +1, 0, −1) are equally 
occupied, the net magnetic moment should be zero. It is clear, then, that we might 
expect atoms in Groups IA and IB to possess no magnetic moment due to electron 
orbital motion. Nevertheless, Gerlach and Stern [1, 2] found that, when a beam of 
unexcited silver atoms is passed through an inhomogeneous magnetic field, it splits 
into two components as though each silver atom possesses a small magnetic moment 
capable of taking on either of two orientations in the applied field. (In a homogeneous 
magnetic field, the north and south poles of a magnetic dipole experience equal but 
oppositely directed forces, causing the dipole to become oriented. An example is a 
magnetic compass in the magnetic field of the earth. In an inhomogeneous magnetic 
field the poles experience opposite but unequal forces, causing the entire dipole to be 
accelerated through space in addition to being oriented.) Uhlenbeck and Goudsmit 
[3] and Bichowsky and Urey [4] independently suggested that the electron behaves 
as though it were a particle of finite radius spinning about its center of mass. Such a 
spinning particle would classically have angular momentum and, since it is charged, 
an accompanying magnetic moment.³ 

If we accept the model of electron spin, then we can rationalize our experimental 
facts if we assume each electron is capable of being in one of but two possible states of 
opposite spin. This is done in the following way. If we attribute opposite spins to the 
two 1s electrons in, say, silver, their spin moments should cancel. Similarly, all other 
orbital-sharing electrons would contribute nothing if their spins were opposed. Only 
the outermost (5s) electron would have an uncanceled spin moment. Its two possible 
orientations would cause the beam to split into two components as is observed.⁴ 

The evident need for the introduction of the concept of electron spin means that our 
wavefunctions of earlier sections are incomplete. We need a wavefunction that tells us 
not only the probability that an electron will be found at given r, θ, φ coordinates in 
three-dimensional space, but also the probability that it will be in one or the other spin 
state. Rather than seeking detailed mathematical descriptions of spin state functions, 
we will simply symbolize them α and β. Then the symbol φ(1)α(1) will mean that 
electron number 1 is in a spatial distribution corresponding to space orbital φ, and that 
it has spin α. In the independent electron scheme, then, we could write the spin orbital 
(includes space and spin parts) for the valence electron of silver either as 5s(1)α(1) or 
5s(1)β(1). These two possibilities both occur in the atomic beam and interact differently 
with the inhomogeneous magnetic field. 

We now focus on the manner in which spin considerations affect wavefunction 
symmetry. The electrons are still identical particles, so our particle distribution must be 
insensitive to our choice of labels. This last statement is equivalent to saying that ψ must 
be symmetric or antisymmetric for interchange of electron space and spin coordinates. 
Let us examine this situation in the case of ground state helium and lithium atoms. 

³This classical model, developed in the 1920s, is pedagogically useful and is responsible for the term spin, which 
is still employed to describe the fourth quantum number. However, it was not until 1948 and 1967 that mathematical 
studies of the properties of linearized equivalents of the Schrödinger equation revealed the mathematical connection 
to this quantum number. For an entry to the literature, see Roman [10]. 

⁴Actually, other experimental evidence, such as splitting of atomic spectral lines due to applied magnetic fields, 
was also available. Furthermore, experience with the quantum theory of orbital angular momentum played a role 
in the treatment of electron spin. The reader should not think that the historical development of quantum theory 
of spin was as naive or simple as we make it appear here. 

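In the independent electron approximation, the lowest-energy configuration for 
helium is 1s². Let us write the various conceivable spin combinations for this 
configuration. They are 

$$\begin{aligned}
1s(1)\alpha(1)\,1s(2)\alpha(2) &= 1s(1)1s(2)\,\alpha(1)\alpha(2) && (5\text{-}19)\\
1s(1)\alpha(1)\,1s(2)\beta(2) &= 1s(1)1s(2)\,\alpha(1)\beta(2) && (5\text{-}20)\\
1s(1)\beta(1)\,1s(2)\alpha(2) &= 1s(1)1s(2)\,\beta(1)\alpha(2) && (5\text{-}21)\\
1s(1)\beta(1)\,1s(2)\beta(2) &= 1s(1)1s(2)\,\beta(1)\beta(2) && (5\text{-}22)
\end{aligned}$$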


It is easy to see that the common space term 1s(1)1s(2) is symmetric for electron 
interchange. Likewise, α(1)α(2) and β(1)β(2) are each symmetric, so Eqs. (5-19) and 
(5-22) are totally symmetric wavefunctions. The spin parts of Eqs. (5-20) and (5-21) are 
unsymmetric (not antisymmetric) for interchange, so these wavefunctions are not 
satisfactory. However, we can take the sum and difference of Eqs. (5-20) and (5-21) to obtain 


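$$\begin{aligned}
&1s(1)1s(2)\,(1/\sqrt{2})[\alpha(1)\beta(2) + \beta(1)\alpha(2)] && (5\text{-}23)\\
&1s(1)1s(2)\,(1/\sqrt{2})[\alpha(1)\beta(2) - \beta(1)\alpha(2)] && (5\text{-}24)
\end{aligned}$$

The 2^(−1/2) serves to maintain normality if we assume α and β to be orthonormal: 

$$\int \alpha^*(1)\alpha(1)\,d\omega(1) = \int \beta^*(1)\beta(1)\,d\omega(1) = 1 \tag{5-25}$$

$$\int \alpha^*(1)\beta(1)\,d\omega(1) = \int \beta^*(1)\alpha(1)\,d\omega(1) = 0 \tag{5-26}$$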


Here we use integrals and a differential element dω in a “spin coordinate ω.” This is 
notationally convenient but not, for our purposes, worth delving into. We can interpret 
integration over ω to be in effect equivalent to summing over the possible electron 
indices. If, for a particular electron index, the spins agree, then the integral equals unity. 
If they disagree, the integral vanishes. Wavefunction (5-23) consists of symmetric space 
and spin parts, so it is overall symmetric. Wavefunction (5-24) contains a symmetric 
space part times an antisymmetric spin part, so it is overall antisymmetric. We have 
succeeded, then, in writing down four wavefunctions for the configuration 1s² having 
proper symmetry for electron interchange. Three of these, Eqs. (5-19), (5-22), (5-23), 
are symmetric and one, Eq. (5-24), is antisymmetric. Experimentally, we know that 
the ground state of helium is a singlet, that is, there is but one such state. This suggests 
that the wavefunction must be antisymmetric for exchange of electron space and spin 
coordinates. 
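The symmetry classification of the two-electron spin functions can be mechanized. In the sketch below a spin function is stored as a dictionary mapping a tuple of spin labels, one per electron, to an integer coefficient, with "a" standing for α and "b" for β. This representation and the function names are assumptions made for illustration, and normalization factors are ignored. Because the space part 1s(1)1s(2) is symmetric, the overall symmetry equals the spin symmetry.

```python
def exchange(func, i, j):
    """Interchange the spin coordinates of electrons i and j (0-based positions)."""
    out = {}
    for spins, coeff in func.items():
        swapped = list(spins)
        swapped[i], swapped[j] = swapped[j], swapped[i]
        key = tuple(swapped)
        out[key] = out.get(key, 0) + coeff
    return out

def symmetry(func, i, j):
    """Classify func as symmetric, antisymmetric, or unsymmetric under exchange."""
    swapped = exchange(func, i, j)
    keys = set(func) | set(swapped)
    if all(swapped.get(k, 0) == func.get(k, 0) for k in keys):
        return "symmetric"
    if all(swapped.get(k, 0) == -func.get(k, 0) for k in keys):
        return "antisymmetric"
    return "unsymmetric"

spin_functions = {
    "(5-19)": {("a", "a"): 1},
    "(5-20)": {("a", "b"): 1},
    "(5-21)": {("b", "a"): 1},
    "(5-22)": {("b", "b"): 1},
    "(5-23)": {("a", "b"): 1, ("b", "a"): 1},
    "(5-24)": {("a", "b"): 1, ("b", "a"): -1},
}

results = {label: symmetry(func, 0, 1) for label, func in spin_functions.items()}
for label, kind in results.items():
    print(label, kind)
```

Of the four acceptable functions, three come out symmetric and one antisymmetric, matching the count in the text.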


EXAMPLE 5-3 We have just shown four wavefunctions resulting from four spin 
functions times a symmetric space part (1s²). Can we manipulate the 1s² configuration 
to obtain an antisymmetric space part, as we did for the 1s2s configuration? 


SOLUTION » We can try to produce an antisymmetric space function by taking the product 
difference 1s(1)1s(2) − 1s(2)1s(1). Since these two products are really the same, this combination 
equals zero. Thus, whereas two different orbitals can be arranged in two product combinations, one 
symmetric and the other antisymmetric, we find that a single orbital, doubly occupied, can appear 
only in one simple product, which must be symmetric for electron exchange. 


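Now let us try lithium. The lowest-energy configuration should be 1s³, and we can 
write eight unique space-spin orbital products: 

$$1s(1)1s(2)1s(3)\times
\begin{cases}
\alpha(1)\alpha(2)\alpha(3) & (5\text{-}27)\\
\alpha(1)\alpha(2)\beta(3) & (5\text{-}28)\\
\alpha(1)\beta(2)\alpha(3) & (5\text{-}29)\\
\beta(1)\alpha(2)\alpha(3) & (5\text{-}30)\\
\alpha(1)\beta(2)\beta(3) & (5\text{-}31)\\
\beta(1)\alpha(2)\beta(3) & (5\text{-}32)\\
\beta(1)\beta(2)\alpha(3) & (5\text{-}33)\\
\beta(1)\beta(2)\beta(3) & (5\text{-}34)
\end{cases}$$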


Of these, the first and last are totally symmetric for all electron interchanges. The 
remaining six are unsymmetric for two out of three possible interchanges. Can we 
make appropriate linear combinations of these as we did for helium? Let us try. The 
problem is simplified by recognizing that, if we start with, say, two α's and one β, 
we still have that number of α's and β's after interchange of electron labels. Hence, 
we mix together only functions that agree in total numbers of α's and β's, i.e., (5-28), 
(5-29), (5-30) with each other, or (5-31), (5-32), (5-33) with each other. Let us try the 
sum of (5-28), (5-29), and (5-30). Ignoring normalization, this gives the spin function 

$$\alpha(1)\alpha(2)\beta(3) + \alpha(1)\beta(2)\alpha(3) + \beta(1)\alpha(2)\alpha(3) \tag{5-35}$$

Interchanging electron spin coordinates 1 and 2 gives 

$$\alpha(2)\alpha(1)\beta(3) + \alpha(2)\beta(1)\alpha(3) + \beta(2)\alpha(1)\alpha(3)$$

which, upon reordering each product, is easily seen to be identical to (5-35). The same 
result arises from interchanging 1 and 3 or 2 and 3, and so (5-35) is symmetric for all 
interchanges. The sum of (5-31), (5-32), and (5-33) is likewise symmetric. Can we find 
any combinations that are totally antisymmetric? A few attempts with pencil and paper 
should convince one that it is impossible to find a combination that is antisymmetric 
for all interchanges. Experimentally, we know that no state of lithium corresponds to 
a 1s³ configuration. 
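The pencil-and-paper search can be replaced by a systematic one. A totally antisymmetric function is reproduced (up to a constant) by the sum over all permutations of the electron labels, each weighted by its parity, and that operation is linear. If it annihilates every one of the eight spin products, then no linear combination of them can be antisymmetric for all interchanges. The Python sketch below applies this test; the tuple representation ("a" for α, "b" for β) and the function names are assumptions made for illustration.

```python
from itertools import permutations, product

def parity(perm):
    """+1 for an even permutation, -1 for an odd one, from the inversion count."""
    n = len(perm)
    inversions = sum(1 for x in range(n) for y in range(x + 1, n) if perm[x] > perm[y])
    return -1 if inversions % 2 else 1

def antisymmetrize(spins):
    """Parity-weighted sum over all permutations of one spin product.

    Returns {spin product: integer coefficient}, with cancelled terms removed.
    """
    result = {}
    for perm in permutations(range(len(spins))):
        term = tuple(spins[p] for p in perm)
        result[term] = result.get(term, 0) + parity(perm)
    return {term: coeff for term, coeff in result.items() if coeff != 0}

# All eight three-electron spin products, (5-27) through (5-34).
survivors = {spins: antisymmetrize(spins) for spins in product("ab", repeat=3)}
for spins, combo in survivors.items():
    print("".join(spins), combo)      # every entry is the empty dict {}

# Contrast: for two electrons the same operation yields the spin part of (5-24).
print(antisymmetrize(("a", "b")))
```

Every three-electron product vanishes because, with only two spin states available, at least two electrons must share a spin label, and interchanging those two cancels the term.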

To summarize, we have found that for the configuration 1s² we can write three 
wavefunctions that are symmetric and one that is antisymmetric under exchange of electron 
space and spin coordinates, while for the configuration 1s³ we can construct symmetric 
or unsymmetric wavefunctions, but no antisymmetric ones. The physical observation 
is that atoms exist in only one state having an electronic structure approximately 
represented by the configuration 1s², but that there are no atoms having any state represented 
by 1s³. This and other physical evidence has led to the recognition of the exclusion 
principle: Wavefunctions must be antisymmetric with respect to simultaneous interchange 
of space and spin coordinates of electrons.⁵ In invoking the exclusion principle, we 
exclude all of the 1s³ wavefunctions and three out of the four wavefunctions we were 
able to construct for the ground state of helium, leaving (5-24) as the only acceptable 
wavefunction. 

We have seen that the ground state configuration of lithium cannot be 1s³. Can we 
satisfy the exclusion principle with the next-lowest energy configuration 1s²2s? We 
will try to find a satisfactory solution, but our manipulations will be simplified if we 
streamline our notation. We will write a function such as 1s(1)1s(2)2s(3)α(1)β(2)α(3) 
as 1s1s2sαβα, allowing position in the sequence to stand for the electron label. 
Interchanging electrons 1 and 2 is then represented by switching the order of space functions 
in positions 1 and 2 and spin functions in positions 1 and 2, thus 

$$1s1s2s\,\alpha\beta\alpha \rightarrow 1s1s2s\,\beta\alpha\alpha \tag{5-36}$$


This interchange produced a new function rather than merely reversing the sign 
of our starting function. But if we take the difference between the two products 
in Eq. (5-36), we will have a function that is antisymmetric to 1, 2 interchange: 
1s1s2s(αβα − βαα). Now we subject this to a 1, 3 interchange and the new products 
produced are subtracted to give a function that is antisymmetric to 1, 3 interchange: 
1s1s2s(αβα − βαα) − 2s1s1s(αβα − ααβ). The first pair of terms is still not antisymmetric 
to 2, 3 interchange, and the second pair is not antisymmetric to 1, 2 interchange. 
We can use either one of these interchanges to produce two new terms to subtract. 
Either way, the resulting wavefunction, antisymmetric for all interchanges, is 


$$\psi = \frac{1}{\sqrt{6}}\left[1s1s2s(\alpha\beta\alpha - \beta\alpha\alpha) + 1s2s1s(\beta\alpha\alpha - \alpha\alpha\beta) + 2s1s1s(\alpha\alpha\beta - \alpha\beta\alpha)\right] \tag{5-37}$$

The factor 6^(−1/2) normalizes (5-37) since all of the six space-spin products are normalized 
and orthogonal to each other product by virtue of either space-orbital or spin-orbital 
disagreement, or both. Note that, whereas the two-electron wavefunction for helium 
was separable into a single space function times a spin function, the lithium wavefunction 
must be written as a linear combination of such products. This is usually true when 
we deal with more than two electrons. 

Since 1s²2s is the lowest-energy configuration for which we can write an 
antisymmetrized wavefunction, this is the ground state configuration for lithium in this 
independent-electron approximation. 

In summary, phenomenological evidence suggests that an electron can exist in either 
of two “spin states” in the presence of a magnetic field. Writing wavefunctions including 
spin functions and comparing these with experimental facts indicates that states exist 
only for wavefunctions that satisfy the exclusion principle. 


⁵A broader statement is: Wavefunctions must be antisymmetric (symmetric) with respect to simultaneous 
interchange of space and spin coordinates of fermions (bosons). A fermion is characterized by half-integral 
spin quantum number; a boson is characterized by integral spin quantum number. Electrons have spin quantum 
number 1/2 and are therefore fermions. 




5-4 Slater Determinants and the Pauli Principle 


It was pointed out by Slater [5] that there is a simple way to write wavefunctions 
guaranteeing that they will be antisymmetric for interchange of electronic space and spin 
coordinates: one writes the wavefunction as a determinant. For the 1s²2s configuration 
of lithium, one would write 

$$\psi = \frac{1}{\sqrt{6}}
\begin{vmatrix}
1s(1)\alpha(1) & 1s(2)\alpha(2) & 1s(3)\alpha(3)\\
1s(1)\beta(1) & 1s(2)\beta(2) & 1s(3)\beta(3)\\
2s(1)\alpha(1) & 2s(2)\alpha(2) & 2s(3)\alpha(3)
\end{vmatrix} \tag{5-38}$$
Expanding this according to the usual rules governing determinants (see Appendix 2) 
gives 
v= ODODO) + 2s(1)a(1)1s(2)a@(2)18(3) 6G) 
+1s(1)BC1)2s(2)a(2)1s(3)a(3) — 2s(1)æ (1) 1s(2)8 (2) 1s3)a (3) 
—1s(1)8(1)1s(2)a (2)2s(3)a (3) — 1s(1)æ (1)2s(2)æ (2)1s(3)83)] (5-39) 


This can be factored and shown to be identical to wavefunction (5-37) of the preceding 
section. 
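
The bookkeeping in this expansion is easy to check by machine. The following is a minimal Python sketch, not part of the original derivation: the helper names `permutation_sign`, `expand_slater`, and `group_by_space`, and the labels "a" and "b" for α and β, are our own illustrative choices. It expands the determinant of Eq. (5-38) by summing signed permutations (omitting the factor 1/√6) and then groups the six terms by their space parts.

```python
from itertools import permutations

def permutation_sign(perm):
    """Return +1 for an even permutation of 0..n-1 and -1 for an odd one."""
    perm = list(perm)
    sign = 1
    for i in range(len(perm)):
        while perm[i] != i:              # sort by swaps; each swap flips the sign
            j = perm[i]
            perm[i], perm[j] = perm[j], perm[i]
            sign = -sign
    return sign

def expand_slater(spin_orbitals):
    """Map (space labels, spin labels), listed in electron order, to the term's sign."""
    n = len(spin_orbitals)
    terms = {}
    for perm in permutations(range(n)):
        # perm[e] is the spin-orbital (row) assigned to electron e (column).
        space = tuple(spin_orbitals[perm[e]][0] for e in range(n))
        spin = tuple(spin_orbitals[perm[e]][1] for e in range(n))
        key = (space, spin)
        terms[key] = terms.get(key, 0) + permutation_sign(perm)
    return terms

def group_by_space(terms):
    """Collect the spin products that multiply each space product."""
    grouped = {}
    for (space, spin), sign in terms.items():
        grouped.setdefault("".join(space), {})["".join(spin)] = sign
    return grouped

# Rows of Eq. (5-38): 1s-alpha, 1s-beta, 2s-alpha.
terms = expand_slater([("1s", "a"), ("1s", "b"), ("2s", "a")])
grouped = group_by_space(terms)
for space, spins in sorted(grouped.items()):
    print(space, spins)
```

Each space product (1s1s2s, 1s2s1s, 2s1s1s) appears with one positive and one negative spin product, which is the factored structure of wavefunction (5-37).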

A simplifying notation in common usage is to delete the α, β symbols of the
spin-orbitals and to let a bar over the space orbital signify β spin, absence of a bar being
understood to signify α spin. In this notation, Eq. (5-38) would be written


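$$\psi = \frac{1}{\sqrt{6}}
\begin{vmatrix}
1s(1) & 1s(2) & 1s(3) \\
\overline{1s}(1) & \overline{1s}(2) & \overline{1s}(3) \\
2s(1) & 2s(2) & 2s(3)
\end{vmatrix} \tag{5-40}$$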


The general prescription to follow in writing a Slater determinantal wavefunction is 
very simple: 


1. Choose the configuration to be represented. $1s\,\overline{1s}\,2s$ was used above. (Here we write
$1s\,\overline{1s}\,2s$ rather than 1s²2s to emphasize that the two 1s electrons occupy different
spin-orbitals.) For our general example, we will let Uᵢ stand for a general spin-orbital
and take a four-electron example of configuration U₁U₂U₃U₄.


2. For n electrons, set up an n × n determinant with (n!)^(−1/2) as normalizing factor.


Every position in the first row should be occupied by the first spin-orbital of the 
configuration; every position in the second row by the second spin-orbital, etc. Now 
put in electron indices so that all positions in column 1 are occupied by electron 1, 
column 2 by electron 2, etc. 


In the case of our four-electron configuration, the recipe gives 


$$\psi = \frac{1}{\sqrt{4!}}
\begin{vmatrix}
U_1(1) & U_1(2) & U_1(3) & U_1(4) \\
U_2(1) & U_2(2) & U_2(3) & U_2(4) \\
U_3(1) & U_3(2) & U_3(3) & U_3(4) \\
U_4(1) & U_4(2) & U_4(3) & U_4(4)
\end{vmatrix} \tag{5-41}$$

Notice that the principal diagonal (top left to bottom right) contains our original
configuration U₁U₂U₃U₄. Often, the Slater determinant is represented in a space-saving
way by simply writing the principal diagonal between short vertical bars. The
normalizing factor is deleted. Thus, Eq. (5-41) would be symbolized as
|U₁(1)U₂(2)U₃(3)U₄(4)|.

We have indicated the general recipe for writing down a Slater determinant, and we
have seen that, for the configuration $1s\,\overline{1s}\,2s$, this gives an antisymmetric
wavefunction. Now we will give a general proof of the antisymmetry of such wavefunctions
for exchange of electrons. We have already seen that interchanging the space and spin
coordinates of electrons 1 and 2 corresponds to going through the wavefunction and
changing all the 1's to 2's and vice versa; i.e., electron labels denote coordinates. In
a Slater determinant, interchanging electron labels 1 and 2 is the same thing as inter- 
changing columns 1 and 2 of the determinant. [See Eq. (5-41) and note that columns 1 
and 2 differ only in electron index.] But a determinant reverses sign upon interchange 
of two rows or columns. (See Appendix 2 for a summary of the properties of determi- 
nants.) Hence, any Slater determinant reverses sign (i.e., is antisymmetric) upon the 
interchange of space and spin coordinates of any two electrons. 

Suppose we tried to put two electrons into the same space-orbital with the same spin. 
This would require that the same spin-orbital be written twice in the configuration, 
causing two rows of the Slater determinant to be identical. [If both 1s electrons in
Eq. (5-40) had α spin, the bars would be absent from row 2.] We just stated that
the determinant must reverse sign upon interchange of two rows. If we interchange 
two identical rows, we change nothing yet the sign must reverse: the determinant 
must be equal to zero. Thus, the determinantal wavefunction vanishes when we try 
to put more than one electron into the same spin-orbital, indicating that this is not a 
physically allowed situation. This is a generalization of our earlier discovery that no 
1s³ configuration is allowed by the exclusion principle, such a configuration requiring
at least two electrons to have the same space and spin functions. 
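
Both properties just discussed, sign reversal on interchange of two electrons and vanishing when a spin-orbital is used twice, can be checked numerically. The sketch below is only an illustration under stated assumptions: the radial functions `one_s` and `two_s` are unnormalized stand-ins for 1s and 2s, each electron is represented by a hypothetical pair (r, spin label), and the determinant is evaluated by summing signed permutations with the (n!)^(−1/2) factor of the recipe above.

```python
import math
from itertools import permutations

def permutation_sign(perm):
    """Return +1 for an even permutation of 0..n-1 and -1 for an odd one."""
    perm = list(perm)
    sign = 1
    for i in range(len(perm)):
        while perm[i] != i:
            j = perm[i]
            perm[i], perm[j] = perm[j], perm[i]
            sign = -sign
    return sign

def make_spin_orbital(space, spin):
    """Spin-orbital of an electron (r, s): space part times 1 or 0 from the spin part."""
    return lambda electron: space(electron[0]) * (1.0 if electron[1] == spin else 0.0)

def slater_value(spin_orbitals, electrons):
    """Evaluate (n!)^(-1/2) det[U_i(electron j)] with rows i and columns j."""
    n = len(spin_orbitals)
    total = 0.0
    for perm in permutations(range(n)):
        product = 1.0
        for j in range(n):
            product *= spin_orbitals[perm[j]](electrons[j])
        total += permutation_sign(perm) * product
    return total / math.sqrt(math.factorial(n))

# Illustrative, unnormalized radial functions (assumed forms, not from the text).
one_s = lambda r: math.exp(-r)
two_s = lambda r: (2.0 - r) * math.exp(-r / 2.0)

lithium = [make_spin_orbital(one_s, "a"),
           make_spin_orbital(one_s, "b"),
           make_spin_orbital(two_s, "a")]
electrons = [(0.3, "a"), (1.1, "b"), (2.5, "a")]
swapped = [electrons[1], electrons[0], electrons[2]]   # interchange electrons 1 and 2

value = slater_value(lithium, electrons)
value_swapped = slater_value(lithium, swapped)

# Same spin-orbital written twice: two identical rows.
forbidden = [make_spin_orbital(one_s, "a"),
             make_spin_orbital(one_s, "a"),
             make_spin_orbital(two_s, "a")]
value_forbidden = slater_value(forbidden, [(0.3, "a"), (1.1, "a"), (2.5, "a")])

print(value, value_swapped, value_forbidden)
```

The first two printed numbers should be equal and opposite, and the third should be zero to within rounding error.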

This restriction on allowable electronic configurations is more familiar to chemists as 
the Pauli principle: In assigning electrons to atomic orbitals in the independent electron
scheme, no two electrons are allowed to have all four quantum numbers (n, l, m, spin)
the same. The Pauli principle is a restatement of the exclusion principle as it applies in 
the special case of an orbital approximation to the wavefunction. 


5-5 Singlet and Triplet States for the 1s2s Configuration of Helium


We showed in Section 5-2 that two space functions having proper space symmetry 
could be written for the configuration 1s2s. One was symmetric (Eq. 5-15) and one 
was antisymmetric (Eq. 5-16). Now we find that spin functions must be included in 
our wavefunctions, and in a way that makes the final result antisymmetric when space 
and spin coordinates are interchanged. We can accomplish this by multiplying the 
symmetric space function by an antisymmetric spin function, calling the result $\psi_{s,a}$.
Thus, 


$$\psi_{s,a}(1,2) = \frac{1}{\sqrt{2}}\left[1s(1)2s(2) + 2s(1)1s(2)\right]\,\frac{1}{\sqrt{2}}\left[\alpha(1)\beta(2) - \beta(1)\alpha(2)\right] \tag{5-42}$$

Alternatively, we can multiply the antisymmetric space term by any one of the three
possible symmetric spin terms:


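$$\psi_{a,s}(1,2) = \frac{1}{\sqrt{2}}\left[1s(1)2s(2) - 2s(1)1s(2)\right] \times
\begin{cases}
\alpha(1)\alpha(2) & \text{(5-43a)} \\
\frac{1}{\sqrt{2}}\left[\alpha(1)\beta(2) + \beta(1)\alpha(2)\right] & \text{(5-43b)} \\
\beta(1)\beta(2) & \text{(5-43c)}
\end{cases}$$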


All four of these wavefunctions satisfy the exclusion principle and each is linearly 
independent of the others, indicating that four distinct physical states arise from the 
configuration 1s2s. 
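
These two claims can be verified mechanically if we assume, as the text does, that 1s, 2s, α, and β are orthonormal. In the Python sketch below (our own illustration, with hypothetical helper names `product`, `exchange`, and `inner`), each wavefunction is stored as a dictionary mapping a product term, labeled by the space and spin functions of electrons 1 and 2, to its coefficient. Exchange swaps the electron labels, and the overlap of two functions is then just the sum of products of matching coefficients.

```python
import math

def product(space, spin):
    """Multiply a two-electron space function by a two-electron spin function."""
    return {(a, b, c, d): x * y
            for (a, b), x in space.items()
            for (c, d), y in spin.items()}

def exchange(psi):
    """Interchange space and spin coordinates of electrons 1 and 2."""
    return {(b, a, d, c): coef for (a, b, c, d), coef in psi.items()}

def inner(psi, phi):
    """Overlap integral, valid when 1s, 2s and alpha, beta are orthonormal."""
    return sum(coef * phi.get(key, 0.0) for key, coef in psi.items())

r = 1.0 / math.sqrt(2.0)
space_sym = {("1s", "2s"): r, ("2s", "1s"): r}
space_anti = {("1s", "2s"): r, ("2s", "1s"): -r}
spin_anti = {("a", "b"): r, ("b", "a"): -r}          # "a" = alpha, "b" = beta
spin_sym = [{("a", "a"): 1.0},
            {("a", "b"): r, ("b", "a"): r},
            {("b", "b"): 1.0}]

# Eq. (5-42) followed by Eqs. (5-43a), (5-43b), (5-43c).
states = [product(space_sym, spin_anti)] + [product(space_anti, s) for s in spin_sym]

for psi in states:
    swapped = exchange(psi)
    print(all(abs(swapped[key] + psi[key]) < 1e-12 for key in psi))
```

Each line printed is `True`, meaning that exchange reverses the sign of every term. Mutual orthogonality, which implies linear independence, follows from `inner` returning zero for every pair of different states.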

There are a number of important points that can be illustrated using these wave- 
functions. The first has to do with Slater determinants. Let us write down a Slater 
determinantal expression corresponding to wavefunction (5-43a). The configuration is
1s(1)α(1)2s(2)α(2), giving the Slater determinant (where absence of a bar indicates
α spin)


$$\psi_{a,s}(1,2) = \frac{1}{\sqrt{2}}
\begin{vmatrix}
1s(1) & 1s(2) \\
2s(1) & 2s(2)
\end{vmatrix} \tag{5-44}$$

which, upon expansion, gives us Eq. (5-43a). If we attempt the same process to
obtain Eq. (5-43b), we encounter a difficulty. The configuration 1s(1)α(1)2s(2)β(2)
leads to a 2 × 2 determinant, which, upon expansion, gives two product terms,
whereas Eq. (5-43b) involves four product terms. The Slater determinantal functions
corresponding to Eqs. (5-42) and (5-43b) are, in fact,

$$\left.\begin{matrix}\psi_{s,a}(1,2) \\ \psi_{a,s}(1,2)\end{matrix}\right\} =
\frac{1}{\sqrt{2}}\left\{\frac{1}{\sqrt{2}}
\begin{vmatrix} 1s(1) & 1s(2) \\ \overline{2s}(1) & \overline{2s}(2) \end{vmatrix}
\mp \frac{1}{\sqrt{2}}
\begin{vmatrix} \overline{1s}(1) & \overline{1s}(2) \\ 2s(1) & 2s(2) \end{vmatrix}\right\} \tag{5-45}$$

The lesson to be gained from this is that a single Slater determinant does not always
display all of the symmetry possessed by the correct wavefunction. (In this particular
case, a single determinant restricts one of the AOs to α spin and the other to β, which
is an artificial limitation.)

Next we will investigate the energies of the states as they are described by these
wavefunctions. We have already pointed out that they are degenerate eigenfunctions
of H_approx, but we will now examine their interactions with the full hamiltonian (5-2).
Since our wavefunctions are not eigenfunctions of this hamiltonian, we cannot compare
eigenvalues. Instead we must calculate the average values of the energy for each
wavefunction, using the formula

$$\bar{E} = \frac{\int \psi^* H \psi \, d\tau}{\int \psi^* \psi \, d\tau} \tag{5-46}$$

The symbol “dτ” stands for integration over space and spin coordinates of the electrons:
dτ = dv dω. Since both space and spin parts of our wavefunctions are normalized
[cf. Eqs. (5-25) and (5-26)], the denominator of Eq. (5-46) is unity and may be ignored.
The energy thus is given by the expression

$$\bar{E} = \int \psi^* \left[-\tfrac{1}{2}\nabla_1^2 - \tfrac{1}{2}\nabla_2^2 - (2/r_1) - (2/r_2) + (1/r_{12})\right] \psi \, d\tau \tag{5-47}$$

Notice that the energy operator H contains no terms that would interact with spin
functions α and β. (Such terms do arise at higher levels of refinement, but we ignore
them for now.) Hence, the spin terms of ψ can be integrated separately, and, since all
spin factors in Eqs. (5-42) and (5-43) are normalized, this gives a factor of unity in all 
four cases. This means that the average energies will be entirely determined by the 
space parts of the wavefunctions. This, in turn, means that all three states (5-43), which 
have the same space term, will have the same energy but that the state approximated by 
the function (5-42) may have a different energy. If our approximate representation of 
the exact eigenfunctions is physically realistic, we expect helium to display two excited 
state energies in the energy range consistent with a 1s2s configuration. Furthermore, 
we expect one of these state energies to be triply degenerate. 

Which of these two state energies should be higher? To determine this requires that 
we expand our energy expression (5-47) for each of the two space functions (5-42) 
and (5-43). 


$$\bar{E}_{\substack{1 \\ 3}} = \frac{1}{2}\iint \left[1s^*(1)2s^*(2) \pm 2s^*(1)1s^*(2)\right]
\left[-\tfrac{1}{2}\nabla_1^2 - \tfrac{1}{2}\nabla_2^2 - (2/r_1) - (2/r_2) + (1/r_{12})\right]
\left[1s(1)2s(2) \pm 2s(1)1s(2)\right] dv(1)\,dv(2) \tag{5-48}$$

(The subscript on Ē refers to the degeneracy of whichever energy level we are
considering.) This expands into a large number of terms. Integrals over one-electron operators
may be written as products of two integrals, each over a different electron. Thus, the
expansion over the kinetic energy operators gives


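$$\begin{aligned}
\frac{1}{2}\Big\{&\int 1s^*(1)\left[-\tfrac{1}{2}\nabla_1^2\right]1s(1)\,dv(1)\int 2s^*(2)2s(2)\,dv(2) \\
&+ \int 2s^*(2)\left[-\tfrac{1}{2}\nabla_2^2\right]2s(2)\,dv(2)\int 1s^*(1)1s(1)\,dv(1) \\
&+ \int 2s^*(1)\left[-\tfrac{1}{2}\nabla_1^2\right]2s(1)\,dv(1)\int 1s^*(2)1s(2)\,dv(2) \\
&+ \int 1s^*(2)\left[-\tfrac{1}{2}\nabla_2^2\right]1s(2)\,dv(2)\int 2s^*(1)2s(1)\,dv(1) \\
&\pm \int 1s^*(1)\left[-\tfrac{1}{2}\nabla_1^2\right]2s(1)\,dv(1)\int 2s^*(2)1s(2)\,dv(2) \\
&\pm \int 2s^*(2)\left[-\tfrac{1}{2}\nabla_2^2\right]1s(2)\,dv(2)\int 1s^*(1)2s(1)\,dv(1) \\
&\pm \int 2s^*(1)\left[-\tfrac{1}{2}\nabla_1^2\right]1s(1)\,dv(1)\int 1s^*(2)2s(2)\,dv(2) \\
&\pm \int 1s^*(2)\left[-\tfrac{1}{2}\nabla_2^2\right]2s(2)\,dv(2)\int 2s^*(1)1s(1)\,dv(1)\Big\}
\end{aligned} \tag{5-49}$$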


⁶ We have already shown that, if the 1/r₁₂ term is absent, the energy is equal to $E_{1s} + E_{2s}$ for He⁺, which is
equal to −2.5 a.u. Therefore, the detailed breakdown leading to Eqs. (5-49)-(5-51) is not necessary. However, we
will present it in detail in the belief that some students will benefit from another specific example of integration
of two-electron products over one-electron operators.

The orthogonality of the 1s and 2s orbitals causes the terms preceded by ± to vanish.
Furthermore, integrals that differ only in the variable label [such as those in the second
and third terms of (5-49)] are equal, so that this expansion becomes


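$$\int 1s^*(1)\left[-\tfrac{1}{2}\nabla_1^2\right]1s(1)\,dv(1) + \int 2s^*(1)\left[-\tfrac{1}{2}\nabla_1^2\right]2s(1)\,dv(1) \tag{5-50}$$

Expansion of Eq. (5-48) over (−2/r₁ − 2/r₂) proceeds analogously to give

$$\int 1s^*(1)\left[-2/r_1\right]1s(1)\,dv(1) + \int 2s^*(1)\left[-2/r_1\right]2s(1)\,dv(1) \tag{5-51}$$

The final term in the hamiltonian, 1/r₁₂, occurs in four two-electron integrals:

$$\begin{aligned}
\frac{1}{2}\Big[&\iint 1s^*(1)2s^*(2)\,(1/r_{12})\,1s(1)2s(2)\,dv(1)\,dv(2) \\
&+ \iint 2s^*(1)1s^*(2)\,(1/r_{12})\,2s(1)1s(2)\,dv(1)\,dv(2) \\
&\pm \iint 1s^*(1)2s^*(2)\,(1/r_{12})\,2s(1)1s(2)\,dv(1)\,dv(2) \\
&\pm \iint 2s^*(1)1s^*(2)\,(1/r_{12})\,1s(1)2s(2)\,dv(1)\,dv(2)\Big]
\end{aligned} \tag{5-52}$$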


The first two integrals of (5-52) differ only by an interchange of labels “1” and “2,” and
so they are equal to each other. The same is true of the second pair. Thus, the average 
value of the energy is 


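$$\begin{aligned}
\bar{E}_{\substack{1 \\ 3}} = {}&\int 1s^*(1)\left[-\tfrac{1}{2}\nabla_1^2\right]1s(1)\,dv(1) + \int 1s^*(1)\left[-2/r_1\right]1s(1)\,dv(1) \\
&+ \int 2s^*(1)\left[-\tfrac{1}{2}\nabla_1^2\right]2s(1)\,dv(1) + \int 2s^*(1)\left[-2/r_1\right]2s(1)\,dv(1) \\
&+ \iint 1s^*(1)2s^*(2)\,(1/r_{12})\,1s(1)2s(2)\,dv(1)\,dv(2) \\
&\pm \iint 1s^*(1)2s^*(2)\,(1/r_{12})\,2s(1)1s(2)\,dv(1)\,dv(2)
\end{aligned} \tag{5-53}$$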


Notice that, since −½∇² − 2/r is the hamiltonian for He⁺, the first two integrals of
Eq. (5-53) combine to give the average energy of He⁺ in its 1s state. The second pair
gives the energy for He⁺ in the 2s state. Thus, Eq. (5-53) can be written


$$\bar{E}_{\substack{1 \\ 3}} = E_{1s} + E_{2s} + J \pm K \tag{5-54}$$


where J and K represent the last two integrals in Eq. (5-53). No bars appear on $E_{1s}$
or $E_{2s}$ because these “average energies” are identical to the eigenvalues for the He⁺
hamiltonian (Problem 5-15).

The integral J denotes electrons 1 and 2 as being in “charge clouds” described by
1s*1s and 2s*2s, respectively. The operator 1/r₁₂ gives the electrostatic repulsion
energy between these two charge clouds. Since these charge clouds are everywhere
negatively charged, all the interactions are repulsive, and it is necessary that this “coulomb
integral” J be positive. Alternatively, we can argue that the functions 1s*1s, 2s*2s,
and 1/r₁₂ are everywhere positive, so the integrand of J is everywhere positive and J
must be positive.

Figure 5-2 » The function produced by multiplying together hydrogenlike 1s and 2s orbitals. R
is the radius of the spherical nodal surface.

The integral K is called an "exchange integral" because the two product functions in 
the integrand differ by an exchange of electrons. This integral gives the net interaction 
between an electron “distribution” described by 1s*2s, and another electron in the same 
distribution. (These distributions are mathematical functions, not physically realizable 
electron distributions.) The 1s2s function is sketched in Fig. 5-2. Because the 2s orbital 
has a radial node, the 1s2s function (which is the same as 1s*2s since the 1s function 
is real) also has a radial node. Now the function 1s(1)2s(1)1s(2)2s(2) will be positive 
whenever r₁ and r₂ are either both smaller or both larger than the radial node distance
(R in Figure 5-2). But when one r value is smaller than R and the other is greater,
corresponding to the electrons being on opposite sides of the nodal surface, the product
1s(1)2s(1)1s(2)2s(2) is negative. These positive and negative contributions to K are
weighted by the function 1/r₁₂, which is always positive and hence unable to affect
the sign of the integrand. But 1/r₁₂ is smallest when the electrons are far apart. This
means that 1/r₁₂ tends to reduce the contributions where the electrons are on opposite
sides of the node (i.e., the negative contributions), and so the value of K turns out to be 
positive (although not as large in magnitude as J, which has no negative contribution 
at all). 

Since the integral K is positive, we can see from Eq. (5-54) that the triply degenerate 
energy level lies below the singly degenerate one, the separation between them being 
2K. (We note in passing that these independent-electron wavefunction energies are not 
simply the sum of one-electron energies as was the case when we used H_approx, thereby
ignoring interelectronic repulsion.) 
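
To attach numbers to Eq. (5-54), J and K can be evaluated by straightforward numerical integration. The sketch below is an illustration under assumptions that go beyond the text: it uses the standard hydrogenlike 1s and 2s radial functions with Z = 2, and it relies on the fact that, for s orbitals, the angular integration of 1/r₁₂ leaves 1/r_>, where r_> is the larger of r₁ and r₂. The function names and grid parameters are our own choices. For comparison, the closed-form values commonly quoted for hydrogenlike orbitals are J = 17Z/81 and K = 16Z/729.

```python
import math

Z = 2.0  # nuclear charge of helium

def r1s(r):
    """Hydrogenlike 1s radial function (atomic units)."""
    return 2.0 * Z ** 1.5 * math.exp(-Z * r)

def r2s(r):
    """Hydrogenlike 2s radial function (atomic units)."""
    return Z ** 1.5 / (2.0 * math.sqrt(2.0)) * (2.0 - Z * r) * math.exp(-Z * r / 2.0)

def radial_product(ra, rb):
    """r^2 times a product of two radial functions (vanishes at r = 0)."""
    return lambda r: r * r * ra(r) * rb(r)

def two_electron_integral(f, g, rmax=40.0, n=8000):
    """Trapezoid-rule estimate of the double integral of f(r1) g(r2) / max(r1, r2)."""
    h = rmax / n
    r = [i * h for i in range(n + 1)]
    gv = [g(x) for x in r]
    # inner[i]: integral of g from 0 to r[i]
    inner = [0.0] * (n + 1)
    for i in range(1, n + 1):
        inner[i] = inner[i - 1] + 0.5 * h * (gv[i - 1] + gv[i])
    # outer[i]: integral of g(r')/r' from r[i] to rmax (g/r' -> 0 at the origin)
    g_over_r = [0.0] + [gv[i] / r[i] for i in range(1, n + 1)]
    outer = [0.0] * (n + 1)
    for i in range(n - 1, -1, -1):
        outer[i] = outer[i + 1] + 0.5 * h * (g_over_r[i] + g_over_r[i + 1])
    total = 0.0
    for i in range(1, n + 1):          # the r = 0 point contributes nothing
        weight = 0.5 * h if i == n else h
        total += weight * f(r[i]) * (inner[i] / r[i] + outer[i])
    return total

J = two_electron_integral(radial_product(r1s, r1s), radial_product(r2s, r2s))
K = two_electron_integral(radial_product(r1s, r2s), radial_product(r1s, r2s))

E_one_electron = -Z ** 2 / 2.0 - Z ** 2 / 8.0   # E_1s + E_2s for He+, -2.5 a.u.
E_singlet = E_one_electron + J + K
E_triplet = E_one_electron + J - K
print(J, K, E_singlet, E_triplet)
```

Both integrals come out positive with K much smaller than J, so the triplet average energy lies below the singlet by 2K, as argued above.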

The experimental observation agrees qualitatively with these results. There are two 
state energies associated with the 1s2s configuration. When the atom is placed in an 
external magnetic field, the lower state-energy level splits into three levels. The state
having the higher energy has a “multiplicity” of one and is called a singlet. The lower-energy
level, with multiplicity three, is called a triplet. (The reference to a "triplet state"
should not be construed to mean that this is one state. It is a triplet of states.) 

It is possible to use vector arguments similar to those presented in Chapter 4 to
understand why the triply degenerate level splits into three different levels in the presence of
a homogeneous magnetic field. Let us first consider the case of a single electron. We
have already indicated that two spin states are possible, which we have labeled α and β.
In a magnetic field the angular momentum vectors precess about the field axis z, as
depicted in Figure 5-3. The z components of the angular momentum vectors are con- 
stant but the x and y components are not. Because the allowed z components must 
in general differ by one atomic unit (stated but not proved in Chapter 4), and because 
there are but two allowed values (inferred from observations such as the splitting of 
a beam of silver atoms into two components), and because the two kinds of state are 
oppositely affected by magnetic fields, it is concluded that the z components of angular 
momentum (labeled m_s) are equal to +½ and −½ a.u. for α and β, respectively.
Following through using orbital angular momentum relations as a model, we postulate an
electron spin quantum number s equal to the maximum z-component of spin angular
momentum in a.u., ½, and a spin angular momentum vector s having length √(s(s + 1))
a.u. The degeneracy, g_s = 2s + 1, equals 2, in agreement with the two orientations in
Fig. 5-3. 

As noted in Chapter 4, half-integer quantum numbers correspond to eigenfunctions 
that cannot be expressed as spherical harmonics. We will not pursue the question 
of detailed expressions for α and β here. (However, see the problems at the end of
Chapter 9.) 

Now let us turn to the two-electron system. We will assume that the magnetic 
moments of the two electrons interact independently with the external field. This 
ignores the fact that each electron senses a small contribution to the magnetic field 
resulting from the magnetic moment of the other electron. 

Figure 5-3 » The angular momentum vectors for α and β precess around the magnetic field axis
z. The z components of these vectors are constant and have values of +½ and −½ a.u. respectively.

Figure 5-4 » Energy levels for singlet and triplet levels of 1s2s helium in (a) absence, and
(b) presence of an external magnetic field.

Another factor that could affect the magnetic field sensed by the spin moment is the
magnetic moment resulting from the orbital motions of the electrons, although this is
not present if both electrons are taken to be in s atomic orbitals (AOs). For two electrons,
we can imagine four situations: αα, αβ, βα, and ββ. We pointed out earlier (Section
4-5) that, for a system composed of several moving parts, the total angular momentum
is the sum of the individual angular momenta, and the z component is the sum of the
individual z components. For the four spin situations listed above, this means that the
net z components of spin angular momentum (labeled M_S) are +1, 0, 0, and −1 a.u.,
respectively. The spin combinations αβ + βα [from the triplet (5-43b)] and αβ − βα
[from the singlet (5-42)] are linear combinations of αβ and βα. However, since these
two functions have the same value for the z component of angular momentum (zero), 
their linear combinations will also have that value. It follows that the z components 
of the spin angular momenta of the triplet of states (5-43) are +1, 0, and −1 a.u., and
for the singlet (5-42) it is zero. Because the electrons are charged, these spin angular 
momenta correspond to spin magnetic moments, which interact differently with the 
applied magnetic field to give splitting of the triplet (see Figure 5-4). It is customary 
to refer to all three spin states in (5-43) as having parallel spins even though the vector 
diagram for the (5-43b) state is not particularly in accord with this terminology. For 
the singlet state, the spins are said to be opposed, or antiparallel. 
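
The counting argument above is simple enough to tabulate directly. In this short Python sketch (an illustration with our own labels, "a" for α and "b" for β, and unnormalized combinations), the z components of the two electrons are added for each simple product, and the two M_S = 0 combinations are tested for their behavior under exchange of the electron labels.

```python
from itertools import product

M_S_OF = {"a": 0.5, "b": -0.5}   # z components of spin for alpha and beta, in a.u.

def total_ms(labels):
    """Sum of the individual z components for a product of spin functions."""
    return sum(M_S_OF[s] for s in labels)

# The four simple products: alpha-alpha, alpha-beta, beta-alpha, beta-beta.
simple_products = {"".join(p): total_ms(p) for p in product("ab", repeat=2)}

def exchange(spin_function):
    """Swap the electron labels in a linear combination of spin products."""
    return {key[::-1]: coef for key, coef in spin_function.items()}

triplet_zero = {"ab": 1.0, "ba": 1.0}    # alpha*beta + beta*alpha, as in (5-43b)
singlet = {"ab": 1.0, "ba": -1.0}        # alpha*beta - beta*alpha, as in (5-42)

print(simple_products)
print(exchange(triplet_zero) == triplet_zero)                      # symmetric
print(exchange(singlet) == {k: -c for k, c in singlet.items()})    # antisymmetric
```

Because both terms of each combination have M_S = 0, the combinations themselves have M_S = 0; only their symmetry under exchange distinguishes the triplet member from the singlet.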


5-6 The Self-Consistent Field, Slater-Type Orbitals, and the Aufbau Principle


Up to now we have used wavefunctions that, while not being eigenfunctions of the 
hamiltonian, are eigenfunctions of an "effective hamiltonian" obtained by ignoring the 
interelectronic repulsion operator $1/r_{ij}$. That is, these wavefunctions would be exactly
correct if the electrons in helium were attracted by the nucleus, but did not repel each 
other. For this reason, we have referred to this as an independent electron approxi- 
mation. Because we have neglected interelectronic repulsion, we cannot expect such 
a wavefunction to give very good numerical predictions of charge density or energy. 
We can compare the energy of He in the 1s² (ground) state as predicted by our
independent electron wavefunction and H_approx (−108.84 eV) with the experimental value
(−79.0143 eV). (See Table 5-1.) This shows that the predicted energy is much too low,
which is understandable since we have neglected an important repulsive (hence positive)
interaction energy. But we can account for much of this neglected energy by calculating
the average value of the energy using H (with 1/r₁₂ included) instead of H_approx. This
gives a value of −74.83 eV, which is much better, though now too high by more than 4 eV. Even
though we have now accounted for interelectronic repulsion, there is still a problem: 
Because we ignored interelectronic repulsion in arriving at these wavefunctions, they 
predict electron densities that are too large near the nucleus. In reality, interelectronic 
repulsion prevents so much build-up of charge density. Methods have been devised that 
partially overcome this problem by retaining the convenient form of orbital products 
but modifying the formulas for the orbitals themselves to make them more diffuse.

TABLE 5-1 » Average Values for Energy Calculated from Helium Atom Ground State
Approximate Wavefunctionsᵃ

Wavefunction description                                         Ē (eV)
1) Product of He⁺ orbitals                                       −74.83
2) Product of hydrogenlike orbitals with ζ fixed by SCF
   method                                                        −77.48
3) Best product-type wavefunction                                −77.870917
4) Nonorbital wavefunction of Pekeris [9]. This wavefunction
   uses functions of r₁, r₂ and r₁₂ as coordinates and has
   the form of an exponential times a linear combination of
   1078 terms                                                    −79.00946912

ᵃ Ē = ∫ψ*Hψ dτ / ∫ψ*ψ dτ, where H is given by Eq. (5-2).
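
The first two energies discussed above are easy to reproduce. The sketch below is a hedged illustration, not the method of the text: it assumes the hydrogenlike 1s energy −Z²/2 a.u., the standard value 5Z/8 a.u. for the repulsion between two electrons in the same hydrogenlike 1s orbital (a result not derived in this section), and a rounded conversion factor of 27.2114 eV per atomic unit.

```python
HARTREE_IN_EV = 27.2114   # assumed rounded conversion factor, eV per atomic unit

def independent_electron_energy(Z):
    """Sum of two hydrogenlike 1s energies, -Z^2/2 each (a.u.); no repulsion."""
    return 2 * (-Z ** 2 / 2)

def average_energy_with_repulsion(Z):
    """Average of the full H over the 1s(1)1s(2) product; 5Z/8 is the 1s-1s repulsion."""
    return independent_electron_energy(Z) + 5 * Z / 8

Z = 2
e_approx = independent_electron_energy(Z) * HARTREE_IN_EV
e_average = average_energy_with_repulsion(Z) * HARTREE_IN_EV
print(round(e_approx, 2), round(e_average, 2))
```

The first number falls well below the experimental −79.0143 eV and the second lies above it, in line with entry 1 of Table 5-1.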


Let the ground state of helium be our example. We take the ordinary independent- 
electron wavefunction as our initial approximation: 


$$1s(1)1s(2) = \sqrt{8/\pi}\,\exp(-2r_1)\,\sqrt{8/\pi}\,\exp(-2r_2) \tag{5-55}$$


These atomic orbitals are correct only if electrons 1 and 2 do not "see" each other via 
a repulsive interaction. They really do repel each other, and we can approximate this 
repulsion by saying that electron 2 "sees" electron 1 as a smeared out, time-averaged 
charge cloud rather than the rapidly moving point charge that is actually present. The 
initial description for this charge cloud is just the absolute square of the initial atomic 
orbital occupied by electron 2: [1s(2)]². Our approximation now has electron 1 moving
in the field of a positive nucleus embedded in a spherical cloud of negative charge. 
Thus, for electron 1, the positive nuclear charge is “shielded” or "screened" by electron 
2. Hence, electron 1 should occupy an orbital that is less contracted about the nucleus. 
Let us write this new orbital in the form 


$$1s'(1) = \sqrt{\zeta^3/\pi}\,\exp(-\zeta r_1) \tag{5-56}$$


where ζ is related to the screened nuclear charge seen by electron 1. The mathematical
methods used to evaluate ζ will be described later in this book.

Next we turn to electron 2, which we now take to be moving in the field of the 
nucleus shielded by the charge cloud due to electron 1, now in its expanded orbital. 
Just as before, we find a new orbital of form (5-56) for electron 2. The value of ζ
that we calculate for electron 2, however, will be different from what we found for 
electron 1 because the shielding of the nucleus by electron 1 is different from what it 
was by electron 2 in our previous step. We now have a new distribution for electron 
2, but this means that we must recalculate the orbital for electron 1 since this orbital 
was appropriate for the screening due to electron 2 in its old orbital. After revising 
the orbital for electron 1, we must revise the orbital for electron 2. This procedure 
is continued back and forth between electrons 1 and 2 until the value of ζ converges
to an unchanging value (under the constraint that electrons 1 and 2 ultimately occupy
orbitals having the same value of ζ). Then the orbital for each electron is consistent
with the potential due to the nucleus and the charge cloud for the other electron: the
electrons move in a "self-consistent field" (SCF). 
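
The back-and-forth procedure just described can be imitated in a few lines. The following Python sketch is one possible realization under explicit assumptions, not the method developed later in this book: each electron is restricted to an orbital of form (5-56); the repulsion between two such charge clouds with exponents a and b is taken from the closed form ab(a² + 3ab + b²)/(a + b)³; and at each step the exponent of one electron is chosen to minimize its energy in the field of the nucleus plus the fixed cloud of the other electron. All function names are hypothetical.

```python
def coulomb_1s(a, b):
    """Repulsion (a.u.) between 1s-type clouds with exponents a and b (assumed closed form)."""
    return a * b * (a * a + 3 * a * b + b * b) / (a + b) ** 3

def one_electron_energy(a, b, Z):
    """Energy of an electron with exponent a: kinetic + nuclear + cloud of exponent b."""
    return a * a / 2 - Z * a + coulomb_1s(a, b)

def d_energy(a, b, Z):
    """Derivative of one_electron_energy with respect to a."""
    return a - Z + 1 - (a ** 4 + 4 * a ** 3 * b + 6 * a * a * b * b) / (a + b) ** 4

def best_exponent(b, Z, tol=1e-12):
    """Find the exponent a that makes d_energy zero, by bisection."""
    lo, hi = 1e-6, Z + 1.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if d_energy(mid, b, Z) < 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def scf_zeta(Z=2.0, max_cycles=100, tol=1e-9):
    """Alternate between the two electrons until the exponent stops changing."""
    zeta_other = Z                       # unshielded starting orbital, Eq. (5-55)
    for cycle in range(1, max_cycles + 1):
        zeta_new = best_exponent(zeta_other, Z)
        if abs(zeta_new - zeta_other) < tol:
            return zeta_new, cycle
        zeta_other = zeta_new
    raise RuntimeError("SCF iteration did not converge")

def total_energy(zeta, Z):
    """Average energy of the product of two identical 1s-type orbitals (a.u.)."""
    return 2 * (zeta * zeta / 2 - Z * zeta) + coulomb_1s(zeta, zeta)

zeta, cycles = scf_zeta()
energy_ev = total_energy(zeta, 2.0) * 27.2114   # assumed conversion factor
print(zeta, cycles, energy_ev)
```

Under these assumptions the exponent settles near 1.69, smaller than the unshielded value of 2, and the resulting energy is close to entry 2 of Table 5-1.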

The result of such a calculation is a wavefunction in much closer accord with the 
actual charge density distributions of atoms than that given by the complete neglect of 
interelectron repulsion.⁷ A plot of the electron density distribution in helium as given
by wavefunction (5-55) and by a similar wavefunction with optimized ζ is given in
Fig. 5-5. Because each electron senses only the time-averaged charge cloud of the 
other in this approximation, it is still an independent-electron treatment. The hallmark 
of the independent electron treatment is a wavefunction containing only a product of 
one-electron functions. It will not contain functions of, say, r₁₂, which would make ψ
depend on the instantaneous distance between electrons 1 and 2. 

Figure 5-5 » Electron distribution in helium as given by SCF and unshielded independent electron
approximations. (Plot of 4πr²ψ² versus r in a.u.)

⁷ In practice, mathematical techniques have been found that lead to a self-consistent solution without explicit
iteration between evolving AOs that converge to some final optimized ζ. Examples are described in Chapters 7
and 12. A thorough discussion of the SCF and related methods is given in Chapter 11.

Atomic orbitals that are eigenfunctions for the one-electron hydrogenlike ion (for
integral or nonintegral Z) are called hydrogenlike orbitals. In Chapter 4 we noted that
many hydrogenlike orbitals have radial nodes. In actual practice, this mathematical
aspect causes increased complexity in solving integrals in quantum chemical
calculations. Much more convenient are a class of modified orbitals called Slater-type orbitals
(STOs). These differ from their hydrogenlike counterparts in that they have no radial
nodes. Angular terms are identical in the two types of orbital. The unnormalized radial
term for an STO is

$$R(n, Z, s) = r^{\,n-1}\exp[-(Z - s)r/n] \tag{5-57}$$

where Z is the nuclear charge in atomic units, n is the principal quantum number, and
s is a "screening constant" which has the function of reducing the nuclear charge Z 
"seen" by an electron. Slater [7] constructed rules for determining the values of s 
that will produce STOs in close agreement with those one would obtain by an SCF 
calculation. These rules, appropriate for electrons up to the 3d level, are 


1. The electrons in the atom are divided up into the following groups: 1s|2s,2p|3s, 
3p|3d. 


2. The shielding constant s for an orbital associated with any of the above groups is 
the sum of the following contributions: 


a) Nothing from any electrons in groups to the right (in the above list) of the group 
under consideration 

b) 0.35 from each other electron in the group under consideration (except 0.30 in 
the 1s group) 

c) If the orbital under consideration is an s or p orbital, 0.85 for each electron with 
principal quantum number less by 1, and 1.00 for each electron still “farther in"; 
for a d orbital, 1.00 for all electrons farther in 


For example, nitrogen, with ground state configuration 1s²2s²2p³, would have the
same radial part for the 2s and 2p STOs. This would be given by the formula (n = 2,
Z = 7, s = 4 × 0.35 + 2 × 0.85 = 3.1)

$$R_{2s,2p}(2, 7, 3.1) = r^{(2-1)}\exp[-(7 - 3.1)r/2] = r\exp(-1.95r)$$

For the 1s level, n = 1, Z = 7, s = 0.30, and

$$R_{1s} = \exp(-6.7r)$$


Comparing orbital exponents, we see that the 1s charge cloud is compressed much 
more tightly around the nucleus than are the 2s and 2p “valence orbital” charge clouds. 
Slater-type orbitals are very frequently used in quantum chemistry because they provide 
us with very good approximations to self-consistent field atomic orbitals (SCF-AOs)
with almost no effort. 
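
Slater's rules as listed above translate directly into a short routine. The Python sketch below is our own illustration (the names `GROUPS`, `screening_constant`, and `sto_exponent` are hypothetical) and covers only the groups given in rule 1, that is, electrons up to the 3d level. A configuration is supplied as a dictionary of orbital labels and electron counts.

```python
GROUPS = [("1s",), ("2s", "2p"), ("3s", "3p"), ("3d",)]   # rule 1

def screening_constant(orbital, config):
    """Slater screening constant s for one electron in `orbital`.

    `config` maps orbital labels such as "2p" to electron counts."""
    index = next(i for i, group in enumerate(GROUPS) if orbital in group)
    n = int(orbital[0])
    # Rule 2b: other electrons in the same group (0.30 each in the 1s group).
    same_group = sum(config.get(o, 0) for o in GROUPS[index]) - 1
    s = same_group * (0.30 if orbital == "1s" else 0.35)
    # Rule 2c: groups farther in. Rule 2a: groups to the right contribute nothing.
    for group in GROUPS[:index]:
        count = sum(config.get(o, 0) for o in group)
        inner_n = int(group[0][0])
        if orbital[1] in "sp" and inner_n == n - 1:
            s += 0.85 * count
        else:
            s += 1.00 * count
    return s

def sto_exponent(orbital, Z, config):
    """Orbital exponent (Z - s)/n appearing in Eq. (5-57)."""
    return (Z - screening_constant(orbital, config)) / int(orbital[0])

nitrogen = {"1s": 2, "2s": 2, "2p": 3}
print(screening_constant("2p", nitrogen), sto_exponent("2p", 7, nitrogen))
print(screening_constant("1s", nitrogen), sto_exponent("1s", 7, nitrogen))
```

For nitrogen this reproduces, to within rounding, the values s = 3.1 and 0.30 and the exponents 1.95 and 6.7 worked out above.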

Clementi and Raimondi [8] have published a refined list of rules for the shielding 
constant, which extends to the 4p level. Their rules include contributions to shielding 
due to the presence of electrons in shells outside the orbital under consideration. Such 
contributions are not large, and, up to the 3d level, there is reasonably good agreement 
between these two sets of rules. 

The fact that STOs have no radial nodes results in some loss of orthogonality. Angular 
terms still give orthogonality between orbitals having different l or m quantum numbers,
but STOs differing only in their n quantum number are nonorthogonal. Thus, 1s, 2s,
3s, ... are nonorthogonal. Similarly 2p_z, 3p_z, ... or 3d_xy, 4d_xy, ... are nonorthogonal.
In practice, this feature is handled easily. The only real problem arises if one forgets 
about this nonorthogonality when making certain calculations. 
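
The size of this nonorthogonality is easy to estimate, since the radial integrals of STOs have the elementary form ∫ rᵏ exp(−ar) dr = k!/a^(k+1) over 0 to ∞. The sketch below is an illustration with hypothetical function names; it normalizes the radial term of Eq. (5-57), written with orbital exponent ζ = (Z − s)/n, and evaluates the overlap of the nitrogen 1s and 2s STOs using the exponents 6.7 and 1.95 found earlier.

```python
import math

def sto_norm(n, zeta):
    """Normalization constant of the radial STO r^(n-1) exp(-zeta r)."""
    return math.sqrt((2 * zeta) ** (2 * n + 1) / math.factorial(2 * n))

def sto_radial_overlap(n1, zeta1, n2, zeta2):
    """Overlap of two normalized radial STOs that share the same angular part."""
    power = n1 + n2          # r^2 * r^(n1-1) * r^(n2-1) = r^(n1+n2)
    integral = math.factorial(power) / (zeta1 + zeta2) ** (power + 1)
    return sto_norm(n1, zeta1) * sto_norm(n2, zeta2) * integral

self_overlap = sto_radial_overlap(1, 6.7, 1, 6.7)     # normalization check
overlap_1s_2s = sto_radial_overlap(1, 6.7, 2, 1.95)   # nitrogen 1s with 2s
print(self_overlap, overlap_1s_2s)
```

The 1s-2s overlap comes out clearly different from zero (roughly 0.2), which is why it cannot be ignored in calculations that assume an orthogonal basis.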

When carrying out SCF calculations on multielectronic atoms, one finds that the
orbital energies for 2s and 2p functions are not the same. Similarly, 3s, 3p, and 3d orbitals
are nondegenerate. Yet these orbitals were degenerate in the one-electron hydrogenlike
system in which energy was a function of n but not of l or m. Why are these orbital
energies nondegenerate in the many-electron calculation? A reasonable explanation can
be found by considering the comparative effectiveness with which a pair of 1s electrons 
screen the nucleus from a 3s or a 3p electron. Comparing the 3s, 3p, and 3d hydrogenlike 
orbital formulas in Table 4-2 shows that the 3s orbital is finite at the nucleus, decreasing 
proportionally to r for small r. The 3p orbitals vanish at the nucleus but grow as r for 
small r. The 3d orbitals vanish at the nucleus but grow as r² for small r. The result of
all this is that an s electron spends a larger amount of its time near the nucleus than a p 
electron of the same principal quantum number, the p electron spending more time near 
the nucleus than the d, etc. Hence, the s electron penetrates the "underlying" charge 
clouds more effectively and is therefore less effectively shielded from the nucleus. 
Since the s electron sees a greater effective nuclear charge, its energy is lower than 
that of the p electron. (This effect is not obvious in STOs since the 3s and 3p STOs 
have the same radial function which vanishes at the nucleus. However, the STO for 3d 
does reflect the nondegeneracy since Slater's rules give it a different screening constant 
from 3s or 3p.) 
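
The penetration argument can be made quantitative by asking what fraction of each n = 3 charge cloud lies close to the nucleus. The following Python sketch is an illustration only: it assumes the standard hydrogen (Z = 1) radial functions for 3s, 3p, and 3d in atomic units, integrates r²R² with the trapezoid rule, and uses r < 1 a.u. as an arbitrary stand-in for the region occupied by inner-shell charge.

```python
import math

def r3s(r):
    """Hydrogen 3s radial function (Z = 1, atomic units)."""
    return 2 / math.sqrt(27) * (1 - 2 * r / 3 + 2 * r * r / 27) * math.exp(-r / 3)

def r3p(r):
    """Hydrogen 3p radial function (Z = 1, atomic units)."""
    return 8 / (27 * math.sqrt(6)) * r * (1 - r / 6) * math.exp(-r / 3)

def r3d(r):
    """Hydrogen 3d radial function (Z = 1, atomic units)."""
    return 4 / (81 * math.sqrt(30)) * r * r * math.exp(-r / 3)

def probability_inside(radial, rmax, n=20000):
    """Integrate r^2 R(r)^2 from 0 to rmax with the trapezoid rule."""
    h = rmax / n
    total = 0.0
    for i in range(n + 1):
        r = i * h
        weight = 0.5 if i in (0, n) else 1.0
        total += weight * r * r * radial(r) ** 2
    return total * h

p3s = probability_inside(r3s, 1.0)
p3p = probability_inside(r3p, 1.0)
p3d = probability_inside(r3d, 1.0)
print(p3s, p3p, p3d)
```

The three probabilities decrease sharply in the order 3s, 3p, 3d, which is the ordering of penetration, and hence of effective nuclear charge, described above.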

The tendency for higher l values to be associated with higher orbital energies leads
to the following orbital ordering: 


1s 2s 2p 3s 3p 4s 3d 4p 5s 4d 5p 6s 5d 4f 6p 7s 6d 5f ... (5-58)


When we get to principal quantum numbers of 3 and higher, the energy differences 
between different l values for the same n become comparable to the differences between
different n levels. Thus, in some atoms, the 4s level is almost the same as the 3d level, 
etc. 

In compiling data on ground states of atoms, Hund noticed that greatest stability 
results if the AOs in a degenerate set are half-filled with electrons before any of them 
are filled. This generalization, called Hund’s rule, is sometimes stated in an alternative 
form: Of the states associated with the ground state configuration of an atom or ion, 
those with greatest spin multiplicity lie deepest in energy. Chemists generally find the 
former version to be more convenient, spectroscopists the latter. The reason for the 
equivalence of these statements will emerge later in this chapter. 


EXAMPLE 5-4 An unexcited Fe atom has an electronic configuration of
1s²2s²2p⁶3s²3p⁶4s²3d⁶. What is its spin multiplicity?

SOLUTION » All electrons below 3d are spin-paired in orbitals, hence contribute nothing to
M_S. In 3d⁶, we have 4 electrons that can each occupy a 3d AO alone. If we follow Hund's
rule and seek maximum spin multiplicity, we make all their spins the same (α). Then the maximum
M_S = 4 · ½ = 2, so S = 2, and spin multiplicity is 2S + 1 = 5. The atom has a quintet ground “state”
(really five states).
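
The counting in this example generalizes to any partly filled subshell. The short Python sketch below is our own illustration with hypothetical function names; it assumes that Hund's rule is followed, so that the 2l + 1 orbitals of the subshell are singly occupied with parallel spins before any pairing occurs, and that all other subshells are filled and contribute nothing to the spin.

```python
def unpaired_electrons(l, n_electrons):
    """Unpaired electrons when a subshell of given l is filled according to Hund's rule."""
    orbitals = 2 * l + 1
    if not 0 <= n_electrons <= 2 * orbitals:
        raise ValueError("electron count does not fit this subshell")
    if n_electrons <= orbitals:
        return n_electrons                # one electron per orbital, spins parallel
    return 2 * orbitals - n_electrons     # the extra electrons pair up

def spin_multiplicity(l, n_electrons):
    """Return 2S + 1, where S is half the number of unpaired electrons."""
    return unpaired_electrons(l, n_electrons) + 1

print(spin_multiplicity(2, 6))   # 3d6, as in Example 5-4
print(spin_multiplicity(1, 2))   # 2p2
```

For the 3d⁶ case this gives the quintet found in the example.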


The energy-ordering scheme (5-58) coupled with the Pauli or exclusion principle 
and Hund's rule leads us to a simple prescription for "building up" the electronic con- 
figurations of atoms. This aufbau principle is familiar to chemists and leads naturally 
to a correlation between electronic structure and the periodic table. The procedure is to 
place all the electrons of the atom into atomic orbitals, two to an orbital, starting at the
low-energy end of the list (5-58) and working up in energy. In addition, when filling a
set of degenerate levels like the five 3d levels, one half-fills all the levels with electrons 
of parallel spin before filling any of them. This prescription enables one to guess the 
electronic configuration of any atom, once its atomic number is known, unless it hap- 
pens to put us into a region of ambiguity, where different levels have almost the same 
energy. (Electronic configurations for such atoms are deduced from experimentally 
determined chemical, spectral, and physical properties.) The configuration for carbon 
(atomic number 6) would be 1s?2s?2p?, with the understanding that p electrons occupy 
different p orbitals and have parallel spins. (Recall that we expect the most stable of all 
the states arising from the configuration 1s?2s?2p? to be the one of highest multiplicity. 
The 2p electrons can produce either a singlet or a triplet state just as could the two 
electrons in the 1s2s configuration of helium. The triplet should be the ground state and 
this corresponds to parallel spins, which requires different p orbitals by the exclusion 
principle.) 

It is important to realize that the orbital ordering (5-58) used in the aufbau process 
is not fixed, but depends on the atomic number Z. The ordering in (5-58) cannot be 
blindly followed in all cases. For instance, the ordering shows that 5s fills before 4d. 
It is true that element 38, strontium, has a $\cdots 4p^65s^24d^0$ configuration. But a later
element, palladium, number 46, has $\cdots 4p^64d^{10}5s^0$ as its ground state configuration.
The effect of adding more protons and electrons has been to depress the 4d level more 
than the 5s level. 
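The aufbau prescription, and its failure for palladium, can be illustrated with a short Python sketch. The filling order below is an assumption: it is the usual textbook sequence, written out because list (5-58) is not reproduced here. The names `ORDER`, `CAPACITY`, and `naive_aufbau` are hypothetical.

```python
# Assumed filling order (the usual textbook sequence), standing in for (5-58).
ORDER = ["1s", "2s", "2p", "3s", "3p", "4s", "3d", "4p", "5s", "4d", "5p"]
CAPACITY = {"s": 2, "p": 6, "d": 10, "f": 14}

def naive_aufbau(z):
    """Place z electrons into subshells strictly in ORDER, two per orbital."""
    config = []
    remaining = z
    for subshell in ORDER:
        if remaining == 0:
            break
        n = min(remaining, CAPACITY[subshell[-1]])
        config.append((subshell, n))
        remaining -= n
    if remaining:
        raise ValueError("ORDER is too short for this atomic number")
    return config

def show(config):
    return " ".join(f"{subshell}^{n}" for subshell, n in config)

print(show(naive_aufbau(6)))    # carbon
print(show(naive_aufbau(46)))   # palladium, filled blindly
```

For carbon the blind filling gives the expected configuration. For palladium it ends in $5s^24d^8$ rather than the observed $4d^{10}5s^0$, which is the kind of case where a fixed ordering cannot be trusted.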


5-7 Electron Angular Momentum in Atoms


Most of our attention thus far has been with wavefunction symmetry and energy. 
However, understanding atomic spectroscopy or interatomic interactions (in reactions 
or scattering) requires close attention to angular momentum due to electronic orbital 
motion and “spin.” In this section we will see what possibilities exist for the total 
electronic angular momenta of atoms and how these various states are distinguished 
symbolically. 

We encountered earlier (Section 4-5) the notion that the total angular momentum for 
a classical system is the vector sum of the angular momenta of its parts. If the system 
interacts with a z-directed field, the total angular momentum vector precesses about 
the z axis, so the z component continues to be conserved and continues to be equal to 
the sum of z components of the system’s parts. Since quantum hydrogenlike systems 
obey angular momentum relations analogous to a precessing classical system, it is this 
z-axis behavior that we focus on as we seek to construct the nature of the total angular 
momentum from the orbital and spin parts we already understand. 

Because it is the total angular momentum that is conserved in a multicomponent 
classical system, it is the total angular momentum that obeys the quantum rules we have 
previously described for separate spin and orbital components. If we consider a one- 
electron system, the combined spin-orbital angular momentum can be associated with 
a quantum number symbolized by j (analogous to s and l). Then we can immediately
say that the allowed z components of total angular momentum are, in a.u., $m_j = \pm j$,
$\pm(j - 1), \ldots$ and that the length of the vector is $\sqrt{j(j+1)}$ a.u.

The implication of accepting total angular momentum as the fundamental quantized 
quantity is that the spin and orbital angular momenta do not individually obey the
quantum rules we have so far applied to them: s and l are not “good” quantum numbers.
However, for atoms of low atomic number they are in fact quite good, especially for
low-energy states, and we can continue to refer to the s and l quantum numbers in such
cases with some confidence. (Classically, this corresponds to cases where there is little 
transfer of angular momentum between modes.) 


5-7.A Combined Spin-Orbital Angular Momentum for One-Electron Ions


The key to understanding the following discussion is to remember that a quantum
number l, s, or j really tells us three things:


1. It equals the maximum value of $m_l$, $m_s$, or $m_j$. If $l = 2$, the maximum allowed value
of $m_l$ is 2, and the maximum z component of orbital angular momentum is 2 a.u.


2. It allows us to know the length of the related angular momentum vector, l, s, or j,
in a.u. For j, this is given by $\sqrt{j(j+1)}$. If $j = 2$, the length of the total angular
momentum vector j is $\sqrt{6}$ a.u.


3. It allows us to know the degeneracy, g, of the energy level due to states having this
angular momentum. For s, this is $2s + 1$. If $s = 1/2$, $g = 2$. The corresponding l
degeneracies produce the s, p, d, f degeneracies of 1, 3, 5, 7.
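The three facts carried by a quantum number can be collected in a small helper. The following Python sketch is illustrative only; `quantum_number_facts` is a hypothetical name, and all values are in a.u.

```python
import math
from fractions import Fraction

def quantum_number_facts(q):
    """For a quantum number q (l, s, or j) return its maximum z component,
    the length of the associated vector, and the degeneracy 2q + 1."""
    q = Fraction(q)
    return {
        "max_m": q,                              # largest allowed m value
        "length": math.sqrt(q * (q + 1)),        # sqrt(q(q + 1)) in a.u.
        "degeneracy": int(2 * q + 1),            # number of orientations
    }

for name, q in [("l = 2", 2), ("s = 1/2", Fraction(1, 2))]:
    print(name, quantum_number_facts(q))
```

Half-integer values are passed as `Fraction` objects so that the maximum z component and the degeneracy stay exact.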


Using the hydrogen atom as our example, let us consider what the total electronic
angular momentum is in the ground (1s) state. For an s AO, $l = 0$, and so there is no
orbital angular momentum. This means that the total angular momentum is the same
as the spin angular momentum, so $j = s = 1/2$, $m_j = \pm 1/2$. The diagram for the vector
j, then, looks just like that for s (Fig. 5-3).


Figure 5-6 » (a) Maximum z components of orbital and spin angular momenta for a p electron
leading to a total z component of 3/2. (b) The four states corresponding to the j = 3/2 vector assuming
its possible z intercepts (3/2, 1/2, -1/2, -3/2).

Figure 5-7 » (a) j = 1/2, resulting when l is oriented with its maximum z intercept and s is oriented
in its other orientation (other than as in Fig. 5-6). (b) The two states corresponding to the j = 1/2
vector assuming its possible z intercepts (1/2, -1/2).


More interesting is an excited state, say 2p. Now $l = 1$ and $s = 1/2$. From $l = 1$ we
can say that the maximum orbital z component of angular momentum is 1 a.u.; $s = 1/2$
tells us that there is an additional maximum spin z component of 1/2. The maximum
sum, then, is 3/2 for the z component of j. But, if this is the maximum $m_j$, then j
itself must be 3/2 and the length of j must be $\sqrt{(3/2)(5/2)} = 1.94$ a.u. There must be
$2j + 1 = 4$ allowed orientations of j, with z intercepts at 3/2, 1/2, -1/2, -3/2 in a.u.
(Fig. 5-6).

We are not yet finished with the 2p possibilities. The total angular momentum is
the sum of its orbital and spin parts, and we have so far found the way they combine
to give the maximum z component. But this is not the only way they can combine. It
is possible to have $m_l = 1$ and $m_s = -1/2$. Then the maximum $m_j = 1/2$, so $j = 1/2$,
giving us a vector j of length $\sqrt{(1/2)(3/2)}$ a.u. and two orientations (Fig. 5-7).

So far we have identified six states, four with $j = 3/2$ and two with $j = 1/2$. This
is all we should expect since we have three 2p AOs and two spins, giving a total
of six combinations. It seems, though, that we could generate some more states by
now letting $m_l = 0$ or $-1$ and combining these with $m_s = \pm 1/2$. However, these
possibilities are already implicitly accounted for in the multiplicity of states we recognize
to be associated with the $j = 3/2, 1/2$ cases already found. This illustrates
the general approach to be taken when combining two vectors: Orient the larger
vector to give maximum z projection, and combine this projection with each of the
allowed z components of the smaller vector. This gives all of the possible $m_j$(max)
values, hence all of the j values. In other words, it gives us all of the allowed vectors,
j, each oriented with maximum z component, and it remains only to recognize
that these can have certain other orientations corresponding to z intercepts of $m_j - 1$,
$m_j - 2, \ldots, -m_j$.
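The vector-combination recipe just described can be written as a short Python sketch. The function names `couple` and `orientations` are hypothetical; the logic is simply "maximum projection of the larger vector plus each projection of the smaller."

```python
from fractions import Fraction

def couple(a, b):
    """Resultant quantum numbers when vectors with quantum numbers a and b
    combine: max projection of the larger plus each projection of the smaller."""
    a, b = Fraction(a), Fraction(b)
    larger, smaller = max(a, b), min(a, b)
    return [larger + smaller - k for k in range(int(2 * smaller) + 1)]

def orientations(j):
    """Allowed z intercepts j, j - 1, ..., -j."""
    return [j - k for k in range(int(2 * j) + 1)]

j_values = couple(1, Fraction(1, 2))          # a 2p electron: l = 1, s = 1/2
for j in j_values:
    print(f"j = {j}: m_j =", ", ".join(str(m) for m in orientations(j)))
print("total states:", sum(len(orientations(j)) for j in j_values))
```

For the 2p electron this yields the four $j = 3/2$ states and the two $j = 1/2$ states counted above, six in all.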

States can be labeled to reflect all of the angular momentum parts they possess. The
main symbol is simply s, p, d, f, g, etc. depending on the l value as usual. A superscript
at left gives spin multiplicity ($2s + 1$) for the states. A subscript at right tells the j
quantum number for the states. If an individual member of the group of states having
the same j value is to be cited, it is identified by placing its $m_j$ value at upper right.
Thus, all six of the states discussed above can be referred to as $^2p$ states. The two
groups having different total angular momentum are distinguished as $^2p_{3/2}$ and $^2p_{1/2}$.
One of the four states in the former group is the $^2p_{3/2}^{1/2}$ state.
The general form of the symbol is $^{2s+1}l_j^{m_j}$. Such symbols are normally called term
symbols, and the collection of states they refer to is called a term (except when an
individual state is denoted by inclusion of the $m_j$ value).

The reason for distinguishing between the $^2p_{3/2}$ and $^2p_{1/2}$ terms is that they occur
at slightly different energies. This results from the different energies of interaction
between the magnetic moments due to spin and orbital motions. For instance, if l and
s are coupled so as to give the maximum j, their associated magnetic moments are
oriented like two bar magnets side by side with north poles adjacent. This is a higher-
energy arrangement than the other extreme, where l and s couple to give minimum j,
acting as a pair of parallel bar magnets with the north pole of each next to the south
pole of the other. So $^2p_{1/2}$ should be lower in energy than $^2p_{3/2}$ for hydrogen.


EXAMPLE 5-5 For a hydrogen atom having $n = 3$, $l = 2$, what are the possible
j values, and how many states are possible? Indicate the lengths of the j vectors
in a.u. What term symbols apply?


SOLUTION » If $l = 2$ (d states), there are five AOs and two possible spins, so we expect a total
of ten possible states. The maximum possible values of the z component of angular momentum for
orbital and spin respectively are 2 and 1/2. So the maximum value of j is 5/2, giving a vector length
of $\sqrt{35/4}$ a.u. and six possible z projections, hence six states. The term symbol is $^2d_{5/2}$. The
remaining possible j value is $2 - 1/2 = 3/2$, accounting for four more states and giving a vector
length of $\sqrt{15/4}$ a.u. and a term symbol of $^2d_{3/2}$.


5-7.B Spin-Orbital Angular Momentum for Many-Electron Atoms 


Much of what we have seen for one-electron ions continues to hold for many-electron
atoms or ions. All the symbolism is the same, except that capital letters replace lowercase:
The quantum numbers are L, S, and J, and the main symbol becomes S, P, D,
F, G, etc. There is a total orbital angular momentum vector L with quantum number
L that equals the maximum value of $M_L$. The length of L is $\sqrt{L(L+1)}$ a.u., and it
has $2L + 1$ orientations. Vectors S and J behave analogously. When constructing the
vectors J, we continue to place the larger of L and S to give maximum z intercept,
and add to this the possible z intercepts of the smaller vector. The situation, then, is
just as before except that we need to figure out the possible values for $M_L$ and $M_S$ by
combining the allowed values of $m_l(1), m_l(2), \ldots$ and $m_s(1), m_s(2), \ldots$⁸


⁸ This procedure of first combining individual orbital contributions to find L and spin contributions to find S
and then combining these to get J is referred to as “L–S coupling,” or “Russell-Saunders coupling.” The other
extreme is to first combine l and s for the first electron to give j(1), l and s for the second electron to give j(2), ...
and then combine these individual-electron j's to give J. This is more appropriate for atoms having high atomic
number (in which electrons move at relativistic speeds in the vicinity of the nucleus), and is referred to as “j–j
coupling.” We will not describe j–j coupling in this text. The reader should consult Herzberg [6] for a fuller
treatment.
For example, if we have found that $M_L$(max) = 2 (which means $L = 2$) and
$M_S$(max) = 1 (which means $S = 1$), we have that $M_J$(max) can be 2 + 1, 2 + 0, and
2 + (-1), or 3, 2, and 1. This means that the possible values of J are 3, 2, 1, giving
three different J vectors. Since $L = 2$ and $S = 1$, the term symbols for these three J
cases are $^3D_3$, $^3D_2$, and $^3D_1$. Notice that the multiplicities of these three terms (7,
5, and 3, respectively, obtained from $2J + 1$) total 15 states, which is just what we
should expect for the $^3D$ symbol (spin multiplicity of 3, orbital multiplicity of 5). The
15 triplet-D states are found in three closely spaced levels, differing in energy because
of different spin-orbital magnetic interactions.
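As a check on this bookkeeping, here is a minimal Python sketch that lists the J levels for a given L and S and confirms that the $2J + 1$ multiplicities add up to $(2S + 1)(2L + 1)$. The name `levels` is hypothetical, and term symbols are printed in a flattened form such as `3D2` for $^3D_2$.

```python
from fractions import Fraction

LETTERS = "SPDFGHIK"   # main symbol for L = 0, 1, 2, ...

def levels(L, S):
    """List (term symbol, number of states) for every J from L + S down to |L - S|."""
    S = Fraction(S)
    top, bottom = L + S, abs(L - S)
    result = []
    for k in range(int(top - bottom) + 1):
        J = top - k
        result.append((f"{int(2 * S + 1)}{LETTERS[L]}{J}", int(2 * J + 1)))
    return result

triplet_D = levels(L=2, S=1)
print(triplet_D)
print(sum(n for _, n in triplet_D), "states; expected", (2 * 1 + 1) * (2 * 2 + 1))
```

The same function handles half-integer S if it is passed as a `Fraction`, for example `levels(1, Fraction(1, 2))` for the one-electron 2p case.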

The problem remains, how do we find the $M_L$ and $M_S$ values that allow us to construct
term symbols? There are two situations to distinguish in this context, and a different 
approach is taken for each. 

1. Nonequivalent Electrons. The first situation is exemplified by carbon in its
$1s^22s^22p3p$ configuration. It is not difficult to show that the electrons in the 1s and
2s AOs contribute no net angular momentum and can be ignored: The spins of paired
electrons are opposed, hence cancel, and the s-type AOs have no angular momentum,
hence cannot contribute. However, even p, d, etc. sets of AOs cannot contribute if they
are filled because then any orbital momentum having z intercept $m_l$ is canceled by one
with $-m_l$. The important result is that filled subshells do not contribute to orbital or
spin angular momentum. The remaining 2p and 3p electrons occupy different sets of
AOs, hence are called nonequivalent electrons.

Since these electrons are never in the same AO, they are not restricted to have opposite 
spins at any time—their AO and spin assignments are independent. There are three AO 
choices ($p_1$, $p_0$, $p_{-1}$) and two spin choices (six possibilities) for each electron, hence
36 unique possibilities. We should expect, therefore, 36 states to be included in our 
final set of terms. 

We first find the possible L values. $m_l$ for each electron is 1, 0, or $-1$. We orient the
larger of the l vectors to give the maximum $m_l(1) = +1$ and orient the second l in all
possible ways, giving $m_l(2) = +1, 0, -1$. (Since the vectors have equal length in this
case there is no “larger-smaller” choice to make.) The net $M_L$ values are +2, +1, 0,
and this tells us that the possible L values are 2, 1, 0.

Treating $m_s$ values similarly gives $M_S = 1, 0$, so $S = 1, 0$.

Thus, we have three L vectors and two S vectors. We now combine every one of the 
L, S pairs. In each case, we again take the longer in its position of greatest z overlap 
and combine it with the shorter in all of its orientations. This gives the J values shown 
in Table 5-2. The appropriate term symbols follow from L, S, and J in each case. 
Thus, our term symbols are $^3D_3$, $^3D_2$, $^3D_1$, $^1D_2$, $^3P_2$, $^3P_1$, $^3P_0$, $^1P_1$, $^3S_1$, and $^1S_0$ for a
total of 7 + 5 + 3 + 5 + 5 + 3 + 1 + 3 + 3 + 1 = 36 states.
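The enumeration for two nonequivalent electrons is mechanical enough to sketch in Python. The helper names below are hypothetical, and the code assumes two spin-1/2 electrons in different subshells, so that every (L, S) pair is allowed. Term symbols are again printed in flattened form (`3D3` for $^3D_3$).

```python
LETTERS = "SPDFGH"   # main symbol for L = 0, 1, 2, ...

def coupled(a, b):
    """Integer quantum numbers from a + b down to |a - b|."""
    return list(range(a + b, abs(a - b) - 1, -1))

def two_electron_terms(l1, l2):
    """All (term symbol, 2J + 1) pairs for two nonequivalent electrons."""
    result = []
    for L in coupled(l1, l2):
        for S in (1, 0):                 # two spin-1/2 electrons: triplet or singlet
            for J in coupled(L, S):
                result.append((f"{2 * S + 1}{LETTERS[L]}{J}", 2 * J + 1))
    return result

terms = two_electron_terms(1, 1)         # the 2p3p configuration
print(", ".join(symbol for symbol, _ in terms))
print(len(terms), "terms,", sum(n for _, n in terms), "states")
```

For 2p3p this gives the 10 terms and 36 states found above, in the same order.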

In the absence of external fields, these 36 states occur in 10 energy levels, one for
each term. These lie at different energies for several reasons. We have already seen,
in our discussion of 1s2s helium states, that different spin multiplicities are associated
with different symmetries of the spin wavefunction, meaning that the space parts of
the wavefunctions also differ in symmetry. This has a significant effect on energy, so
$^1S$ and $^3S$, for example, have rather different energies. Different L values amount to
different occupancies of AOs, which also has an effect on the spatial wavefunctions, so
$^3P$ and $^3S$ have different energies. Finally, we have already seen that different J values
correspond to different relative orientations of orbital and spin angular momentum
vectors, hence of magnetic moments. For light atoms, this is a relatively small effect,
so $^3P_2$, $^3P_1$, and $^3P_0$ have only slightly different energies. The resulting energies for
states of carbon in $1s^22s^22p^2$, $1s^22s^22p3p$, and $1s^22s^22p4p$ configurations are shown
in Fig. 5-8. Only the major term-energy differences are distinguishable on the scale of
the figure. The line for $^3D$ is really three very closely spaced lines corresponding to
$^3D_3$, $^3D_2$, and $^3D_1$ terms.

TABLE 5-2 » L and S Values for Two Nonequivalent Electrons and Resulting J Values
and Term Symbols

L    S    J    Term
2    1    3    $^3D_3$
2    1    2    $^3D_2$
2    1    1    $^3D_1$
2    0    2    $^1D_2$
1    1    2    $^3P_2$
1    1    1    $^3P_1$
1    1    0    $^3P_0$
1    0    1    $^1P_1$
0    1    1    $^3S_1$
0    0    0    $^1S_0$

2. Zeeman Effect. It was pointed out in Section 4-6 that the orbital energies of
a hydrogen atom corresponding to the same n but different $m_l$ undergo splitting when
a magnetic field is imposed. Now we have seen that spin angular momentum is also
present. Therefore, a proper treatment of the Zeeman effect requires that we focus on
total angular momentum, not just the orbital component. Since there are $2J + 1$ states
with different $M_J$ values in a given term, we expect each term to split into $2J + 1$
evenly separated energies in the presence of a magnetic field, and this is indeed what
is seen to happen (through its effects on lines in the spectrum). For example, a $^3P_2$
term splits into five closely spaced energies, corresponding to $M_J = 2, 1, 0, -1, -2$,
and a $^1P_1$ term splits into three energies.

A surprising feature of this phenomenon is that the amount of splitting is not the
same for all terms, despite the fact that adjacent members of any term always differ by
±1 unit of angular momentum on the z axis. For instance, the spacing between adjacent
members of the $^3P_2$ term mentioned above is 1.50 times greater than that in the $^1P_1$
term. It was recognized that terms wherein $S = 0$, so that J is entirely due to orbital
angular momentum ($J = L$), undergo “normal splitting,” i.e., equal to what classical
physics would predict for the amount of angular momentum and charge involved. On
the other hand, terms wherein J is entirely due to spin ($L = 0$, so $J = S$) undergo
twice the splitting predicted from classical considerations. [This extra factor of two
(actually 2.0023) was without theoretical explanation until Dirac’s relativistic treatment
of quantum mechanics.]

Figure 5-8 » Energy levels for carbon atom terms resulting from configurations $1s^22s^22p^2$,
$1s^22s^22p3p$, and $1s^22s^22p4p$.

Terms wherein J contains contributions from both L and S have Zeeman splittings
other than one or two times the normal value, depending on the details of the way L and
S are combined. The extent to which a term member's energy is shifted by a magnetic
field of strength B is

$$\Delta E = g \beta_e M_J B \qquad (5\text{-}59)$$

where $\beta_e$ is the Bohr magneton (Appendix 10) and g is the Landé g factor, which
accounts for the different effects of L and S on magnetic moment that we have been
discussing:

$$g = 1 + \frac{J(J+1) + S(S+1) - L(L+1)}{2J(J+1)} \qquad (5\text{-}60)$$

It is not difficult to see that this formula equals one when $S = 0$, $J = L$, and equals two
when $L = 0$, $J = S$. For the $^3P_2$ term, $S = 1$, $L = 1$, $J = 2$, and g equals 1.5, indicating
that, in this state, half of the z component of angular momentum is due to orbital
motion, and half is due to spin (which is double-weighted in its effect on magnetic
moment).
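The Landé factor and the resulting Zeeman pattern are easy to evaluate exactly. The Python sketch below uses rational arithmetic; the function names are hypothetical, and the shifts are reported in units of $\beta_e B$ so that no numerical value of the Bohr magneton or field strength needs to be assumed.

```python
from fractions import Fraction

def lande_g(L, S, J):
    """Lande g factor, g = 1 + [J(J+1) + S(S+1) - L(L+1)] / [2J(J+1)]."""
    L, S, J = Fraction(L), Fraction(S), Fraction(J)
    if J == 0:
        raise ValueError("g is undefined for J = 0 (a single, unshifted state)")
    return 1 + (J * (J + 1) + S * (S + 1) - L * (L + 1)) / (2 * J * (J + 1))

def zeeman_shifts(L, S, J):
    """Energy shifts g * M_J of the 2J + 1 members, in units of beta_e * B."""
    g = lande_g(L, S, J)
    J = Fraction(J)
    return [g * (J - k) for k in range(int(2 * J) + 1)]

print("g(3P2) =", lande_g(1, 1, 2))
print("g(1P1) =", lande_g(1, 0, 1))
print("3P2 shifts:", [str(x) for x in zeeman_shifts(1, 1, 2)])
```

The ratio of the two g values is the factor of 1.50 in the spacing quoted above for $^3P_2$ relative to $^1P_1$.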

3. Equivalent Electrons. Observe that the energy-level diagram for carbon
(Fig. 5-8) shows the 10 expected terms for the excited 2p3p and 2p4p configurations,
but not for the ground $2p^2$ configuration. There are no new terms for the latter case,
but some of the terms present for 2p3p or 2p4p are gone, namely $^3D$, $^3S$, and $^1P$. The
remaining terms account for 15 states. Evidently 21 states that are possible for a pair
of nonequivalent p electrons are not allowed for a pair of equivalent electrons in a $p^2$
configuration. We will see that some of the states that are different for nonequivalent
electrons become one and the same for equivalent electrons and must be excluded.
Others are excluded by the Pauli exclusion principle because they would require two
electrons to be in the same AO with the same spin.

We now demonstrate the method for discovering the terms that exist for equivalent 
electrons. This is more difficult than for nonequivalent electrons, even though there are 
fewer terms. We first list all the orbital-spin combinations (called microstates), strike 
out those that are redundant or that violate the Pauli exclusion principle, and then infer 
from the remaining microstates what terms exist. 

Taking the $p^2$ case for illustration, we begin with the 36 microstates listed in
Table 5-3. Some of these microstates are equivalent to others. For instance,
$2p_1(1)2p_1(2)\alpha(1)\beta(2)$ is not a different state from $2p_1(1)2p_1(2)\beta(1)\alpha(2)$. These both
correspond to a pair of electrons in the same pair of spin orbitals, $2p_1\alpha$ and $2p_1\beta$. Since
electrons are indistinguishable, we cannot expect wavefunctions differing only in the
order of electron labels to correspond to different physical states. [The single state that
does exist would be accurately represented by $2p_1(1)2p_1(2)[\alpha(1)\beta(2) - \beta(1)\alpha(2)]$,
which is a linear combination of the microstates. But we do not need to go to this level
of detail when finding terms. We only need to recognize that there is but one state here
and omit one of the microstates as superfluous.] Accordingly, we strike out rows 3, 19,
and 35 from Table 5-3, labeling them “R” for redundant.

Another way to recognize this equivalence is to observe that the microstates deemed 
redundant differ only by an interchange of a pair of electrons. This reveals that the 
set of four microstates with $2p_1(1)2p_0(2)$ is equivalent to the set with $2p_0(1)2p_1(2)$.
Therefore, we can strike out rows 13-16. A similar argument removes rows 25-32. 
Already we have removed 15 microstates. 

Next we look for violations of the Pauli exclusion principle. This leads us to strike 
out rows 1, 4, 17, 20, 33, and 36, labeling them “P” for Pauli. Our remaining microstates 
number 15 and are reassembled in Table 5-4, along with values of the quantum numbers 
for z components of the relevant angular momentum vectors for individual electrons as 
well as for their sum. 
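The striking-out procedure amounts to replacing ordered pairs of spin orbitals by unordered pairs of distinct spin orbitals. A minimal Python sketch of that count follows; the variable names and the `'a'`/`'b'` spin labels are hypothetical conveniences.

```python
from itertools import combinations, product

# Spin orbitals of a p subshell as (m_l, spin) pairs; 'a' = alpha, 'b' = beta.
SPIN_ORBITALS = [(ml, spin) for ml in (1, 0, -1) for spin in ("a", "b")]

unrestricted = list(product(SPIN_ORBITALS, repeat=2))   # labeled electrons
pauli = [pair for pair in unrestricted if pair[0] == pair[1]]   # same spin orbital twice
allowed = list(combinations(SPIN_ORBITALS, 2))          # unordered, distinct
redundant = len(unrestricted) - len(pauli) - len(allowed)

print(len(unrestricted), "unrestricted,", len(pauli), "Pauli-forbidden,",
      redundant, "redundant,", len(allowed), "allowed")
```

The four counts correspond to the 36 rows of the unrestricted list, the 6 rows struck out as "P", the 15 struck out as "R", and the 15 microstates that survive.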

At this stage of the argument, the final column of Table 5-4 (the term symbols) is not
yet known. We are about to fill out this column by making use of a simple rule that is
based on the diagrammatic device described earlier: placing the larger vector so that
it has the maximum z extension and then placing the shorter vector in all its allowed
orientations. It is not difficult to see that the maximum resultant z component ($M_J$) can
be achieved in one and only one way, namely when both vectors give their maximum
z projection. This means that the maximum-$M_J$ member of a given set of states in
the same term should be recognized as corresponding to one and only one microstate,
because there is only one way to achieve this orientation. So we look for this maximum
$M_J$ and, from its microstate, get the L and S values that go with it. That gives us the
information we need to establish the term symbol.

TABLE 5-3 » Unrestricted List of Space-Spin Combinations for a
Pair of Electrons (Same Subshell). R = “Redundant,” P = “Pauli”

Row   AO(1)      AO(2)      Spin(1)    Spin(2)    Comment
 1    $p_1$      $p_1$      $\alpha$   $\alpha$   P
 2    $p_1$      $p_1$      $\alpha$   $\beta$
 3    $p_1$      $p_1$      $\beta$    $\alpha$   R
 4    $p_1$      $p_1$      $\beta$    $\beta$    P
 5    $p_1$      $p_0$      $\alpha$   $\alpha$
 6    $p_1$      $p_0$      $\alpha$   $\beta$
 7    $p_1$      $p_0$      $\beta$    $\alpha$
 8    $p_1$      $p_0$      $\beta$    $\beta$
 9    $p_1$      $p_{-1}$   $\alpha$   $\alpha$
10    $p_1$      $p_{-1}$   $\alpha$   $\beta$
11    $p_1$      $p_{-1}$   $\beta$    $\alpha$
12    $p_1$      $p_{-1}$   $\beta$    $\beta$
13    $p_0$      $p_1$      $\alpha$   $\alpha$   R
14    $p_0$      $p_1$      $\alpha$   $\beta$    R
15    $p_0$      $p_1$      $\beta$    $\alpha$   R
16    $p_0$      $p_1$      $\beta$    $\beta$    R
17    $p_0$      $p_0$      $\alpha$   $\alpha$   P
18    $p_0$      $p_0$      $\alpha$   $\beta$
19    $p_0$      $p_0$      $\beta$    $\alpha$   R
20    $p_0$      $p_0$      $\beta$    $\beta$    P
21    $p_0$      $p_{-1}$   $\alpha$   $\alpha$
22    $p_0$      $p_{-1}$   $\alpha$   $\beta$
23    $p_0$      $p_{-1}$   $\beta$    $\alpha$
24    $p_0$      $p_{-1}$   $\beta$    $\beta$
25    $p_{-1}$   $p_1$      $\alpha$   $\alpha$   R
26    $p_{-1}$   $p_1$      $\alpha$   $\beta$    R
27    $p_{-1}$   $p_1$      $\beta$    $\alpha$   R
28    $p_{-1}$   $p_1$      $\beta$    $\beta$    R
29    $p_{-1}$   $p_0$      $\alpha$   $\alpha$   R
30    $p_{-1}$   $p_0$      $\alpha$   $\beta$    R
31    $p_{-1}$   $p_0$      $\beta$    $\alpha$   R
32    $p_{-1}$   $p_0$      $\beta$    $\beta$    R
33    $p_{-1}$   $p_{-1}$   $\alpha$   $\alpha$   P
34    $p_{-1}$   $p_{-1}$   $\alpha$   $\beta$
35    $p_{-1}$   $p_{-1}$   $\beta$    $\alpha$   R
36    $p_{-1}$   $p_{-1}$   $\beta$    $\beta$    P

TABLE 5-4 » Allowed Space-Spin Combinations and M Quantum Numbers for a Pair of
p Electrons (Same Subshell)

Microstate                      $m_l(1)$  $m_l(2)$  $m_s(1)$  $m_s(2)$  $M_L$  $M_S$  $M_J$  State term
$p_1p_1\alpha\beta$                1         1        1/2      -1/2       2      0      2    $^1D_2^{2}$
$p_1p_0\alpha\alpha$               1         0        1/2       1/2       1      1      2    $^3P_2^{2}$
$p_1p_0\alpha\beta$                1         0        1/2      -1/2       1      0      1    ($^1D_2^{1}$)
$p_1p_0\beta\alpha$                1         0       -1/2       1/2       1      0      1    ($^3P_2^{1}$)
$p_1p_0\beta\beta$                 1         0       -1/2      -1/2       1     -1      0    ($^3P_2^{0}$)
$p_1p_{-1}\alpha\alpha$            1        -1        1/2       1/2       0      1      1    ($^3P_1^{1}$)
$p_1p_{-1}\alpha\beta$             1        -1        1/2      -1/2       0      0      0    ($^1D_2^{0}$)
$p_1p_{-1}\beta\alpha$             1        -1       -1/2       1/2       0      0      0    ($^3P_1^{0}$)
$p_1p_{-1}\beta\beta$              1        -1       -1/2      -1/2       0     -1     -1    ($^3P_2^{-1}$)
$p_0p_0\alpha\beta$                0         0        1/2      -1/2       0      0      0    $^1S_0^{0}$
$p_0p_{-1}\alpha\alpha$            0        -1        1/2       1/2      -1      1      0    ($^3P_0^{0}$)
$p_0p_{-1}\alpha\beta$             0        -1        1/2      -1/2      -1      0     -1    ($^1D_2^{-1}$)
$p_0p_{-1}\beta\alpha$             0        -1       -1/2       1/2      -1      0     -1    ($^3P_1^{-1}$)
$p_0p_{-1}\beta\beta$              0        -1       -1/2      -1/2      -1     -1     -2    ($^3P_2^{-2}$)
$p_{-1}p_{-1}\alpha\beta$         -1        -1        1/2      -1/2      -2      0     -2    ($^1D_2^{-2}$)

We start, then, by seeking the maximum $M_J$ value in Table 5-4. This is $M_J = 2$, and
it occurs twice (in the first two rows). The first of these goes with $M_L = 2$, $M_S = 0$.
Since these result when L and S are giving their maximum z component, we conclude
that $L = 2$, $S = 0$. This, then, is a member of the $^1D_2$ term. (It is the $^1D_2^{2}$ member of
that term, since $M_J = 2$.) We label this row $^1D_2^{2}$ and proceed to select microstates that
can account for the other four members of this term. Our choice is controlled by the
requirements that (1) the $M_J$ values for the other members must be 1, 0, -1, -2, and
(2) we cannot have an $|M_S|$ value larger than zero or an $|M_L|$ value larger than 2. (That
would be impossible for states resulting from vectors having $L = 2$ and $S = 0$.) Our
selections are indicated in Table 5-4, with parentheses to indicate that these assignments
follow from recognition of the leading member $^1D_2^{2}$. (All are symbolized as $^1D_2$.)

There is some arbitrariness in selecting the “inner” members, for which $|M_J| < J$: The
parenthetical term $^1D_2^{1}$ could just as easily be assigned to $p_1p_0\beta\alpha$ as $p_1p_0\alpha\beta$. (Actually,
neither of these microstates is a correct wavefunction for $^1D_2^{1}$. A linear combination of
them is. But, if we only wish to designate term symbols, we need not worry about this.)

We have accomplished already the identification of a term $^1D_2$ and the elimination of
five microstates from our list. The other microstate having $M_J = 2$ has $M_L = M_S = 1$,
so we know this goes with $L = 1$, $S = 1$ and has the symbol $^3P_2^{2}$. Again, four other
members exist down the table, and we select them, being careful that $|M_L|$ and $|M_S|$
do not exceed 1, while $M_J = 1, 0, -1, -2$.

At this point, we must recognize that we are not through with the $^3P$ family. The
existence of the $^3P$ part of the symbol implies the existence of nine states, but $^3P_2$
accounts for only five of them. The others come from $^3P_1$ and $^3P_0$, resulting from
$M_L = 1$ with $M_S = 0, -1$. (We did not worry about this for the $^1D_2$ term because only
five states are implied by $^1D$.) So we seek the microstates associated with these terms
and label them as shown in the table.

Only one microstate remains. For this $M_L = M_S = 0$, so this is a state labeled $^1S_0^{0}$.

The term symbols for the $p^2$ configuration, then, are $^1D_2$, $^3P_2$, $^3P_1$, $^3P_0$, $^1S_0$, for a
total of 15 states. The energies for these terms are shown in Fig. 5-8. The five terms
fall into three energy groups, since the $^3P$ terms are found to be very close in energy.
High resolution spectra can be used to see the slight energy differences between terms
that appear to be at the same level on the energy scale used in Fig. 5-8. Delving further,
the degenerate energies of microstates in the same term can be made to separate by
imposing a magnetic field (Zeeman effect).
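The term-finding procedure can be automated. The Python sketch below is a variant rather than a transcription of the steps above: instead of labeling individual microstates by $M_J$, it tabulates how many microstates have each ($M_L$, $M_S$) pair and repeatedly removes the block of $(2L + 1)(2S + 1)$ entries belonging to the term with the largest remaining $M_L$. All names are hypothetical, and symbols are printed in flattened form (`3P` for $^3P$).

```python
from collections import Counter
from fractions import Fraction
from itertools import combinations

LETTERS = "SPDFGHIK"
HALF = Fraction(1, 2)

def equivalent_electron_terms(l, n):
    """Return [(L, S), ...] for n equivalent electrons in a subshell of given l."""
    spin_orbitals = [(ml, ms) for ml in range(l, -l - 1, -1) for ms in (HALF, -HALF)]
    table = Counter()
    for micro in combinations(spin_orbitals, n):        # Pauli-allowed, no duplicates
        table[(sum(ml for ml, _ in micro), sum(ms for _, ms in micro))] += 1
    terms = []
    while True:
        left = [key for key, count in table.items() if count > 0]
        if not left:
            break
        L = max(ML for ML, _ in left)                    # largest remaining M_L
        S = max(MS for ML, MS in left if ML == L)        # largest M_S that goes with it
        for ML in range(-L, L + 1):                      # remove one state per (M_L, M_S)
            for k in range(int(2 * S) + 1):
                table[(ML, S - k)] -= 1
        terms.append((L, S))
    return terms

for L, S in equivalent_electron_terms(1, 2):             # the p^2 configuration
    js = [L + S - k for k in range(int(L + S - abs(L - S)) + 1)]
    print(f"{int(2 * S + 1)}{LETTERS[L]}", "with J =", ", ".join(str(J) for J in js))
```

For $p^2$ this recovers $^1D$, $^3P$, and $^1S$ with the J values listed above; the same function can be tried on other subshells, such as $d^2$.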

Based on spectroscopic assignments of energy levels for large numbers of atoms, 
Hund proposed a set of rules enabling one to predict the energy ordering for terms 
associated with equivalent electrons. These rules are, in order of decreasing influence: 


1. Terms having greater spin multiplicity lie lower in energy. 
2. Within each spin multiplicity, terms having greater L lie lower. 


3. Within the same L and S, levels of different J behave oppositely, according to
whether the subshell is more or less than half-filled: If less than half-filled, terms
with lower J lie lower; if more than half-filled, terms with higher J lie lower.


According to these rules, the five levels for carbon in its $1s^22s^22p^2$ configuration
should be, in order of increasing energy, $^3P_0$, $^3P_1$, $^3P_2$ (closely spaced) followed by $^1D_2$,
followed by $^1S_0$. The actual energies (in cm$^{-1}$) are, respectively, 0, 16.4, 43.5, 10194.8,
21647.7 (Fig. 5-8). The order of states for the excited 2p3p and 2p4p configurations
is different. This is not a breakdown of Hund's rules because these are not equivalent- 
electron cases. 

Hund's first rule is the source of the aufbau rule, cited earlier, that each AO of a 
subshell becomes half-filled before any of them become filled with electrons. The 
equivalence of these statements is easily demonstrated (Problem 5-30). 

One can use Hund's rules to find the lowest-energy state term symbol without going
through the tedious microstate process just described. For an atom having an outer
subshell configuration of $p^2$ we would first recognize that we seek maximum S, so the
electrons must have parallel spin, giving $S = 1$. (We use Hund’s most influential rule
first.) Subject to this constraint, we seek maximum L. Since the electrons cannot both
be in $p_1$ with the same spin, $p_1p_0$ is next best, giving maximum $M_L = 1$, so $L = 1$.
$S = 1$, $L = 1$ gives $J = 2, 1, 0$, so we know the corresponding terms are $^3P_2$, $^3P_1$, $^3P_0$.
Since the 2p subshell is less than half-filled, $^3P_0$ is the ground term.
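This shortcut can be sketched in Python for a single open subshell. The function name `hund_ground_term` is hypothetical, and the sketch assumes that the more-than-half-filled case takes the highest J, as the third rule implies. Symbols are printed in flattened form (`3P0` for $^3P_0$).

```python
from fractions import Fraction

LETTERS = "SPDFGHIKLMN"

def hund_ground_term(l, n):
    """Ground term for n equivalent electrons in a subshell of quantum number l,
    obtained by applying Hund's three rules in order."""
    n_orbitals = 2 * l + 1
    if not 0 <= n <= 2 * n_orbitals:
        raise ValueError("a subshell holds at most 2(2l + 1) electrons")
    ml_values = list(range(l, -l - 1, -1))
    # Rule 1: one electron per AO with parallel spin before any pairing.
    unpaired = min(n, 2 * n_orbitals - n)
    S = Fraction(unpaired, 2)
    # Rule 2: occupy AOs from highest m_l downward (first pass, then pairing pass).
    occupied = (ml_values + ml_values)[:n]
    L = abs(sum(occupied))
    # Rule 3: lowest J if at most half-filled, highest J otherwise.
    J = abs(L - S) if n <= n_orbitals else L + S
    return f"{int(2 * S + 1)}{LETTERS[L]}{J}"

print(hund_ground_term(1, 2))   # carbon, 2p^2
print(hund_ground_term(2, 6))   # iron, 3d^6
```

For carbon this gives the $^3P_0$ ground term found above, and for the $3d^6$ subshell of iron it gives a quintet, consistent with Example 5-4.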


5-8 Overview


This chapter describes the new features that appear when we deal with systems having 
more than one electron. One of these features, interelectron repulsion, is easy to 
understand in its manifestation as operators in the hamiltonian and as coulomb repulsion 
integrals, J, in the average energy expression. Another feature, antisymmetry for 
electron exchange and the resulting existence of exchange integrals, K, is unfamiliar
and unintuitive, without a classical counterpart, yet is enormously important in its effect 
on electronic structure. 

In addition to these features, we have noted the importance of recognizing that atomic 
states conserve magnitude and z component of total angular momentum. Using this 
permits us to characterize states in terms of J, L, and S (even though the latter two are 
not “good” quantum numbers) for ground and excited configurations. This is essential 
in atomic spectroscopy (a topic we do not pursue in this book) and also allows one, 
with the assistance of Hund's rules, to predict the energy order for states associated 
with the ground configuration of any atom. 

In closing this chapter, we should emphasize again a point frequently forgotten by
chemists. In the orbital approach to many-electron systems we have a convenient approximation.
This is an imperfect but useful way to describe atomic structure. There are
more accurate ways to approximate eigenfunctions of many-electron hamiltonians, but
these usually involve more difficulty in interpretation. The orbital representation of $\psi$
appears to be the best compromise between accuracy and convenience for most chemical
purposes.


5-8.A Problems 
5-1. Write down the hamiltonian operator for the lithium atom, in a.u. 


5-2. Calculate the values of $\bar{r}_1$ and $\bar{r}_2$ consistent with the He wavefunction $\psi(1, 2) =
1s(1)2s(2)$... (Eq. 5-11).


5-3. Calculate the energy in electron volts of a photon with associated wavelength 
0.1 a.u. Compare this result with the ionization energy in electron volts of the
hydrogen atom in its ground state. Why is this comparison relevant? 


5-4. Show that the wavefunction (5-15) is normalized if the 1s and 2s orbitals are 
orthonormal. 


5-5. Show that the wavefunction (5-16) is antisymmetric with respect to exchange of 
electron coordinates. 


5-6. Show that the wavefunction (5-37) would vanish if 2s were replaced throughout 
by 1s, giving a $1s^3$ configuration.


5-7. Produce a totally antisymmetric wavefunction starting from the configuration 
$1s(1)\alpha(1)\,2p(2)\beta(2)\,1s(3)\beta(3)$. Use the method described for Eq. (5-37) and use
a determinantal function as a check. 


5-8. Set up the integral of the product between $(1s1s2s\alpha\beta\alpha)^*$ and $2s1s1s\alpha\alpha\beta$. (Use
symbols rather than explicit atomic orbital formulas.) Factor the integral into 
a product of integrals over one-electron space functions and one-electron spin 
functions. Indicate the value of each of the resulting six integrals and of their 
product. 


5-9. a) Write down the Slater determinantal wavefunction for the configuration
$1s\,\bar{1s}\,2p_1$.

b) Expand this determinant into a linear combination of products.

c) Write down the nonzero part of expansion (b) when $r_3 = 0$, $r_1 = 1$ a.u., $r_2 = 2$
a.u. [Do not evaluate the expression; just use symbols like $1s(r = 1)$.] Also
write down the nonzero part of (b) when $r_2 = 0$, $r_1 = 1$, $r_3 = 2$, and when
$r_1 = 0$, $r_2 = 2$, $r_3 = 1$. Is there any physical difference between saying “electron
3 is at the nucleus” and saying “an electron is at the nucleus?” Explain.


5-10. Wavefunction (5-38) describes a member of a doublet. Write the wavefunction
for the other member.


5-11. A particle is capable of being in any one of three spin states. Call them $\alpha$, $\beta$,
and $\gamma$. Suppose you have two such particles in a molecule.

a) Write down all the spin functions you can that are symmetric for exchange
of these two particles. (Do not worry about normalization.)

b) Write down all the antisymmetric cases.


5-12. The following wavefunction is proposed for an excited state of the lithium atom

$$\psi = \frac{1}{\sqrt{6}} \begin{vmatrix} 1s(1) & 1s(2) & 1s(3) \\ 2s(1) & 2s(2) & 2s(3) \\ 3s(1) & 3s(2) & 3s(3) \end{vmatrix}$$

Here 1s, 2s, and 3s are eigenfunctions for the Li$^{2+}$ hamiltonian.

a) Does this wavefunction satisfy the Pauli exclusion principle? Explain.

b) Write the exact H for the lithium atom in atomic units.

c) Is $\psi$ an eigenfunction for the exact hamiltonian?

d) If interelectronic repulsion terms are neglected in H, what energy, in a.u., is
associated with $\psi$?

e) What z component of spin and orbital angular momentum (in atomic units)
would you expect for the atom in this state, ignoring any nuclear contribution?


5-13. Write the normalized Slater determinantal wavefunction for beryllium in the
$1s^22s^2$ configuration. Do not expand the determinant.


5-14. Write down the ground state configuration for the fluorine atom. Use Slater's
rules to find the orbital exponents $\zeta = (Z - s)/n$ for 1s and 2s, 2p orbitals.


5-15.
Show that the average value of an operator for a state described by an eigenfunc- 
tion for that operator is identical to the eigenvalue associated with that eigen- 
function. 


Explain briefly the observation that the energy difference between the 
1s?2s! 8/7) state and the 1s?2p! QP1,2) state for Li is 14,904 cm~!, whereas 
for Li?* the 2s! S 1/2) and 2p! Pi /2) states are essentially degenerate. (They 
differ by only 2.4 cm!) 


In Chapter 4 it was stated that the magnitude of the square of the angular momen- 
tum is given by /(/ + 1) a.u., and that z components can be any of the values 
$-l, -l + 1, \ldots, l - 1, l$ a.u. Similar relations hold for spin. From this fact plus
the knowledge that the possible z components of spin angular momentum are
$\pm\frac{1}{2}$ a.u., calculate the length of the spin angular momentum vector.


5-18. It has been shown that, for a single spinning electron, two spin states are possible
having z components of spin angular momentum of +1/2 and −1/2 a.u. For two
unpaired electrons, the state of greatest multiplicity is a triplet ($M_S = +1, 0, -1$).
Show that, in general, the maximum spin multiplicity resulting from n unpaired
electrons equals n + 1.


5-19. You have been shown symbolically that $1s(1)2s(2) \pm 2s(1)1s(2)$ and
$\alpha(1)\beta(2) \pm \beta(1)\alpha(2)$ are symmetric or antisymmetric for exchange of electron labels
(electron coordinates). For a more concrete and familiar example, take two functions:
$f(x) = \exp(x)$ and $g(y) = y^2$. Construct combinations of these functions that
are symmetric and antisymmetric for exchange of x and y coordinates. Set x = 1
and y = 2 and evaluate each function. Now set x = 2 and y = 1 and evaluate
again. Compare results.


5-20. Give all the allowed term symbols for a hydrogen atom (a) in the n = 1 level,
(b) in the n = 2 level. In each case, total up the states to see whether you have
the expected number.


5-21. Consider the following helium atom wavefunction:
ψ = 1s(1) 3d,2(2) α(1) α(2)

a) Is this a satisfactory wavefunction in the sense of meeting general symmetry
conditions resulting from particle indistinguishability and the exclusion
principle? If not, how would you modify it to make it satisfactory?

b) Identify the term to which this state (modified if necessary) belongs.


5-22. How many states exist for the configuration spd?


5-23. A group of related terms has the common symbol ?P. (This is called a term
multiplet.)

a) What are the full term symbols for this multiplet?

b) How many energy levels exist (in the absence of a magnetic field) for this
multiplet?

c) Indicate into how many levels each member of the multiplet splits in the
presence of an external magnetic field.


5-24. Given the following space part of an approximate wavefunction for a Li$^+$ ion:
$(1/\sqrt{2})[1s(1)2p_1(2) + 2p_1(1)1s(2)]$,

a) Write a physically possible spin part for this wavefunction.

b) What energy would this state have (in a.u.) if the $1/r_{12}$ term in $\hat{H}$ did not
exist?

c) What average energy (expressed in terms of symbols like J) would this state
have using the correct $\hat{H}$ (including $1/r_{12}$)?

d) You have not been shown the rules for operating with $\hat{S}^2$, the operator for the
square of total spin angular momentum, but you can nevertheless guess what
the result would be if $\hat{S}^2$ operates on this state function. What is your guess?


5-25. A state in the term $^3D_3$ is described by the wavefunction $\psi$. What is the value of
x in each of the expressions $\mathrm{Op}\,\psi = x\psi$, where Op is as given below? (Assume
L–S coupling to be valid. If more than one x is possible, list them all.) (a) $\hat{L}^2$
(b) $\hat{S}^2$ (c) $\hat{J}^2$ (d) $\hat{L}_z$ (e) $\hat{S}_z$ (f) $\hat{J}_z$.


5-26. Carbon ($1s^2 2s^2 2p^2$) and oxygen ($1s^2 2s^2 2p^4$) have a "symmetrical" relation in
their 2p occupancy: C has one electron less than a half-filled subshell, O has one
electron more. Another way of stating this is to note that C has 2 electrons and
4 holes in its 2p shell, while O has 2 holes and 4 electrons.

a) Show that this leads to the same lowest-energy family (or "multiplet") of term
symbols, $^3P_{2,1,0}$.

b) How do these atoms differ in the energy-ordering of these three terms?

c) Show that this agreement in lowest-energy multiplet terms holds in general
for atoms having this symmetrical occupation relation.


5-27. Predict the ground state term symbol for each of the following atoms.

a) Na ($1s^2 2s^2 2p^6 3s$)

b) P ($1s^2 2s^2 2p^6 3s^2 3p^3$)

c) Ne ($1s^2 2s^2 2p^6$)

d) Ti ($1s^2 2s^2 2p^6 3s^2 3p^6 4s^2 3d^2$)


5-28. Calcium atoms are excited to the [Ar]4s4p configuration.

a) How many states are there?
b) What are the term symbols?


5-29. a) Find all the terms for boron in its ground configuration, $1s^2 2s^2 2p$, and order
these terms according to energy.
b) Repeat for phosphorus, [Ne]$3s^2 3p^3$.


5-30. Explain how Hund's first rule is equivalent to the aufbau rule that degenerate AOs
half-fill with electrons before any are filled, when forming the lowest-energy
state(s).


5-31. How many states exist for each of the following term multiplets?

a) ?D
b) ?F


5-32. For a given electron configuration, are all of the following terms possible?
Explain your reasoning. $^2P_{3/2}$, $^2P_{1/2}$, $^1S_0$.


5-33. How many states exist for each of the following configurations? (a) sd (b) sp
(c) $s^2p$ (d) pd (e) dd (nonequivalent)


5-34. a) How many states are associated with the $^4F$ term multiplet?
b) Write down the term symbols included in this multiplet.


5-35. By inspection, what is the term symbol with the maximum J value we can
have for the configuration sd? What other terms would be included in the same
multiplet?


5-36. Derive a formula for the number of states that exist for two equivalent electrons
in a subshell having degeneracy g. How many states does this predict for $p^2$?
for $d^2$?


5-37. Evaluate the splitting between adjacent lines in Zeeman-split terms $^3D_3$, $^3D_2$,
$^3D_1$, $^1D_2$, when B equals 1 tesla.


Multiple Choice Questions 


(Try to answer these without referring to the text.) 


1. Which one of the following is an acceptable (unnormalized) approximate
wavefunction for a state of the helium atom?


a) $[1s(1)1s(2) - 1s(1)1s(2)]\,\alpha(1)\alpha(2)$

b) $1s(1)1s(2)\,[\alpha(1)\beta(2) + \beta(1)\alpha(2)]$

c) $[1s(1)2s(2) + 2s(1)1s(2)]\,\alpha(1)\alpha(2)$

d) $[1s(1)2s(2) + 2s(1)1s(2)]\,[\alpha(1)\beta(2) - \beta(1)\alpha(2)]$


e) None of the above is acceptable. 


2. The spin multiplicity of an atom in its ground state and having the outer-shell
configuration 4s73d’ is


a) 19 
b) 15 
c) 7 
d) 5 
e) None of the above. 


3. Which one of the following statements is NOT true for the ground state of the helium
atom?


a) The atom's size (measured by rg) is larger than the size of He$^+$ in its 1s state.
b) The ground state is a singlet.

c) The energy of the $2p_0$ orbital is above that of the 2s orbital.

d) The effective nuclear charge seen by both electrons is less than 2. 

e) The atom’s electronic energy is equal to —108.8 eV. 


4. The crudest orbital model for the ground state of He uses the 1s atomic orbitals for
He$^+$, for which Z = 2. Which of the following statements describes correctly the
situation that pertains to a change to a more appropriate value?


a) The improved Z value is larger than 2, and the orbitals become more contracted.

b) The improved Z value is larger than 2, and the orbitals become more expanded.

c) The improved Z value is smaller than 2, and the orbitals become more contracted.

d) The improved Z value is smaller than 2, and the orbitals become more expanded.

e) The improved Z value is smaller than 2, but this only affects the computed energy,
and not orbital size.


Chapter 6


Postulates and Theorems 
of Quantum Mechanics 


6-1 Introduction


The first part of this book has treated a number of systems from a fairly physical view- 
point, using intuition as much as possible. Now, armed with the concepts already 
developed, the reader should be in a better position to understand the more formal foun- 
dation to be described in this chapter. This foundation is presented as a set of postulates. 
From these follow proofs of various theorems. The ultimate test of the validity of the 
postulates comes in comparing the theoretical predictions with experimental data. The 
extra effort required to master the postulates and theorems is repaid many times over 
when we seek to solve problems of chemical interest. 


6-2 The Wavefunction Postulate


We have already described most of the requirements that a wavefunction must satisfy: $\psi$
must be acceptable (i.e., single-valued, nowhere infinite, continuous, with a piecewise
continuous first derivative). For bound states (i.e., states in which the particles lack the
energy to achieve infinite separation classically) we require that $\psi$ be square integrable.
So far we have considered only cases where the state of the system does not vary with
time. For much of quantum chemistry, these are the cases of interest, but, in general, a
state may change with time, and $\psi$ will be a function of t in order to follow the evolution
of the system.
Gathering all this together, we arrive at


Postulate I Any bound state of a dynamical system of n particles is described as
completely as possible by an acceptable, square-integrable function
$\Psi(q_1, q_2, \ldots, q_{3n}, \omega_1, \omega_2, \ldots, \omega_n, t)$, where the q's are spatial
coordinates, $\omega$'s are spin coordinates, and t is the time coordinate. $\Psi^*\Psi\,d\tau$ is
the probability that the space-spin coordinates lie in the volume element
$d\tau\,(= d\tau_1 d\tau_2 \cdots d\tau_n)$ at time t, if $\Psi$ is normalized.


For example, suppose we have a two-electron system in a time-dependent state
described by the wavefunction $\Psi(x_1, y_1, z_1, \omega_1, x_2, y_2, z_2, \omega_2, t)$. The spin coordinates
$\omega$ would each be some combination of spin functions $\alpha$ and $\beta$. If we integrate $\Psi^*\Psi$
over the spin coordinates of both electrons, we are left with a spin-free density function.
Call it $\rho(x_1, y_1, z_1, x_2, y_2, z_2, t) = \rho(v_1, v_2, t)$. We interpret $\rho(v_1, v_2, t)\,dv_1\,dv_2$ as
the probability that electron 1 is in $dv_1$ (i.e., between $x_1$ and $x_1 + dx$, $y_1$ and $y_1 + dy$,
and $z_1$ and $z_1 + dz$) and electron 2 is in $dv_2$ at time t. If we now integrate
over the coordinates of electron 2, we obtain a new density function, $\rho'(v_1, t)$, which
describes the probability of finding electron 1 in various volume elements at various
times regardless of the position of electron 2.


6-3 The Postulate for Constructing Operators


Much of the substance of the second postulate is already familiar. We earlier used
arguments based on de Broglie waves to construct hamiltonian operators. We then
noted that the kinetic energy part of the operators can be identified with a classical
term like $p_x^2/2m$ through the relation $p_x \leftrightarrow (\hbar/i)\,\partial/\partial x$. The potential energy terms
in the hamiltonian operators are completely classical, however. Thus, we could have
constructed the quantum mechanical hamiltonians by writing down the classical energy
expressions in terms of momenta and position, and then replacing every momentum term
by the appropriate partial differential operator. This is an example of the use of part c of:


Postulate II To every observable dynamical variable M there can be assigned a linear
hermitian operator $\hat{M}$. One begins by writing the classical expression,
as fully as possible in terms of momenta and positions. Then:


a) If M is q or t, $\hat{M}$ is q or t. (q and t are space and time coordinates.)


b) If M is a momentum, $p_j$, for the jth particle, the operator is
$(\hbar/i)\,\partial/\partial q_j$, where $q_j$ is conjugate to $p_j$ (e.g., $x_j$ is conjugate to $p_{x_j}$).


c) If M is expressible in terms of the q's, p's and t, $\hat{M}$ is found by
substituting the above operators in the expression for M in such a way
that $\hat{M}$ is hermitian.


The reason for specifying that $\hat{M}$ must be hermitian is that the eigenvalues of a
hermitian operator must be real numbers.¹ We shall discuss this and other aspects of
hermiticity (including its definition) later in this chapter.

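As an explicit example of this procedure, we reconsider the hydrogen atom. Assuming
a fixed nucleus (infinite inertia), the classical expression for the total energy of the
system is

$$E_{\text{classical}} = \frac{1}{2m_e}\left(p_x^2 + p_y^2 + p_z^2\right) - \frac{e^2}{4\pi\varepsilon_0\,(x^2 + y^2 + z^2)^{1/2}}$$

where the first term is just the kinetic energy of the electron and the second term is the
electrostatic potential energy. The coordinate origin is on the nucleus. Application of
postulate II retains the position variables x, y, and z of the potential term unchanged,
but replaces $p_x$ by $(\hbar/i)\,\partial/\partial x$ (with $\hbar = h/2\pi$), etc.:

$$\frac{1}{2m_e}\left(\hat{p}_x^2 + \hat{p}_y^2 + \hat{p}_z^2\right)
= \frac{1}{2m_e}\left[\left(\frac{h}{2\pi i}\frac{\partial}{\partial x}\right)^2
+ \left(\frac{h}{2\pi i}\frac{\partial}{\partial y}\right)^2
+ \left(\frac{h}{2\pi i}\frac{\partial}{\partial z}\right)^2\right]
= \frac{-h^2}{8\pi^2 m_e}\nabla^2$$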


Thus, we arrive at

$$\hat{H} = \frac{-h^2}{8\pi^2 m_e}\nabla^2 - \frac{e^2}{4\pi\varepsilon_0\,(x^2 + y^2 + z^2)^{1/2}}$$

and we are now free to transform $\hat{H}$ to other coordinate systems (such as r, $\theta$, $\phi$) if
we wish.

¹ In this text (and in quantum chemistry in general) a caret indicates an operator and not a unit vector quantity
as in classical physics.


6-4 The Time-Dependent Schrödinger Equation Postulate


We have discussed only cases where neither the hamiltonian $\hat{H}$ nor $\psi$ is time dependent.
In those cases we required that $\psi$ be an eigenfunction of $\hat{H}$. In the more general case
in which $\Psi$ and $\hat{\mathcal{H}}$ are time dependent,² a different requirement is imposed by


Postulate III The state functions (or wavefunctions) satisfy the equation

$$\hat{\mathcal{H}}\Psi = \frac{-\hbar}{i}\frac{\partial \Psi}{\partial t} \tag{6-1}$$

where $\hat{\mathcal{H}}$ is the hamiltonian operator for the system.


We should check to see if this is consistent with the time-independent Schrödinger
equation we have been using. Suppose that the hamiltonian is time independent. Let
us see if a solution to Eq. (6-1) exists when $\Psi(q, t)$ is separated into a product of space-
and time-dependent functions: $\Psi(q, t) = \psi(q) f(t)$. Inserting this into Eq. (6-1) gives

$$\hat{H}\psi(q) f(t) = \frac{-\hbar}{i}\,\psi(q)\,\frac{\partial f(t)}{\partial t} \tag{6-2}$$

Dividing by $\psi(q) f(t)$ gives

$$\frac{\hat{H}\psi(q)}{\psi(q)} = \frac{(-\hbar/i)(\partial/\partial t) f(t)}{f(t)} \tag{6-3}$$

Since each side of Eq. (6-3) depends on a different variable, the two sides must equal
the same constant, which we call E. This gives

$$\hat{H}\psi(q) = E\psi(q) \tag{6-4}$$

and

$$\frac{-\hbar}{i}\frac{d f(t)}{dt} = E f(t) \tag{6-5}$$

The first of these equations is just the time-independent Schrödinger equation we have
been using. The second equation has the solution $f(t) = A\exp(-iEt/\hbar)$. Hence, $f^* f$
equals a constant, and so $\Psi^*\Psi = \psi^*\psi f^* f \propto \psi^*\psi$. Since f has no effect on energy or
particle distribution, we can ignore it in dealing with stationary states. The situation is
analogous to the case of standing waves discussed in Chapter 1.
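
The time factor can be checked numerically. The following is a minimal sketch, not part of the original text: it assumes atomic units (so that ħ = 1), picks the hydrogen 1s energy as an illustrative E, and uses a central difference for the time derivative. The names `f`, `lhs_eq_6_5`, and `check` are hypothetical helpers introduced only for this illustration.

```python
import cmath

HBAR = 1.0   # assumption: atomic units, so hbar = 1
E = -0.5     # illustrative choice: hydrogen 1s energy in a.u.
A = 1.0      # arbitrary constant amplitude


def f(t):
    """Time factor f(t) = A exp(-i E t / hbar), the solution of Eq. (6-5)."""
    return A * cmath.exp(-1j * E * t / HBAR)


def lhs_eq_6_5(t, dt=1e-6):
    """Left side of Eq. (6-5), (-hbar/i) df/dt, by central difference."""
    dfdt = (f(t + dt) - f(t - dt)) / (2 * dt)
    return (-HBAR / 1j) * dfdt


def check(times=(0.0, 0.7, 3.0, 10.0)):
    """Return the largest residual of Eq. (6-5) and the values of f*f."""
    residual = max(abs(lhs_eq_6_5(t) - E * f(t)) for t in times)
    densities = [abs(f(t)) ** 2 for t in times]
    return residual, densities


if __name__ == "__main__":
    residual, densities = check()
    print("largest residual of Eq. (6-5):", residual)
    print("f*f at the sampled times:", densities)
```

The residual should be small (limited by the finite-difference step), and f*f should stay equal to A² at every sampled time, which is why f drops out of the particle distribution for a stationary state.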


² $\hat{\mathcal{H}}$ and $\Psi$ symbolize time dependence; $\hat{H}$ and $\psi$ symbolize time independence.

Note that while we have shown that solutions may exist in which $\Psi$ is separable, this
does not mean that every solution of Eq. (6-1) with $\hat{\mathcal{H}} = \hat{H}$ is separable (i.e., stationary).
We can imagine a situation where a system in a stationary state is suddenly perturbed
to produce a new time-independent hamiltonian. $\Psi$ will change as the system adjusts
to this new situation, giving us a case where the hamiltonian is time-independent (after
the perturbation, at least) but $\Psi$ is not a stationary state function. The way in which $\Psi$
evolves in time is governed by Eq. (6-1).


EXAMPLE 6-1 Show that the average energy for a nonstationary state of the hydrogen
atom is conserved as the system evolves, if $\hat{H}$ is not time-dependent.


SOLUTION ► We choose a simple example:

$$\Psi = \frac{1}{\sqrt{2}}\left[\psi_{1s}\exp(it/2\hbar) + \psi_{2s}\exp(it/8\hbar)\right]$$

where each exponential equals $\exp(-iEt/\hbar)$. Then

$$\langle E \rangle = \int \Psi^* \hat{H} \Psi\,dv
= \frac{1}{2}\int \psi_{1s}^*\exp(-it/2\hbar)\,\hat{H}\,\psi_{1s}\exp(it/2\hbar)\,dv$$

+ the analogous 2s2s term + 1s2s and 2s1s cross terms. Since $\hat{H}$ does not operate on functions
of t, the exponentials in each of the first two integrals can join together, giving exp(0) = 1. So
time-dependence disappears from the first two integrals, and they become respectively equal to $-\frac{1}{2}$ a.u.
and $-\frac{1}{8}$ a.u. Dependence on time does not disappear from the cross-term integrals, but that doesn't
matter because the integration over space gives zero in each case, due to orbital orthogonality.
Thus, $\langle E \rangle = \frac{1}{2}(-\frac{1}{2}\ \text{a.u.}) + \frac{1}{2}(-\frac{1}{8}\ \text{a.u.}) = -\frac{5}{16}$ a.u., which has no time-dependence. ◄
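
The bookkeeping in this example can be sketched in a few lines of code. This is an illustrative sketch only, assuming atomic units (ħ = 1) and orthonormal orbitals, so that the matrix of the hamiltonian in the (1s, 2s) basis is diagonal. The names `coefficients`, `average_energy`, and `cross_term` are hypothetical.

```python
import cmath
import math

HBAR = 1.0                    # assumption: atomic units
ENERGIES = (-0.5, -0.125)     # hydrogen 1s and 2s energies, -1/(2 n^2) a.u.


def coefficients(t):
    """Time-dependent coefficients of psi_1s and psi_2s in Psi(t)."""
    return tuple(cmath.exp(-1j * e * t / HBAR) / math.sqrt(2) for e in ENERGIES)


def average_energy(t):
    """<E> as a double sum over c_j* c_k <psi_j|H|psi_k>.

    Because H psi_k = E_k psi_k and the orbitals are orthonormal,
    <psi_j|H|psi_k> = E_k when j == k and 0 otherwise.
    """
    c = coefficients(t)
    total = 0j
    for j in range(2):
        for k in range(2):
            overlap = 1.0 if j == k else 0.0
            total += c[j].conjugate() * c[k] * ENERGIES[k] * overlap
    return total.real


def cross_term(t):
    """The factor c_1s* c_2s; it keeps its time dependence."""
    c = coefficients(t)
    return c[0].conjugate() * c[1]


if __name__ == "__main__":
    for t in (0.0, 1.0, 10.0):
        print(t, average_energy(t), cross_term(t))
```

The cross-term factor oscillates in time, but it multiplies a zero overlap, so the average energy stays at −5/16 a.u. for every t.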


6-5 The Postulate Relating Measured Values to Eigenvalues


The second postulate indicated that every observable variable of a system (such as 
position, momentum, velocity, energy, dipole moment) was associated with a hermitian 
operator. The connection between the observed value of a variable and the operator is 
given by 


Postulate IV Any result of a measurement of a dynamical variable is one of the eigen- 
values of the corresponding operator. 


Any measurement always gives a real number, and so this postulate requires that 
eigenvalues of the appropriate operators be real. We will prove later that hermitian 
operators satisfy this requirement. 

If we measured the electronic energy of a hydrogen atom (the negative of its
ionization energy), we could get any of the allowed eigenvalues ($-1/2n^2$ a.u.) but no
intermediate value. What if, instead, we measured the distance of the electron from the
nucleus? By postulate II, the operator for this property is just the variable r itself. That
is, $\hat{r} = r$. Hence, we need to consider the eigenvalues of r in the equation

$$r\,\delta(r, \theta, \phi) = \lambda\,\delta(r, \theta, \phi) \tag{6-6}$$

where $\delta$ is an eigenfunction and $\lambda$ is a real number (corresponding to the distance of
the electron from the nucleus). We can rewrite this equation as

$$(r - \lambda)\,\delta(r, \theta, \phi) = 0 \tag{6-7}$$

This form makes it more apparent that the function $\delta$ must vanish at all points in space
except those where $r = \lambda$. But $\lambda$ is an eigenvalue of r and hence is a possible result
of a measurement. Thus, we see that postulate IV implies some connection between
a measurement of, say, r = 2 a.u. and an eigenfunction of r that is finite only at
r = 2 a.u. We symbolize this eigenfunction $\delta(r - 2\ \text{a.u.})$, this "delta function" being
zero whenever the argument is not zero. If we measured the electron's position to be
at r = 5.3 a.u., the corresponding eigenfunction would be $\delta(r - 5.3\ \text{a.u.})$, a function
that is zero everywhere except in a shell of infinitesimal thickness at r = 5.3 a.u. If
instead we measured the point in space of the electron, rather than just the distance
from the nucleus, and found it to be $r_0, \theta_0, \phi_0$, then the corresponding eigenfunction
of the position operator would be $\delta(r - r_0)\,\delta(\theta - \theta_0)\,\delta(\phi - \phi_0)$. This function vanishes
everywhere except at $r_0, \theta_0, \phi_0$.

It is evident that any value of $\lambda$ from zero to infinity in Eq. (6-7) may be chosen
without spoiling the ability of $\delta$ to serve as an eigenfunction of r. This means that,
unlike the energy measurement, the measurement of the distance of the electron from
the nucleus can have any value.

The eigenfunctions of the position operator are called Dirac delta functions. They 
are “spike” functions having infinitesimal width. They are normalized through the 
equation 


$$\int \delta(x - x_0)\,dx = 1 \tag{6-8}$$


where the integration range includes $x_0$.³ On first acquaintance, these functions seem
mathematically peculiar, but they make physical sense in the following way. One can
interpret the actual measurement of position as a process that forces the particle to
acquire a certain position at some instant. At that instant, $\Psi^2$ for the system (now
perturbed by the measuring process) ought to give unit probability for finding the
particle at that point (where it definitely is) and zero probability elsewhere, and this is
just what the Dirac delta function does.⁴

Postulate IV, then, is in accord with a picture wherein the process of measurement 
forces the measured system into an eigenstate for the appropriate operator, giving the 
corresponding eigenvalue as the measurement. This definition of “measurement” is 
somewhat restrictive and can be deceptive. Often scientists refer to measurements 
that are really measurements of average values rather than eigenvalues. This point is 
discussed further below. 


³ The reader should avoid confusing the Dirac delta function $\delta(x - x_0)$ with the Kronecker delta $\delta_{i,j}$ encountered
earlier. They are similar in that both vanish unless $x = x_0$ in the former and i = j in the latter. But they differ
in that the value of $\delta_{i,j}$ is definite (unity) while the value of $\delta(x_0 - x_0)$ is not defined. The Dirac delta function
has definite value only in integrated expressions like Eq. (6-8). The spin functions $\alpha$ and $\beta$ may be thought of as
Dirac delta functions in the spin "coordinate" $\omega$. The Dirac delta function is admittedly unusual, and one tends to
be uneasy with it at first. This function is important and useful in quantum mechanics. However, since we will
make almost no use of it in this text, we will not develop the topic further.

⁴ Notice that Eq. (6-8) does not involve $\delta^*\delta$, but merely $\delta$. Because $\delta$ is nonzero only at one point, $\delta^*\delta$ is likewise
nonzero only at the same point. $\delta$ and $\delta^*\delta$ are therefore not independent functions. It is convenient to view the
$\delta$ function as both the eigenfunction for the position operator and also as the probability distribution function for
the particle.


6-6 The Postulate for Average Values


Suppose that we had somehow prepared a large number of hydrogen atoms so that they 
were all in the same, known, stationary state. Then we could measure the distance of 
the electron from the nucleus once in each atom and average these measurements to 
obtain an average value. We have already indicated that this average would be given 
by the sum of all the r values, each multiplied by its frequency of occurrence, which is 
given by $\psi^*\psi\,dv$ if $\psi$ is normalized. Since r is a continuous variable, the sum becomes
an integral. This is the content of 


Postulate V When a large number of identical systems have the same state function
$\psi$, the expected average of measurements on the variable M (one
measurement per system) is given by

$$M_{\text{av}} = \int \psi^* \hat{M} \psi\,d\tau \Big/ \int \psi^* \psi\,d\tau \tag{6-9}$$

The denominator is unity if $\psi$ is normalized.

It is important to understand the distinction between average value and eigenvalue
as they relate to measurements. A good example is the dipole moment. The dipole
moment operator for a system of n charged particles is $\hat{\mu} = \sum_{i=1}^{n} z_i \mathbf{r}_i$, where $z_i$ is the
charge on the ith particle and $\mathbf{r}_i$ is its position vector with respect to an arbitrary origin.
(We get this by writing the classical formula and observing that momentum terms
do not appear. Hence, the quantum-mechanical operator is the same as the classical
expression.) What will the eigenfunctions and eigenvalues of $\hat{\mu}$ be like?

The charge $z_i$ is only a number, while $\mathbf{r}_i$ is a position operator, which has Dirac
delta functions as eigenfunctions. For a hydrogen atom, one eigenfunction of $\mathbf{r}_i$ would
be a delta function at r = 1 a.u., $\theta = 0$, $\phi = 0$. The corresponding eigenvalue for $\hat{\mu}$
would be the dipole moment obtained when a proton and an electron are separated by
1 a.u., clearly a finite number. But "everybody knows" that an unperturbed atom in a
stationary state has zero dipole moment. The difficulty is resolved when we recognize
that measurement of a variable in postulates IV and V means measuring the value of
a variable at a given instant. Hence, we must distinguish between the instantaneous
dipole moment of an atom, which can have any value from among the eigenvalues of
$\hat{\mu}$, and the average dipole moment, which is zero for the atom. In everyday scientific
discussion, the term "dipole moment" is usually understood to refer to the average
dipole moment. Indeed, the usual measurements of dipole moment are measurements
that effectively average over many molecules or long times (in atomic terms) or both.


6-7 Hermitian Operators


Let $\phi$ and $\psi$ be any square-integrable functions and $\hat{A}$ be an operator, all having the
same domain. $\hat{A}$ is defined to be hermitian if

$$\int \psi^* \hat{A} \phi\,dv = \int \phi\,\hat{A}^* \psi^*\,dv \tag{6-10}$$


The integration is over the entire range of each spatial coordinate. Recall that the
asterisk signifies reversal of the sign of i in a complex or imaginary term. The hermitian
property has important consequences in quantum chemistry.

As an example of a test of an operator by Eq. (6-10), let us take $\psi$ and $\phi$ to
be square-integrable functions of x and $\hat{A}$ to be $i(d/dx)$. Then the left-hand side of
Eq. (6-10) becomes, upon integration by parts,

$$\int_{-\infty}^{+\infty} \psi^*\left(i\,\frac{d\phi}{dx}\right)dx
= \underbrace{i\psi^*\phi\,\Big|_{-\infty}^{+\infty}}_{0}
- i\int_{-\infty}^{+\infty} \phi\,\frac{d\psi^*}{dx}\,dx
= -i\int_{-\infty}^{+\infty} \phi\,\frac{d\psi^*}{dx}\,dx \tag{6-11}$$

Since $\psi$ and $\phi$ are square integrable, they (and their product) must vanish at infinity,
giving the zero term in Eq. (6-11). We now write out the right-hand side of Eq. (6-10):

$$\int_{-\infty}^{+\infty} \phi\left(i\,\frac{d}{dx}\right)^{*}\psi^*\,dx
= -i\int_{-\infty}^{+\infty} \phi\,\frac{d\psi^*}{dx}\,dx \tag{6-12}$$

where the minus sign comes from carrying out the operation indicated by the asterisk.
Equation (6-12) is equal to Eq. (6-11), and so the operator $i(d/dx)$ is hermitian.
Since the effect of i was to introduce a necessary sign reversal, it is apparent that the
equality would not result for $\hat{A} = d/dx$. Clearly, any hermitian operator involving a
first derivative in any Cartesian coordinate must contain the factor i. The operators for
linear momenta (Chapter 2) are examples of this.
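
The test of Eq. (6-10) can also be carried out numerically. The sketch below is an illustration under stated assumptions, not the text's own procedure: the two trial functions are arbitrary complex, Gaussian-damped (hence square-integrable) choices, derivatives are taken by central difference, and integrals use the trapezoidal rule on a finite interval wide enough that both functions have decayed. All names are hypothetical.

```python
import cmath


def psi(x):
    """Arbitrary square-integrable trial function (assumed for illustration)."""
    return cmath.exp(-x * x) * (1 + 0.5j * x)


def phi(x):
    """A second, different square-integrable trial function."""
    return cmath.exp(-(x - 0.5) ** 2 + 2j * x)


def derivative(func, x, h=1e-5):
    """Central-difference estimate of d(func)/dx."""
    return (func(x + h) - func(x - h)) / (2 * h)


def integrate(func, a=-10.0, b=10.0, n=4000):
    """Composite trapezoidal rule on [a, b]."""
    step = (b - a) / n
    total = 0.5 * (func(a) + func(b))
    for k in range(1, n):
        total += func(a + k * step)
    return total * step


def both_sides(prefactor):
    """Left and right sides of Eq. (6-10) for the operator prefactor * d/dx."""
    lhs = integrate(lambda x: psi(x).conjugate() * prefactor * derivative(phi, x))
    rhs = integrate(lambda x: phi(x) * (prefactor * derivative(psi, x)).conjugate())
    return lhs, rhs


if __name__ == "__main__":
    print("i d/dx:", both_sides(1j))    # the two sides should agree
    print("  d/dx:", both_sides(1.0))   # the two sides should differ in sign
```

For the prefactor i the two sides should agree to within numerical error, while for d/dx alone they come out with opposite signs, which is the sign reversal that the factor i repairs.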

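It is important to realize that Eq. (6-10) does not imply that $\psi^*\hat{A}\phi = \phi\hat{A}^*\psi^*$.
A simple example will make this clearer. Let $\hat{A}$ be the hydrogen atom hamiltonian,
$\hat{H} = -\frac{1}{2}\nabla^2 - 1/r$, and let $\phi$ be the 1s eigenfunction: $\phi = (1/\sqrt{\pi})\exp(-r)$. Also, let
$\psi = \sqrt{8/\pi}\,\exp(-2r)$, which is not an eigenfunction of $\hat{H}$. Then, since $\hat{H}\phi = -\frac{1}{2}\phi$,

$$\psi^*\hat{H}\phi = -\tfrac{1}{2}\psi^*\phi \tag{6-13}$$

But

$$\phi\hat{H}^*\psi^* = \phi\left[-\frac{1}{2}\frac{1}{r^2}\frac{d}{dr}r^2\frac{d}{dr} - \frac{1}{r}\right]\sqrt{8/\pi}\,\exp(-2r) \tag{6-14}$$

$$= \phi\,[(1/r) - 2]\sqrt{8/\pi}\,\exp(-2r) = [(1/r) - 2]\,\psi^*\phi \tag{6-15}$$

(Since $\psi$ has no $\theta$ or $\phi$ dependence, the parts of $\hat{H}^*$ that include $\partial/\partial\theta$ and $\partial/\partial\phi$ have
been omitted in Eq. (6-14).) Here we have two functions, $-\frac{1}{2}\psi^*\phi$ and $[(1/r) - 2]\psi^*\phi$.
They are obviously different. However, by Eq. (6-10), their integrals are equal since
$\hat{H}$ is hermitian.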
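
A quick numerical check of this statement is sketched below. It is an illustration only: the integrals are done over r with the volume element 4πr² dr by a midpoint rule, and the cutoff radius and number of steps are assumptions chosen so the exponentials have decayed. The function names are hypothetical.

```python
import math


def phi(r):
    """Hydrogen 1s eigenfunction in atomic units."""
    return math.exp(-r) / math.sqrt(math.pi)


def psi(r):
    """Normalized trial function sqrt(8/pi) exp(-2r); not an eigenfunction of H."""
    return math.sqrt(8 / math.pi) * math.exp(-2 * r)


def integrand_613(r):
    """psi* H phi from Eq. (6-13)."""
    return -0.5 * psi(r) * phi(r)


def integrand_615(r):
    """phi H* psi* from Eq. (6-15)."""
    return (1 / r - 2) * psi(r) * phi(r)


def radial_integral(func, r_max=20.0, n=200000):
    """Midpoint-rule integral of func(r) * 4 pi r^2 dr from 0 to r_max.

    The midpoint rule never evaluates func at r = 0, where 1/r is singular.
    """
    step = r_max / n
    total = 0.0
    for k in range(n):
        r = (k + 0.5) * step
        total += func(r) * 4 * math.pi * r * r
    return total * step


if __name__ == "__main__":
    print("integrands at r = 1:", integrand_613(1.0), integrand_615(1.0))
    print("integrals:", radial_integral(integrand_613), radial_integral(integrand_615))
```

The two integrands differ point by point, yet both integrals should come out equal; analytically each is $-4\sqrt{8}/27$ a.u.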


6-8 Proof That Eigenvalues of Hermitian Operators Are Real


Let $\hat{A}$ be a hermitian operator with a square-integrable eigenfunction $\psi$. Then

$$\hat{A}\psi = a\psi \tag{6-16}$$

Each side of Eq. (6-16) must be expressible as a real and an imaginary part. The real
parts must be equal to each other and so must the imaginary parts. Taking the complex
conjugate of Eq. (6-16) causes the imaginary parts to reverse sign, but they remain
equal. Therefore, we may write

$$\hat{A}^*\psi^* = a^*\psi^* \tag{6-17}$$

We multiply Eq. (6-16) from the left by $\psi^*$ and integrate over all spatial variables:

$$\int \psi^*\hat{A}\psi\,dv = a\int \psi^*\psi\,dv \tag{6-18}$$

Similarly, we multiply Eq. (6-17) from the left by $\psi$ and integrate:

$$\int \psi\hat{A}^*\psi^*\,dv = a^*\int \psi\psi^*\,dv \tag{6-19}$$

Since $\hat{A}$ is hermitian, the left-hand sides of Eqs. (6-18) and (6-19) are equal by definition
(Eq. 6-10). Therefore, the right-hand sides are equal, and their difference is zero:

$$(a - a^*)\int \psi^*\psi\,dv = 0 \tag{6-20}$$

Since $\psi$ is square integrable the integral cannot be zero. Therefore, $a - a^*$ is zero,
which requires that a be real.


6-9 Proof That Nondegenerate Eigenfunctions of a
Hermitian Operator Form an Orthogonal Set


Let $\psi$ and $\phi$ be two square-integrable eigenfunctions of the hermitian operator $\hat{A}$:

$$\hat{A}\psi = a_1\psi \tag{6-21}$$

$$\hat{A}^*\phi^* = a_2\phi^* \tag{6-22}$$

Multiplying Eq. (6-21) from the left by $\phi^*$ and Eq. (6-22) from the left by $\psi$, and
integrating gives

$$\int \phi^*\hat{A}\psi\,dv = a_1\int \phi^*\psi\,dv \tag{6-23}$$

$$\int \psi\hat{A}^*\phi^*\,dv = a_2\int \psi\phi^*\,dv \tag{6-24}$$

The left sides of Eqs. (6-23) and (6-24) are equal by (6-10), and

$$(a_1 - a_2)\int \phi^*\psi\,dv = 0 \tag{6-25}$$

If $a_1 \neq a_2$, the integral vanishes. This proves that nondegenerate eigenfunctions are
orthogonal.

EXAMPLE 6-2 It has been shown (Section 6-7) that $i(d/dx)$ is a hermitian operator.
We know that it has eigenfunctions $\exp(\pm ikx)$ with eigenvalues $\mp k$, which are
real. So far, so good. However, this operator also has eigenfunctions $\exp(\pm kx)$,
with eigenvalues $\pm ik$, which are imaginary. This appears to violate the proof
that eigenvalues of hermitian operators are real. Explain why neither of these
eigenfunction sets is covered by the proof of Section 6-8, and how one of them
manages to obey the rule anyway.


SOLUTION ► The test for hermiticity requires that $i\psi^*\phi\,|_{-\infty}^{+\infty} = 0$. If $\phi$ is $\psi$, and if $\psi$ is
square-integrable, this condition is satisfied, because $i\psi^*\psi$ vanishes at $\pm\infty$, giving 0 − 0 = 0. But
neither of the exponential functions given above is square-integrable: They are both unequal to
zero at $\pm\infty$, so they both fall outside of the proof as given. Despite this, $\exp(\pm ikx)$ does have
real eigenvalues, leading us to look more closely. Is it the case that $i\psi^*\psi\,|_{-\infty}^{+\infty} = 0$ for this set of
functions, even though they do not vanish at infinity? It is indeed, since $\psi^*\psi = 1$, giving i − i = 0
for this term. Thus we see that our requirement that $\psi$ be square integrable is more restrictive than
what is necessary, namely that $i\psi^*\psi\,|_{-\infty}^{+\infty} = 0$. Note that the other set of exponentials, $\exp(\pm kx)$,
leads to $i\psi^*\psi = i\exp(\pm 2kx)$, which does not produce a value of zero when values at $x = \infty$ and
$x = -\infty$ are subtracted. Note also that the functions $\exp(\pm ikx)$ are orthogonal for different values
of k, whereas the functions $\exp(\pm kx)$ are not. ◄


The point of the above example is that all of our proofs about eigenvalues or
eigenfunctions of hermitian operators refer to cases where the eigenfunctions satisfy
the requirement that $i\psi^*\psi\,|_{-\infty}^{+\infty} = 0$. Square-integrability guarantees this, but some
nonsquare-integrable sets of functions can satisfy it too. A hermitian operator can have
eigenfunctions that are associated with complex or imaginary eigenvalues, but these
must result from eigenfunctions that do not satisfy the requirement.
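
The boundary requirement can be illustrated with a short sketch. It is only an illustration: the infinite limits are replaced by a finite symmetric interval of half-width L (an assumption made so the term can be evaluated), and the helper names are hypothetical.

```python
import cmath


def boundary_term(psi, half_width):
    """i psi* psi evaluated at +half_width minus its value at -half_width."""
    upper = psi(half_width).conjugate() * psi(half_width)
    lower = psi(-half_width).conjugate() * psi(-half_width)
    return 1j * (upper - lower)


def plane_wave(k):
    """exp(ikx): not square-integrable, but psi* psi = 1 everywhere."""
    return lambda x: cmath.exp(1j * k * x)


def real_exponential(k):
    """exp(kx): psi* psi = exp(2kx), which differs at the two ends."""
    return lambda x: cmath.exp(k * x)


if __name__ == "__main__":
    for half_width in (1.0, 10.0, 50.0):
        print(half_width,
              boundary_term(plane_wave(1.3), half_width),
              boundary_term(real_exponential(0.2), half_width))
```

For exp(ikx) the term vanishes for every L, so taking the limit causes no trouble; for exp(kx) it is nonzero and grows with L, which is why that set escapes the proof and is free to have imaginary eigenvalues.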


6-10 Demonstration That All Eigenfunctions of a Hermitian Operator
May Be Expressed as an Orthonormal Set


If $a_1 = a_2$, Eq. (6-25) is satisfied even when the integral is finite. Therefore, degenerate
eigenfunctions need not be orthogonal. But they must be linearly independent or 
else they are the self-same function (to within a multiplicative constant), and if they 
are linearly independent, they can be converted to an orthogonal pair. Hence, it is 
always possible to express the degenerate eigenfunctions of a hermitian operator as 
an orthogonal set (and, as we have just proved, it is necessary that nondegenerate 
eigenfunctions be orthogonal). Furthermore, the functions must be square integrable, 
hence normalizable. In general, then, we are able to assume that all of the eigenfunctions
of a hermitian operator can be expressed as an orthonormal set. 

One way to orthogonalize two nonorthogonal, linearly independent functions (which
may or may not be eigenfunctions) will now be demonstrated. Let the functions be $\psi$
and $\phi$ (assumed normalized) and the integral of their product have the value $S$:

$$\int \psi^* \phi \, dv = S \tag{6-26}$$

We keep one of the functions unchanged, say $\psi$, and let $\phi' \equiv \phi - S\psi$ be our new second
function. $\psi$ and $\phi'$ are orthogonal since

$$\int \psi^* \phi' \, dv = \int \psi^* \phi \, dv - S \int \psi^* \psi \, dv = S - S \cdot 1 = 0 \tag{6-27}$$


(The new function $\phi'$ needs to be renormalized.) This process, known as Schmidt
orthogonalization, may be generalized and applied sequentially to any number of lin- 
early independent functions. 


EXAMPLE 6-3 Two normalized 1s AOs are located on nearby nuclei, A and B, and
overlap each other enough so that $\int 1s_A 1s_B \, dv = 0.500$. Construct a function from
these two that is orthogonal to $1s_A$ and is normalized.


SOLUTION » $1s_B' = 1s_B - 0.5 \cdot 1s_A$ is orthogonal to $1s_A$. It is not yet normalized because

$$\int (1s_B')^2 \, dv = \int \left(1s_B^2 + 0.25 \cdot 1s_A^2 - 2 \cdot 0.5 \cdot 1s_A 1s_B\right) dv = 1 + 0.25 - 2 \cdot 0.5 \cdot 0.5 = 0.75 = \frac{3}{4}.$$

So the normalized function we seek is $\frac{2}{\sqrt{3}}(1s_B - 0.5 \cdot 1s_A)$. ◀
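
The Schmidt step used in this example can be checked numerically. The following is a minimal sketch, assuming real, normalized functions $\psi$ and $\phi$ that are known only through their overlap $S$; the helper names `schmidt_pair` and `inner` are illustrative and do not come from the text. Each function is represented by its pair of coefficients in the ($\psi$, $\phi$) basis.

```python
import math

def schmidt_pair(overlap):
    """Coefficients (a, b) of the normalized function a*psi + b*phi that is
    orthogonal to psi, for real normalized psi, phi with <psi|phi> = overlap."""
    if abs(overlap) >= 1.0:
        raise ValueError("psi and phi are linearly dependent")
    # phi' = phi - S*psi has squared norm 1 - S**2, so divide by its square root.
    scale = 1.0 / math.sqrt(1.0 - overlap ** 2)
    return (-overlap * scale, scale)

def inner(u, v, overlap):
    """Integral of the product of two real combinations u = (a, b), v = (c, d)."""
    a, b = u
    c, d = v
    return a * c + b * d + (a * d + b * c) * overlap

S = 0.5                 # overlap of the two 1s AOs in Example 6-3
psi = (1.0, 0.0)        # the unchanged function (here 1s_A)
new = schmidt_pair(S)   # orthogonalized, renormalized partner

print("coefficients on (1s_A, 1s_B):", new)
print("overlap with 1s_A:", inner(psi, new, S))
print("norm of new function:", inner(new, new, S))
```

For $S = 0.5$ the coefficient on $1s_B$ is $2/\sqrt{3}$ and the coefficient on $1s_A$ is $-1/\sqrt{3}$, in agreement with the result of the example.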


6-11 Proof That Commuting Operators Have
Simultaneous Eigenfunctions


$\hat{A}$ and $\hat{B}$ are commuting operators if, for the general square-integrable function $f$,
$\hat{A}\hat{B}f = \hat{B}\hat{A}f$. This can be written $(\hat{A}\hat{B} - \hat{B}\hat{A})f = 0$, which requires that $\hat{A}\hat{B} - \hat{B}\hat{A} = \hat{0}$.
($\hat{0}$ is called the null operator. It satisfies the equation $\hat{0}f = 0$.) This difference of
operator products is called the commutator of $\hat{A}$ and $\hat{B}$ and is usually symbolized⁵ by
$[\hat{A}, \hat{B}]$. If the commutator $[\hat{A}, \hat{B}]$ vanishes, then $\hat{A}$ and $\hat{B}$ commute.

We will now prove an important property of commuting operators, namely, that they
have "simultaneous" eigenfunctions (i.e., that a set of eigenfunctions can be found for
one of the operators that is also an eigenfunction set for the other operator). Let $\beta_i$ be
the eigenfunctions for $\hat{B}$: $\hat{B}\beta_i = b_i\beta_i$. For the moment, assume all the numbers $b_i$ are
different (i.e., the eigenfunctions $\beta_i$ are nondegenerate). Let $[\hat{A}, \hat{B}] = 0$. Then

$$\hat{B}(\hat{A}\beta_i) = \hat{A}\hat{B}\beta_i = \hat{A}b_i\beta_i = b_i(\hat{A}\beta_i) \tag{6-28}$$

The parentheses emphasize that the function obtained by operating on $\beta_i$ with $\hat{A}$ is
an eigenfunction of $\hat{B}$ with eigenvalue $b_i$. But that function can only be a constant
times $\beta_i$ itself. Hence, for nondegenerate $\beta_i$ we have that $\hat{A}\beta_i = c\beta_i$, and so $\beta_i$ is an
eigenfunction of $\hat{A}$. This proves that the nondegenerate eigenfunctions for one operator
will also be eigenfunctions for any other operators that commute with it.


⁵ Other less common conventions are $[\hat{A}, \hat{B}]_-$ and $(\hat{A}, \hat{B})$.

If $\beta_i$ is degenerate with other functions $\beta_{i,k}$, then we can only go so far as to say that
$\hat{A}\beta_i = \sum_k c_k \beta_{i,k}$, for this general linear combination is an eigenfunction of $\hat{B}$ having
eigenvalue $b_i$. But if this is so, then $\beta_i$ is evidently not necessarily an eigenfunction of
$\hat{A}$. We shall not prove it here, but it is possible to show that one can always find some
linear combinations of $\beta_{i,k}$ to produce a set of new functions, $\beta_i'$, that are eigenfunctions
of $\hat{A}$ (and remain eigenfunctions of $\hat{B}$ as well). Therefore we can state that, if $\hat{A}$
and $\hat{B}$ commute, there exists a set of functions that are eigenfunctions for $\hat{A}$ and $\hat{B}$
simultaneously.

An example of this property occurred in the particle-in-a-ring system described 
in Chapter 2. The hamiltonian and angular momentum operators commute for that 
system. There we found one set of functions, the trigonometric functions, that are 
eigenfunctions for $\hat{H}$ but not for $\hat{L}_z$. But by mixing the energy-degenerate sines and
cosines we produced exponential functions that are eigenfunctions for both of these 
operators. 

Another example concerns the familiar symmetry operations for reflection, rotation,
etc. If one of these operations, symbolized $\hat{R}$, commutes with the hamiltonian, then
we should expect there to be a set of eigenfunctions for $\hat{H}$ that are simultaneously
eigenfunctions for $\hat{R}$. It was proved in Chapter 2 that this means that nondegenerate
eigenfunctions must be symmetric or antisymmetric with respect to $\hat{R}$.

A symmetry operator that leaves $\hat{H}$ unchanged can be shown to commute with $\hat{H}$.
That is, if $\hat{R}\hat{H} = \hat{H}$, then $\hat{R}\hat{H}f = \hat{H}\hat{R}f$, where $f$ is any function. To show this, let $\hat{R}$
be, say, a reflection operator. Then $\hat{R}$ operates on functions and operators to its right
by reflecting the appropriate coordinates: $\hat{R}f(q) = f(\hat{R}q)$. If $\hat{H}$ is invariant under
reflection $\hat{R}$, then $\hat{H}(q) = \hat{H}(\hat{R}q)$, and it follows that $\hat{R}\hat{H}(q)f(q) = \hat{H}(\hat{R}q)f(\hat{R}q) =
\hat{H}(q)f(\hat{R}q) = \hat{H}(q)\hat{R}f(q)$, and so $\hat{R}\hat{H}f = \hat{H}\hat{R}f$. We shall formally develop the
ramifications of symmetry in quantum chemistry in Chapter 13.

The existence of simultaneous eigenfunctions for various operators has important 
ramifications for the measurement of a system's properties. This is discussed in 
Section 6-15. 


6-12 Completeness of Eigenfunctions of a 
Hermitian Operator 


In Chapter 3 we discussed the concept of completeness in connection with the power
series expansion of a function. Briefly, a series of functions⁶ $\{\phi_i\}$ having certain restrictions
(e.g., all derivatives vary smoothly) is said to be complete if an arbitrary function
$f$ having the same restrictions can be expressed in terms of the series⁷

$$f = \sum_i c_i \phi_i \tag{6-29}$$


⁶ A symbol in braces is frequently employed to represent an entire set of functions.

⁷ Equation (6-29) is overly restrictive in that it requires that the function and the series have identical values at
every point, whereas it is possible for them to disagree at points of zero measure. However, at the level of this
book, we can ignore this distinction and use Eq. (6-29) without encountering difficulty.

Proofs exist that certain hermitian operators corresponding to observable properties
have eigenfunctions forming a complete set in the space of well-behaved (continuous,
single-valued, square-integrable) functions. These proofs are difficult and will not be
given here. Instead we shall introduce


Postulate VI The eigenfunctions for any quantum mechanical operator corresponding 
to an observable variable constitute a complete set. (Furthermore, we 
have seen in Section 6-10 that we can assume that this set has been made 
orthonormal.) 


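We will now use this property to investigate further the nature of the average value of
an operator. Let the system be in some state $\psi$ (normalized), not an eigenfunction of $\hat{M}$.
However, $\hat{M}$ possesses eigenfunctions $\{\mu_i\}$ that must form a complete set. Therefore,
we can express $\psi$ in terms of $\mu_i$'s:

$$\psi = \sum_i c_i \mu_i \tag{6-30}$$

Now we calculate the average value of $\hat{M}$ for the state $\psi$:

$$M_{av} = \int \psi^* \hat{M} \psi \, dv = \int \sum_i c_i^* \mu_i^* \hat{M} \sum_j c_j \mu_j \, dv = \sum_i \sum_j c_i^* c_j \int \mu_i^* \hat{M} \mu_j \, dv \tag{6-31}$$

But $\hat{M}\mu_i = m_i\mu_i$, and so

$$M_{av} = \sum_i \sum_j c_i^* c_j \int \mu_i^* m_j \mu_j \, dv = \sum_i \sum_j c_i^* c_j m_j \int \mu_i^* \mu_j \, dv \tag{6-32}$$

But we are assuming that $\{\mu_i\}$ is an orthonormal set, and so

$$M_{av} = \sum_i \sum_j c_i^* c_j m_j \delta_{ij} = \sum_i c_i^* c_i m_i \tag{6-33}$$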
i j i 

What does this expression mean? Each measurement of the property corresponding
to $\hat{M}$ must give one of the eigenvalues $m_i$ (postulate IV) and the average of many
such measurements must be $M_{av}$. Equation (6-33) states how the individual measurements
must be weighted to give the average, so it follows that each $c_i^* c_i$ is a measure
of the relative frequency for observing the corresponding $m_i$. Putting it another
way, the absolute squares of the mixing coefficients in Eq. (6-30) give the probabilities
that a measurement of the variable $M$ will give the corresponding eigenvalue.
For example, if $\psi$ happens to be equal to $(1/\sqrt{2})\mu_1 + (1/\sqrt{2})\mu_3$, it follows that
$M_{av} = (1/2)m_1 + (1/2)m_3$.


EXAMPLE 6-4 What is the average value for the z-component of orbital angular
momentum for the normalized function $\phi = (1/\sqrt{5})(\psi_{2s} + 2\psi_{2p_{+1}})$?


SOLUTION » Since we know that the $2s$ and $2p_{+1}$ eigenfunctions have z-components of angular
momentum of 0 and +1 respectively, we can say at once that $L_{z,av} = \frac{1}{5} \cdot 0 + \frac{4}{5} \cdot 1 = 0.8$ a.u.
◀
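
The weighting rule of Eq. (6-33) is simple to apply by machine. The sketch below is illustrative only: the function name `measurement_statistics` is hypothetical, and it assumes the state has already been expanded in an orthonormal set of eigenfunctions of the operator of interest, so that only the mixing coefficients and the eigenvalues are needed.

```python
def measurement_statistics(coefficients, eigenvalues):
    """Return (probabilities, average) for a normalized expansion in
    orthonormal eigenfunctions, following Eq. (6-33)."""
    probabilities = [abs(c) ** 2 for c in coefficients]
    if abs(sum(probabilities) - 1.0) > 1e-9:
        raise ValueError("expansion coefficients are not normalized")
    average = sum(p * m for p, m in zip(probabilities, eigenvalues))
    return probabilities, average

# The mixture (1/sqrt 5)(2s + 2 * 2p+1) with L_z eigenvalues 0 and +1 (a.u.)
coeffs = [1 / 5 ** 0.5, 2 / 5 ** 0.5]
lz_values = [0.0, 1.0]
probs, lz_average = measurement_statistics(coeffs, lz_values)

print("probabilities:", probs)
print("average L_z (a.u.):", lz_average)
```

The probabilities come out as 1/5 and 4/5, and the average as 0.8 a.u., to within floating-point rounding.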


⁸ See, e.g., Kemble [1, Section 25].


6-13 The Variation Principle


Many of the calculations of quantum chemistry are based on the Rayleigh-Ritz variation
principle which states: For any normalized, acceptable function $\phi$,

$$\bar{H} = \int \phi^* \hat{H} \phi \, d\tau \geq E_0 \tag{6-34}$$

where $E_0$ is the lowest eigenvalue of $\hat{H}$.
This statement is easily proved. We expand $\phi$ in terms of $\{\psi_i\}$, the complete,
orthonormal set of eigenfunctions of $\hat{H}$:

$$\phi = \sum_i c_i \psi_i \tag{6-35}$$

As in the preceding section, this leads to

$$\int \phi^* \hat{H} \phi \, d\tau = \sum_i c_i^* c_i E_i \tag{6-36}$$

Now $c_i^* c_i$ is never negative, and so Eq. (6-36) is merely a weighted average of the
eigenvalues $E_i$. Such an average can never be lower than the lowest contributing
member and the principle is proved.

The variation principle is sometimes stated in an equivalent way by saying that the
average value of $\hat{H}$ over $\phi$ is an upper bound for the lowest eigenvalue of $\hat{H}$. Following
the approach of the example at the end of the previous section, if $\phi$ for a hydrogen atom
happens to be a function equal to $(1/\sqrt{2})\psi_{1s} + (1/\sqrt{2})\psi_{2s}$, the average energy for $\phi$
is $(1/2)E_{1s} + (1/2)E_{2s}$, which obviously lies above the lowest eigenvalue $E_{1s}$.
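
The weighted-average argument can be illustrated with a small numerical experiment. This is a sketch under stated assumptions, not a general proof: it considers only trial functions built from the first five bound $ns$ states of hydrogen, uses the atomic-unit energies $E_n = -1/(2n^2)$, and draws random complex mixing coefficients. The function names are hypothetical.

```python
import random

def hydrogen_energy(n):
    """Bound-state energy of hydrogen in atomic units."""
    return -0.5 / n ** 2

def average_energy(coefficients):
    """Weighted average of E_n as in Eq. (6-36); coefficients[0] multiplies
    the n = 1 state. Dividing by the norm allows unnormalized input."""
    weights = [abs(c) ** 2 for c in coefficients]
    total = sum(w * hydrogen_energy(n) for n, w in enumerate(weights, start=1))
    return total / sum(weights)

random.seed(0)
lowest = hydrogen_energy(1)
trial_energies = []
for _ in range(1000):
    coeffs = [complex(random.gauss(0, 1), random.gauss(0, 1)) for _ in range(5)]
    trial_energies.append(average_energy(coeffs))

equal_mix = average_energy([2 ** -0.5, 2 ** -0.5])  # the 1s/2s example above

print("lowest eigenvalue:", lowest)
print("lowest trial average:", min(trial_energies))
print("equal 1s/2s mixture:", equal_mix)
```

No trial average falls below $E_{1s} = -0.5$ a.u., and the equal $1s$/$2s$ mixture gives $-0.3125$ a.u.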


6-14 The Pauli Exclusion Principle


We have already discussed the Pauli exclusion principle in Chapter 5. In its most
general form, this is:


Postulate VII $\psi$ must be antisymmetric (symmetric) for the exchange of identical
fermions (bosons).


6-15 Measurement, Commutators, and Uncertainty


If we measure the exact position of the electron in a hydrogen atom, we force it into 
a state having a Dirac delta function as its wavefunction. Since this function is also 
an eigenfunction for the dipole moment operator, it follows that we also know the 
(instantaneous) dipole moment for the atom at that instant. In effect, measuring position 
measures dipole moment too. But the delta function is not an eigenfunction for the 
hamiltonian operator of the atom, and so we have not simultaneously measured the 
electronic energy of the atom. 

We have earlier seen that an eigenfunction for one operator can serve also as eigen- 
function for another operator when the operators commute. In the above example, the 
operators for position and dipole moment commute with each other but not with the
hamiltonian operator. This leads us to recognize that we can simultaneously measure 
values for two variables only if their operators commute. 

Let us consider this situation more deeply by imagining two successive measurements
on a hydrogen atom, one immediately following the other. If we first measure
position and find $r = 2.0$ a.u., and then measure dipole moment, we will get the value
($\mu = 2.0$ a.u.) corresponding to the electron being at $r = 2.0$ a.u. That is where we found
it in the first measurement, and it has not had time to move elsewhere before the second
measurement. If we immediately follow with yet another position measurement, the
electron will still be found at $r = 2$ a.u. (We are imagining that no time elapses between
measurements, which is a limit we cannot actually achieve. In the present case, though,
since measurement of $r$ is also a measure of $\mu$, both measurements are done at once, so
this is really not a problem.) Hence, it makes sense to say that we know these two values
"simultaneously." However, if we first measure position and then measure energy, we
find something very different. Suppose that we find $r = 2$ a.u. and then, in a subsequent
measurement, $E = -1/2$ a.u. ($E$ must, after all, be an eigenvalue of $\hat{H}$, according
to postulate IV.) We know that the eigenfunction during the first measurement was
$\delta(r - 2\ \text{a.u.})$, and that during the second measurement was a 1s AO. If we immediately
do yet another position measurement, we can find any value of $r$ (with probabilities
given by $4\pi r^2 \psi_{1s}^2 \, dr$). The processes of measuring position and energy are incompatible
in the sense that there is no single function that can describe the situation that exists
during both measurements. The energy-measuring process can be pictured as forcing a
reconstruction of the wavefunction in such a manner that it no longer corresponds to a
particular position, while measurement of position forces a state function that does not
correspond to a particular energy. (In this case, separate measurements would really
be necessary, so the impossibility of doing a second measurement truly immediately
after the first must be recognized. Indeed, one has to allow for the fact that finding an
electron in one place and then at some other place must imply a lapse of time permitting
the electron to travel.)

The reader may suspect that there is some connection between commutators and the
uncertainty principle, and this is indeed the case. It can be shown⁹ that the product
of widths of simultaneous measurements (i.e., the uncertainty in their values) of two
variables satisfies the relation

$$\Delta a \cdot \Delta b \geq \frac{1}{2} \left| \int \psi^* [\hat{A}, \hat{B}] \psi \, d\tau \right| \tag{6-37}$$

where $\psi$ is normalized, and the absolute value $|X|$ is defined as the positive square
root of $X^*X$. If $\hat{A}$ and $\hat{B}$ are conjugate variables, such as position and momentum,
Eq. (6-37) becomes $\Delta a \cdot \Delta b \geq \hbar/2$, which is Heisenberg's uncertainty relation. If $\hat{A}$
and $\hat{B}$ commute, the right-hand side of Eq. (6-37) vanishes, and the values of both
variables may, in theory, be simultaneously known exactly.

Among the properties of greatest interest in molecular quantum mechanics are
energy, symmetry, and electron orbital angular momentum because, for many
molecules, some of these operators commute. Thus, if we know that an oxygen
molecule is in a nondegenerate stationary electronic state, we know that it is possible
to characterize that state by a definite value of the orbital angular momentum along
the internuclear axis. Also, we know that the wavefunction must be symmetric or
antisymmetric for inversion through the molecular midpoint.

⁹ See Merzbacher [2, Section 10-5].


6-16 Time-Dependent States 


Much of quantum chemistry is concerned with stationary states, for which $\Psi$ is a
product of a space term $\psi$ (an eigenfunction of $\hat{H}$) and a time-dependent factor
$\exp(-iEt/\hbar)$, which we usually ignore because it has no effect on particle probability
distribution. Sometimes, however, it becomes necessary to consider time-dependent
states. In this section we illustrate how some of these may be treated.

There are two types of situation to distinguish. One is situations where the potential 
is changing as a function of time, and hence the hamiltonian operator is time dependent. 
An example is a molecule or atom in a time-varying electromagnetic field. The other 
is situations where the potential and hamiltonian operator do not change with time, 
but the particle is nonetheless in a nonstationary state. An example is a particle that is 
known to have been forced into a nonstationary state by a measurement of its position. 
We deal here with the second category. 

As our first example, consider a particle in a one-dimensional box with infinite walls.
Suppose that we measure the particle's position and find it in the left side of the box
(i.e., between $x = 0$ and $L/2$; we will be more specific shortly) at some instant that
we take to be $t = 0$. We are interested in knowing what this implies about a future
measurement of the particle's position.

Our knowing that the particle is on the left side at $t = 0$ means that the wavefunction
for this state is not one of the time-independent box eigenfunctions we saw in Chapter 2,
because those all predict equal probabilities for finding the particle on the two sides
of the box. If the state function is not stationary, it must be time dependent, and it
must satisfy Schrödinger's time-dependent equation (6-1). We have, then, that the state
function is time dependent, and that $\Psi^*\Psi = |\Psi|^2$ is zero everywhere on the right side
of the box when $t = 0$. (We have not yet been specific enough to describe $|\Psi|^2$ in detail
on the left side of the box.)

Schrödinger's time-dependent equation (6-1) is not an eigenvalue equation. However,
Eq. (6-2) shows that Schrödinger's time-dependent equation is satisfied by
time-independent eigenfunctions of $\hat{H}$ if they are multiplied by their time-dependent factors
$f(t) = \exp(-iEt/\hbar)$. Furthermore, Eq. (6-2) continues to be satisfied if the term
$\psi(q)f(t)$ is replaced by a sum of such terms. (See Problem 6-9.) This means that
we can seek to express the time-dependent state function, $\Psi(x, t)$, as a sum of
time-independent box eigenfunctions as long as each of these is accompanied by its time
factor $f(t)$. When $t = 0$, all the factors $f(t)$ equal unity, so at that point in time $\Psi$
becomes the same as the sum of box eigenfunctions without their time factors.

Our strategy, then, is to find a linear combination of time-independent box eigenfunctions,
$\psi_n$, that describes $\Psi$ when $t = 0$. This is easy to do because the time factors
are all equal to unity. Once we have found the proper mixture of $\psi_n$, we multiply each
by its time factor and then observe the behavior of $|\Psi|^2$ as $t$ increases.

In the case of our particle-in-a-box example, we can start with a very simple approximation
to $\Psi$ at $t = 0$ by taking a 50-50 mixture of $\psi_1$ and $\psi_2$:

$$\Psi(x, t) = (1/\sqrt{2})\left[\psi_1 \exp(-iE_1t/\hbar) + \psi_2 \exp(-iE_2t/\hbar)\right] \tag{6-38}$$

Figure 6-1 » Stationary eigenfunctions (n = 1, 2) for the particle in a box and their normalized sum.


We have included the functions $f(t)$, even though they equal unity when $t = 0$, because
they are needed to make $\Psi(x, t)$ a solution to Schrödinger's equation (6-1) and because
they will inform us of the nature of $\Psi$ at later times.

We choose this pair of functions because, when $t = 0$, both are positive on the left,
but they differ in sign on the right, giving us some cancellation there. (See Fig. 6-1.)
Obviously, we have not succeeded in describing a function that has no probability
density on the right, but we already have a definite imbalance in that direction. (It is
not difficult to see that some $\psi_3$ with a positive coefficient should help remove much
of the remaining probability density on the right.)

Now we are in a position to examine this $|\Psi|^2$ as time progresses, that is, the time evolution
of the square of a wavepacket that describes the probability distribution for a particle
that is known to have been in the left half of the box at $t = 0$. This is mathematically
straightforward (Problem 6-20) and leads to the probability distributions sketched in
Fig. 6-2 after time steps of $\Delta t$. The figure shows a changing distribution suggestive of
the particle bouncing back and forth in the box with a cycle time of $8\Delta t$. It is not difficult
to see why this happens. $\Psi_1$ and $\Psi_2$ have different "frequency factors" $\exp(-iE_nt/\hbar)$,
so they behave like two waves oscillating at different frequencies. Since $E_2 = 4E_1$
(recall $E \propto n^2$ in the box), $\Psi_2$ oscillates four times faster than $\Psi_1$. This means that, by
the time $\Psi_1$ has made half a cycle (and is equal to $-1$ times its starting coordinates),
$\Psi_2$ has made two cycles and is just as it was at $t = 0$. It is easy to see from Fig. 6-1
that this will give a $\Psi$ that is skewed to the right, leading to the distribution shown in
Fig. 6-2(e). (This allows us to conclude that $4\Delta t$ equals 1/2 of the cycle time of $\Psi_1$.
See Problem 6-21.)
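
The back-and-forth motion of the probability distribution can be followed numerically. The sketch below is illustrative and rests on assumptions not made in the text: units with $\hbar = m = 1$, a box of length $L = 1$ (so that $E_n = n^2\pi^2/2$), and a simple midpoint-rule integration. It evaluates the probability of finding the particle in the left half of the box at nine equally spaced times covering one repeat period of $|\Psi|^2$, taken here as $2\pi\hbar/(E_2 - E_1)$.

```python
import cmath
import math

L = 1.0  # box length; hbar = m = 1 (assumed units)

def psi(n, x):
    """Stationary box eigenfunction."""
    return math.sqrt(2.0 / L) * math.sin(n * math.pi * x / L)

def energy(n):
    """Box energy n^2 pi^2 hbar^2 / (2 m L^2) in the assumed units."""
    return (n * math.pi / L) ** 2 / 2.0

def density(x, t):
    """|Psi(x, t)|^2 for the 50-50 mixture of Eq. (6-38)."""
    total = sum(psi(n, x) * cmath.exp(-1j * energy(n) * t) for n in (1, 2))
    return abs(total / math.sqrt(2.0)) ** 2

def probability_left(t, steps=2000):
    """Midpoint-rule integral of the density over 0 <= x <= L/2."""
    dx = (L / 2.0) / steps
    return sum(density((k + 0.5) * dx, t) for k in range(steps)) * dx

period = 2.0 * math.pi / (energy(2) - energy(1))  # repeat time of |Psi|^2
for k in range(9):
    t = k * period / 8.0
    print(f"t = {k}/8 period: P(left) = {probability_left(t):.4f}")
```

With these assumptions the left-half probability is $1/2 + (4/3\pi)\cos[(E_2 - E_1)t/\hbar]$, so it starts near 0.92, passes through 0.50 a quarter of a period later, reaches about 0.08 at half a period, and then returns.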

If we want a more accurate starting representation for the localized particle, we must
mix together a larger number of stationary-state wavefunctions. In order to decide
how much of each is needed, we must have a better-defined description of $\Psi$ at $t = 0$.

Figure 6-2 » $|\Psi(x, t)|^2$ from Eq. (6-38) as it appears at various times. Panels (a) through (e)
correspond to $t = 0$ and $8\Delta t$, $\Delta t$ and $7\Delta t$, $2\Delta t$ and $6\Delta t$, $3\Delta t$ and $5\Delta t$, and $4\Delta t$, respectively.

Figure 6-3 » A normalized half sine wave in the left half of a "box." The numbers at left are values
of $\Psi$, not of $E$.


Suppose, for instance, we choose to describe the starting wavefunction $\Psi(x, 0)$ as a
normalized half sine wave in the left side of the box and zero at the right (Fig. 6-3).
Then we can calculate the amount ($c_n$) of each of the stationary-state functions $\psi_n$
present in this function as follows:

$$c_n = \int_0^L \psi_n \Psi(x, 0) \, dx \tag{6-39}$$

This follows from the completeness¹⁰ and orthonormality of $\{\psi_n\}$. (See Problem 6-4.)
Evaluation of Eq. (6-39) for the first few terms gives (Problem 6-22)

$$\Psi(x, 0) = 0.600\psi_1 + 0.707\psi_2 + 0.360\psi_3 + 0.000\psi_4 - 0.086\psi_5 + \cdots \tag{6-40}$$

This modifies slightly our earlier commonsense combination and also verifies our prediction
that $\psi_3$ times a positive coefficient would be beneficial.
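
The expansion coefficients of Eq. (6-39) can be evaluated by straightforward numerical integration. The sketch below is a minimal illustration under stated assumptions: a box of length $L = 1$, eigenfunctions $\psi_n = \sqrt{2/L}\sin(n\pi x/L)$, a starting function $\sqrt{4/L}\sin(2\pi x/L)$ on the left half and zero on the right, and a midpoint rule. The function names are hypothetical.

```python
import math

L = 1.0  # box length (assumed)

def box_eigenfunction(n, x):
    return math.sqrt(2.0 / L) * math.sin(n * math.pi * x / L)

def initial_state(x):
    """Normalized half sine wave on the left half of the box, zero on the right."""
    if x <= L / 2.0:
        return math.sqrt(4.0 / L) * math.sin(2.0 * math.pi * x / L)
    return 0.0

def coefficient(n, steps=20000):
    """Midpoint-rule evaluation of Eq. (6-39)."""
    dx = L / steps
    total = 0.0
    for k in range(steps):
        x = (k + 0.5) * dx
        total += box_eigenfunction(n, x) * initial_state(x)
    return total * dx

coefficients = [coefficient(n) for n in range(1, 6)]
for n, c in enumerate(coefficients, start=1):
    print(f"c_{n} = {c:+.4f}")
print("sum of squares of first five:", sum(c * c for c in coefficients))
```

With these sign conventions the computed values are approximately 0.600, 0.707, 0.360, 0.000, and −0.086. The sum of their squares is already close to unity, which indicates that the first few eigenfunctions carry most of the starting function.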


¹⁰ Because $\Psi(x, 0)$ has a discontinuous derivative at the midpoint of the box, it falls outside the class for which
$\{\psi_n\}$ is complete. However, because this problem is restricted to $dx$ around one point, it should have little effect.

This example illustrates the basic approach to such problems:

1. Find a function that represents the initial particle distribution $\Psi(x, 0)$.


2. Expand that function as a series of eigenfunctions for the hamiltonian, and include
the time-dependent factor for each term.


3. Evaluate the probability distribution at other times $t$ by examining $|\Psi(x, t)|^2$.


As a second example, suppose one were considering the behavior of the electronic
state immediately after a tritium atom emits a beta particle to become a helium ion:
${}^3_1\mathrm{H} \rightarrow {}^3_2\mathrm{He}^+ + {}^{\;0}_{-1}e$. A crude analysis could be attempted by imagining that the nuclear
charge suddenly changes from 1 to 2 a.u., and the orbiting electron (not the beta particle)
suddenly finds itself in a state (the original 1s state) that is not an eigenfunction for the
new hamiltonian. We would accordingly set $\Psi(t = 0)$ to be the 1s AO of hydrogen and
expand this in terms of He$^+$ eigenfunctions. Only s-type AOs could contribute because
of symmetry. The coefficients are given by

$$c_n = \int_{\text{all space}} \psi_{1s}(Z = 1)\,\psi_{ns}(Z = 2) \, dv \tag{6-41}$$

and the time-dependent wavefunction is (in a.u., where $E_n = -2/n^2$ for He$^+$)

$$\Psi(r, \theta, \phi, t) = c_1\psi_{1s}(Z = 2)\exp(2it) + c_2\psi_{2s}(Z = 2)\exp(it/2) + c_3\psi_{3s}(Z = 2)\exp(2it/9) + \cdots \tag{6-42}$$

This function could be evaluated at various times $t$ and would be found to give an
oscillating spherical distribution, as though the electron cloud were shrinking, then
rebounding to its original distance, then shrinking again, etc.
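
The first two coefficients of Eq. (6-41) can be estimated by radial integration. The following sketch is illustrative and rests on assumptions not stated in the text: standard hydrogenlike $1s$ and $2s$ functions in atomic units, with the $2s$ function chosen positive near the nucleus, and a midpoint rule on a radial grid truncated at 40 a.u. The function names are hypothetical.

```python
import math

def psi_1s(r, Z):
    """Hydrogenlike 1s orbital in atomic units."""
    return math.sqrt(Z ** 3 / math.pi) * math.exp(-Z * r)

def psi_2s(r, Z):
    """Hydrogenlike 2s orbital in atomic units (positive near the nucleus)."""
    return math.sqrt(Z ** 3 / (32.0 * math.pi)) * (2.0 - Z * r) * math.exp(-Z * r / 2.0)

def overlap(f, g, r_max=40.0, steps=40000):
    """Integral of f*g over all space for spherically symmetric f and g."""
    dr = r_max / steps
    total = 0.0
    for k in range(steps):
        r = (k + 0.5) * dr
        total += f(r) * g(r) * 4.0 * math.pi * r * r
    return total * dr

c1 = overlap(lambda r: psi_1s(r, 1), lambda r: psi_1s(r, 2))
c2 = overlap(lambda r: psi_1s(r, 1), lambda r: psi_2s(r, 2))

print("c1 =", round(c1, 4), " probability =", round(c1 ** 2, 4))
print("c2 =", round(c2, 4), " probability =", round(c2 ** 2, 4))
```

Under these assumptions the integrals can also be done analytically, giving $c_1 = 16\sqrt{2}/27 \approx 0.838$ and $c_2 = -1/2$, so that roughly 70% of the probability is associated with the He$^+$ 1s state and 25% with the 2s state.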

Our next example is perhaps the most important: It is a particle initially localized in
some region of space, say by measurement of its position, and free to move anywhere
thereafter. Taking the one-dimensional case, we imagine that the particle has been
detected around $x = 0$ at $t = 0$ (the measurement caused it to be "unfree" for an instant).
We assume that the average momentum of the particle is zero. We seek to know how
the probability distribution function for the particle will evolve in time.

As before, we need a functional description of the wavefunction at $t = 0$, $\Psi(x, 0)$.
We will then expand that in terms of eigenfunctions of the free-particle hamiltonian.
As we have seen in Section 2-5, the free-particle eigenfunctions may be written
$\exp(\pm i\sqrt{2mE}\,x/\hbar)$, where $E$ is any nonnegative number. These are also eigenfunctions
for the momentum operator, with eigenvalues $\pm\sqrt{2mE}$.

The function usually selected to describe $\Psi(x, 0)$ is a gaussian function:

$$\Psi(x, 0) = (2\sigma/\pi)^{1/4} \exp(-\sigma x^2) \tag{6-43}$$


The constant $\sigma$ affects the width of the gaussian and reflects our degree of certainty
in our knowledge of position. Large $\sigma$ gives a tight function and small uncertainty
$\Delta x$. The relationship between the gaussian function in $x$ and the coefficients of the
eigenfunctions as a function of $\sqrt{2mE}/\hbar$ is depicted in Fig. 6-4. Remarkably, the
coefficient values are also described by a gaussian function (in $\sqrt{2mE}/\hbar$). Furthermore,
the tighter the gaussian function is in $x$, the broader the corresponding gaussian function
is in $\sqrt{2mE}/\hbar$ (Problem 6-24). That is, we need to combine free-particle eigenfunction
contributions from a wider range of momenta to create a tighter position function. This
means that greater certainty in position goes with greater uncertainty in momentum, in
accord with the uncertainty principle.

Figure 6-4 » (a and c) Gaussian wave packets describing particles found to be at $x = 0$ with differing
degrees of certainty. (b and d) Values of $c_k$ (where $k = \sqrt{2mE}/\hbar$) for momentum eigenfunctions that
combine to express the gaussian wave packets to their left. (a, b) corresponds to relatively certain
position and relatively uncertain momentum, whereas (c, d) corresponds to the opposite situation.
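
The reciprocal relation between the two gaussian widths can be checked numerically. The sketch below is illustrative only and makes several assumptions: units with $\hbar = 1$ (so that momentum equals $k$), the normalized gaussian $(2\sigma/\pi)^{1/4}\exp(-\sigma x^2)$ as the starting function, coefficients obtained as $c(k) = (2\pi)^{-1/2}\int \Psi(x, 0)\exp(-ikx)\,dx$, and a midpoint grid from −12 to 12 in both $x$ and $k$. All function names are hypothetical.

```python
import cmath
import math

HALF_WIDTH = 12.0  # grid spans -12..12 in both x and k (assumed wide enough)
STEPS = 400

def grid():
    step = 2.0 * HALF_WIDTH / STEPS
    return [-HALF_WIDTH + (j + 0.5) * step for j in range(STEPS)], step

def gaussian_packet(x, sigma):
    return (2.0 * sigma / math.pi) ** 0.25 * math.exp(-sigma * x * x)

def momentum_coefficient(k, sigma):
    """Coefficient of exp(ikx) in the packet, by midpoint-rule integration."""
    xs, dx = grid()
    total = sum(gaussian_packet(x, sigma) * cmath.exp(-1j * k * x) for x in xs)
    return total * dx / math.sqrt(2.0 * math.pi)

def spread(values, weights):
    """Root-mean-square width of a distribution sampled on a uniform grid."""
    norm = sum(weights)
    mean = sum(v * w for v, w in zip(values, weights)) / norm
    variance = sum((v - mean) ** 2 * w for v, w in zip(values, weights)) / norm
    return math.sqrt(variance)

def widths(sigma):
    points, _ = grid()
    delta_x = spread(points, [gaussian_packet(x, sigma) ** 2 for x in points])
    delta_k = spread(points, [abs(momentum_coefficient(k, sigma)) ** 2 for k in points])
    return delta_x, delta_k

results = {sigma: widths(sigma) for sigma in (0.5, 2.0)}
for sigma, (delta_x, delta_k) in results.items():
    print(sigma, round(delta_x, 4), round(delta_k, 4), round(delta_x * delta_k, 4))
```

Raising $\sigma$ from 0.5 to 2.0 halves the width in $x$ and doubles the width in $k$, while the product of the two widths stays at 1/2, the minimum allowed by the uncertainty relation when $\hbar = 1$.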

Once we have the appropriate mixture of momentum eigenfunctions, each with its
time-dependent term, we can follow the time evolution of the particle wave packet.
We find that the packet spreads out more and more about $x = 0$ as time passes, which
means that our knowledge of position is decreasing as time passes. Even though the
average position is not changing, the probability for finding the particle at a distance
from $x = 0$ is increasing.

We can interpret this by remembering that the square of the wavefunction predicts
the results of many experiments. In each of many position measurements finding the
particle near $x = 0$, we impart some degree of momentum to the particle. Then, in
a second measurement, we find that some of the particles have moved away from
$x = 0$. The longer we wait before taking the second measurement, the greater this
spread in $x$ values. (Our assumption of zero average momentum amounts to saying that
large deviations from zero momentum are equally likely for motion toward $x = +\infty$
and $-\infty$.) The more precise our first position measurement is, the greater the likelihood
of introducing momenta quite different from zero and the more rapidly the wave packet
spreads out as time passes.

In the first example, a packet located in half of a one-dimensional box, we saw $|\Psi|^2$
oscillate back and forth, changing shape in the process so that the motion cannot be
described with a single frequency. A related important case is that of an oscillator
moving in a harmonic potential. Let us assume the oscillator's position at $t = 0$ to be
described by a gaussian wavefunction. If this function is not centered at the oscillator's
equilibrium position, we have a time-dependent situation analogous to the above
particle-in-a-box case. For a harmonic potential, it can be shown¹¹ that $|\Psi|^2$ remains
a gaussian function as time passes (i.e., does not change shape), and that the center
of this gaussian oscillates back and forth about the equilibrium position with the classical
frequency. This is a situation of interest to spectroscopists because it bears on
the process of electronically exciting a sample of diatomic molecules. Suppose the
molecules are initially in their ground electronic and ground vibrational states. Then
their vibrational wavefunction is a simple gaussian function (the lowest-energy harmonic
oscillator wavefunction) centered at $R_e$ (i.e., not off-center). If the molecules
are excited by a laser pulse to a new electronic state having an equilibrium internuclear
distance of $R_e'$, the vibrational wavefunction at $t = 0$ is still the simple gaussian centered
at $R_e$, which means that it is now off-center. As time passes, the center of this
function oscillates back and forth about $R_e'$ in a coherent manner (i.e., describable with
a single frequency). Thus, we have gone from an initial state describing an ensemble
of molecules vibrating about $R_e$ with zero-point energy $h\nu/2$ and with random
phases (i.e., a time-independent state wherein such measurable properties as average
molecular dipole moment appear to be constant) to a final state where the molecules are
vibrating in phase about $R_e'$ (a time-dependent state, wherein one might expect to see
time variation of such properties). If $R_e$ were quite a bit smaller than $R_e'$, for instance,
then almost all the molecules would find themselves to be "too short" at $t = 0$, "too
long" a short time later, etc., as they swing in phase about $R_e'$. Because the behavior is
this simple, it is possible to time a second laser pulse to strike the
molecules when they are almost all at their shortest, or all at their longest, extension.
Of course, in real molecules the potential is not exactly harmonic. Furthermore, phase
coherence is eventually lost due to collisions. So the second pulse must come very
soon after the first one.


6-17 Summary 


Some of the postulates and proofs described in this chapter are most important for what 
follows in this book, and we list these points here. 


1. $\psi$ describes a state as completely as possible and must meet certain mathematical
requirements (single-valued, etc.). $\psi^*\psi$ is the probability density distribution
function for the system.


2. For any observable variable, there is an operator (hermitian) which is constructed
from the classical expression according to a simple recipe. (Operators related to
"spin" are the exception because the classical analog does not exist.) The eigenvalues
for such an operator are the possible values we can measure for that quantity. The act
of measuring the quantity forces the system into a state described by an eigenfunction
of the operator. Once in that state, we may know exact values for other quantities
only if their operators commute with the operator associated with our measurement.


¹¹ See Schiff [3, pp. 67, 68], and Tanner [4].


3. If the hamiltonian operator for a system is time independent, stationary eigenfunctions
exist of the form $\psi(q, \omega)\exp(-iEt/\hbar)$. The time-dependent exponential does
not affect the measurable properties of a system in this state and is almost always
completely ignored in any time-independent problem.


4. The formula for the quantum-mechanical average value [Eq. (6-9)] is equivalent to
the arithmetic average of all the possible measured values of a property times their
frequency of occurrence [Eq. (6-33)]. This means that it is impossible to devise a
function that satisfies the general conditions on $\psi$ and leads to an average energy
lower than the lowest eigenvalue of $\hat{H}$.


5. The square-integrable eigenfunctions for an operator corresponding to an observable
quantity form a complete set, which may be assumed orthonormal. The eigenvalues 
are all real. 


6. Any operation that leaves $\hat{H}$ unchanged also commutes with $\hat{H}$.


7. Wavefunctions describing time-dependent states are solutions to Schrödinger's
time-dependent equation. The absolute square of such a wavefunction gives a particle
distribution function that depends on time. The time evolution of this particle distribution
function is the quantum-mechanical equivalent of the classical concept of a
trajectory. It is often convenient to express the time-dependent wave packet as a linear
combination of eigenfunctions of the time-independent hamiltonian multiplied
by their time-dependent phase factors.


6-17.A Problems 
6-1. Prove that $d^2/dx^2$ is hermitian. 


6-2. Integrate the expressions in Eqs. (6-13) and (6-15) to show that their integrals are 
equal. 


6-3. Prove that, if a normalized function is expanded in terms of an orthonormal set of 
functions, the sum of the absolute squares of the expansion coefficients is unity. 


6-4. Show that a particular coefficient $c_i$ in Eq. (6-30) is given by $c_i = \int \mu_i^* \psi \, dv$. 


6-5. A particle in a ring is in a state with wavefunction $\psi = (1/\sqrt{\pi})\cos(2\phi)$. 

a) Calculate the average value for the angular momentum by evaluating 
$\int \psi^* \hat{L}_z \psi \, d\phi$, where $\hat{L}_z = (\hbar/i)\, d/d\phi$. (Use symmetry 
arguments to evaluate the integral.) 

b) Express $\psi$ as a linear combination of exponentials and evaluate the average 
value of the angular momentum using the formula $L_{z,\mathrm{av}} = \sum_i c_i^* c_i L_{z,i}$, 
where $L_{z,i}$ is the eigenvalue for the $i$th exponential function. 


6-6. Using Eq. (6-37), show that $\Delta x \cdot \Delta p_x \geq \hbar/2$. 


6-7. What condition must the function $\phi$ satisfy for the equality part of $\geq$ to hold in 
Eq. (6-34)? 


6-8. Suppose you had an operator and a set of eigenfunctions for it that were associated 
with real eigenvalues. Does it necessarily follow that the operator is hermitian 
as defined by Eq. (6-10)? [Hint: Consider $d/dr$ and the set of all functions 
$\exp(-ar)$, where $a$ is real and positive definite.] 


6-9. a) Show that the nonstationary state having wavefunction 

$$\Psi = (1/\sqrt{2})\,\psi_{1s}\exp(it/2) + (1/\sqrt{2})\,\psi_{2p_z}\exp(it/8)$$

is a solution to Schrödinger's time-dependent equation when $\hat{H}$ is the 
time-independent $\hat{H}$ of the hydrogen atom. Use atomic units (i.e., $\hbar = 1$). 

b) This time-dependent state is dipolar and oscillates with a characteristic 
frequency $\nu$. Show that $\nu$ satisfies the relation $E_2 - E_1 = \Delta E = 2\pi\nu$ in a.u. 
(The dipole oscillates at the same frequency as that of light required to drive 
the 1s $\leftrightarrow$ 2p transition. This is central to the subject of spectroscopy.) 


6-10. From the definition that $\phi' = \phi - S\psi$ [see the discussion following Eq. (6-26)], 
evaluate the normalizing constant for $\phi'$, assuming that $\phi$ and $\psi$ are normalized. 


6-11. Given the two normalized nonorthogonal functions $(1/\sqrt{\pi})\exp(-r)$ and 
$\sqrt{1/(3\pi)}\, r\exp(-r)$, construct a new function $\phi$ that is orthogonal to the first 
function and lies within the function space spanned by these two functions, and 
is normalized. 


6-12. If the hydrogen atom 1s AO is expanded in terms of the He$^+$ AOs, what is the 
coefficient for (a) the He$^+$ 1s AO? (b) the He$^+$ $2p_0$ AO? 


6-13. The lowest-energy eigenfunction for the one-dimensional harmonic oscillator is 
$\psi_{n=0} = (\beta/\pi)^{1/4}\exp(-\beta x^2/2)$. 

a) Demonstrate whether or not momentum is a constant of motion (i.e., is a 
"sharp" quantity) for this state. 
b) Calculate the average momentum for this state. 


6-14. Demonstrate whether $x^2\, d/dx$ and $x\, d^2/dx^2$ commute. What about $x\, d/dx$ and 
$x^2\, d^2/dx^2$? 


6-15. Evaluate the following integrals over all space. In neither case should you need 
to do this by brute force. 


a) f Gd.) E? 3d.) dv 
b) f (34) L: Gd.) dv 
6-16. The operators for energy and angular momentum for an electron constrained to 
move in a ring of constant potential are, respectively, in a.u., $-(1/2)\, d^2/d\phi^2$ and 
$(1/i)\, d/d\phi$. 

a) Discuss whether or not there should be a set of functions that are simultaneously 
eigenfunctions for both operators. 

b) Discuss whether or not there is a set of functions that are eigenfunctions for 
one of these operators but not the other. 

c) Discuss whether it is reasonable to expect these two physical quantities to be 
exactly knowable simultaneously or whether the uncertainty principle makes 
this impossible. 


6-17. Suppose a hydrogen atom state was approximated by the function 
$\phi = (1/\sqrt{3})\,1s + (1/\sqrt{3})\,2s + (1/\sqrt{3})\,3s$, where 1s, 2s, and 3s are normalized 
eigenfunctions for the hydrogen atom hamiltonian. What would be the average value of 
energy associated with this function, in a.u.? 


6-18. A function $f$ is defined as follows: $f = 0.1 \cdot 1s + 0.2 \cdot 2p_1 + 0.3 \cdot 3d_2$, where 
1s is the normalized eigenfunction for the 1s state of the hydrogen atom, etc. 
Evaluate the average value of the $z$ component of angular momentum in a.u. for 
this function. 


6-19. Without looking back at the text, prove that (a) eigenvalues of hermitian operators 
are real, (b) nondegenerate eigenfunctions of hermitian operators are orthogonal, 
(c) nondegenerate eigenfunctions of $\hat{A}$ must be eigenfunctions of $\hat{B}$ if $\hat{A}$ and 
$\hat{B}$ commute. 


6-20. Using Eq. (6-38), obtain an expression for $|\Psi|^2$ as a function of $x$ and $t$. 

6-21. Evaluate $\Delta t$ of Fig. 6-2 in terms of $m$, $h$, and $L$. 

6-22. Verify the values of the coefficients in Eq. (6-40). How can you tell from simple 
inspection that (a) $c_2$ will be largest and positive, (b) $c_4$ will be zero, (c) $c_i$ will 
tend toward small values at large $i$? 


6-23. a) Evaluate the first two coefficients in Eq. (6-42). 

b) What qualitative difference would you expect between $\Psi$ of Eq. (6-42) and 
one that takes explicit account of the changing potential resulting as the beta 
particle travels away from the nucleus? 


6-24. Show by qualitative arguments based on mathematical functions why coefficients 
$c_k$ should drop off more rapidly with $k$ if the position wave packet is broader. 

$$c_k = \int \sqrt{2\alpha/\pi}\,\exp(-\alpha x^2)\exp(ikx)\, dx$$


6-25. a) Prove that, if $V$ is real and $\Psi(x, y, z, t)$ satisfies Schrödinger's time-dependent 
equation, then $\Psi(x, y, z, -t)^*$ is also a solution. (This is called "invariance 
under time reversal.") 

b) Show that, for stationary states, invariance under time reversal means that 
$\hat{H}\psi^* = E\psi^*$ if $\hat{H}\psi = E\psi$ and if $V$ is real. 

c) Show from (b) that nondegenerate eigenfunctions of $\hat{H}$ (with real $V$) must be 
real. 

d) What becomes of the $2p_{+1}$ AO (with time dependence included) upon complex 
conjugation and time reversal? The $2p_0$ AO? 

e) Can the statement in (c) be reworded to say that all degenerate eigenfunctions 
of $\hat{H}$ (with real $V$) must be complex? 


Multiple Choice Questions 
(Try to answer these without referring to the text.) 


1. Which one of the following statements about the eigenfunctions of a time- 
independent Hamiltonian operator is true? 


a) Any linear combination of these eigenfunctions is also an eigenfunction for $\hat{H}$. 
b) The state function for this system must be one of these eigenfunctions. 

c) The eigenvalues associated with these eigenfunctions must all be real. 

d) These eigenfunctions must all be orthogonal to one another. 

e) These eigenfunctions have no time dependence. 


2. A hydrogen atom is in a nonstationary state having the wavefunction 

$$\tfrac{1}{2}\left[\psi_{1s}\exp(it/2\hbar) + \psi_{2s}\exp(it/8\hbar) + \psi_{2p_z}\exp(it/8\hbar) + \psi_{2p_{+1}}\exp(it/8\hbar)\right]$$


Which statement is true at any time t? 


a) A measurement of the energy has a 25% chance of giving —0.5 a.u. 

b) The average value of the z component of angular momentum is 0.5 a.u. 

c) The average energy is $-7/32$ a.u. 

d) A measurement of the total angular momentum has a 75% chance of giving 
$\sqrt{2}$ a.u. 


e) None of the above statements is true. 


3. The function $r\exp(-0.3r^2)\cos\theta$ is expanded in terms of hydrogen atom 
wavefunctions. This series may have finite contributions from bound-state eigenfunctions 

a) of all types: $s$, $p_x$, $p_y$, $p_z$, $d_{xy}$, $d_{yz}$, etc. 
b) of all types except $s$. 
c) of types $p_x$, $p_y$, $p_z$ only. 
d) of type $p_z$ only. 
e) of types $p_z$ and $d_{z^2}$ only. 


Chapter 7 The Variation Method 


7-1 The Spirit of the Method 


The proof of the Rayleigh-Ritz variation principle (Section 6-12) involves essentially 
two ideas. The first is that any function can be expanded into a linear combination of 
other functions that span the same function space. Thus, for example, exp(ikx) can 
be expressed as cos(kx) +isin(kx). An exponential can also be written as a linear 
combination of powers of the argument: 


$$\exp(x) = 1 + x + x^2/2! + x^3/3! + \cdots + x^n/n! + \cdots \tag{7-1}$$


The second idea is that, if a function is expressed as a linear combination of 
eigenfunctions for the energy operator, then the average energy associated with the 
function is a weighted average of the energy eigenvalues. For example, if 

$$\phi = \frac{1}{\sqrt{2}}\psi_1 + \frac{1}{\sqrt{2}}\psi_2 \tag{7-2}$$

where 

$$\hat{H}\psi_1 = E_1\psi_1, \quad \hat{H}\psi_2 = E_2\psi_2, \quad E_1 \neq E_2 \tag{7-3}$$

then measuring the energy of many systems in states described by $\phi$ would give the 
result $E_1$ half of the time and $E_2$ the other half. The average value, 
$\frac{1}{2}E_1 + \frac{1}{2}E_2$, must lie between $E_1$ and $E_2$. Alternatively, if 


$$\phi = \frac{1}{\sqrt{3}}\psi_1 + \sqrt{\frac{2}{3}}\psi_2 \tag{7-4}$$

measurements would give $E_1$ one-third of the time, and $E_2$ the rest of the time, for an 
average that still must lie between $E_1$ and $E_2$. It should be evident that, even when 
$\phi$ is a linear combination of many eigenfunctions $\psi_i$, the average value of $E$ can 
never lie below the lowest or above the highest eigenvalue. 
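
The weighted-average statement can be checked with a few lines of Python. This is only a sketch: the function name `average_energy` is ours, and the two eigenvalues (taken here as $-1/2$ and $-1/8$ a.u.) are chosen purely for illustration. The weights are the squares of the expansion coefficients in Eqs. (7-2) and (7-4).

```python
from fractions import Fraction

def average_energy(weights, energies):
    """Weighted average of eigenvalues; weights are squared expansion coefficients."""
    total = sum(weights)  # equals 1 for a normalized function
    return sum(w * e for w, e in zip(weights, energies)) / total

# Illustrative (assumed) eigenvalues in a.u.; any pair with E1 != E2 would do.
E1, E2 = Fraction(-1, 2), Fraction(-1, 8)

equal_mix = average_energy([Fraction(1, 2), Fraction(1, 2)], [E1, E2])   # Eq. (7-2)
skewed_mix = average_energy([Fraction(1, 3), Fraction(2, 3)], [E1, E2])  # Eq. (7-4)

print(equal_mix, skewed_mix)
```

Whatever nonnegative weights are chosen, the result stays between the lowest and highest eigenvalue supplied, which is the property the variation method relies on.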

The variation method is based on the idea that, by varying a function to give the lowest 
average energy, we tend to maximize the amount of the lowest-energy eigenfunction 
$\psi_0$ present in the linear combination already discussed. Thus, if we minimize 

$$\bar{E} = \frac{\int \phi^* \hat{H} \phi \, dv}{\int \phi^* \phi \, dv} \tag{7-5}$$

the resulting $\phi$ should tend to resemble $\psi_0$ since we have maximized (in a sense) the 
amount of $\psi_0$ in $\phi$ by this procedure. 


7-2 Nonlinear Variation: The Hydrogen Atom 


We have already seen (Chapter 4) that the lowest-energy eigenfunction for the hydrogen 
atom is (in atomic units) 

$$\psi_{1s} = \frac{1}{\sqrt{\pi}}\exp(-r) \tag{7-6}$$

Suppose we did not know this and used the variation method to optimize the normalized 
trial function 

$$\phi = \sqrt{\frac{\zeta^3}{\pi}}\exp(-\zeta r) \tag{7-7}$$

In this example, when $\zeta = 1$, $\phi$ becomes identical to $\psi_{1s}$, but in more complicated 
systems the trial function never becomes identical to an eigenfunction of the hamiltonian. 
Nevertheless, this is a good example to start with since there are few mathematical 
complexities to obscure the philosophy of the approach. 

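The variation method requires that we minimize 

$$\bar{E} = \int \phi^* \hat{H} \phi \, dv \tag{7-8}$$

by varying $\zeta$. [$\phi$ is normalized, so no denominator is required in Eq. (7-8).] Since the 
trial function $\phi$ has no $\theta$- or $\phi$-dependent (angular) terms for $\nabla^2$ to operate on, 
only the radial part of $\nabla^2$ is needed in $\hat{H}$. Thus [from Eq. (4-7)] 

$$\hat{H} = -\frac{1}{2}\frac{1}{r^2}\frac{d}{dr}r^2\frac{d}{dr} - \frac{1}{r} \tag{7-9}$$

According to Eq. (7-8), we need first to evaluate the quantity $\hat{H}\phi$: 

$$\hat{H}\phi = \left[-\frac{1}{2r^2}\frac{d}{dr}r^2\frac{d}{dr} - \frac{1}{r}\right]\sqrt{\frac{\zeta^3}{\pi}}\exp(-\zeta r) \tag{7-10}$$

$$= \left[\frac{\zeta - 1}{r} - \frac{\zeta^2}{2}\right]\sqrt{\frac{\zeta^3}{\pi}}\exp(-\zeta r) \tag{7-11}$$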

Incorporating this into Eq. (7-8) gives (after integrating $\theta$ and $\phi$ in $dv$ to give $4\pi$) 

$$\bar{E} = 4\pi\,\frac{\zeta^3}{\pi}\int_0^\infty \left[\frac{\zeta - 1}{r} - \frac{\zeta^2}{2}\right]\exp(-2\zeta r)\, r^2 \, dr \tag{7-12}$$

$$= 4\zeta^3\left[(\zeta - 1)\int_0^\infty r\exp(-2\zeta r)\, dr - \frac{\zeta^2}{2}\int_0^\infty r^2\exp(-2\zeta r)\, dr\right] \tag{7-13}$$

Using the integral table in Appendix 1, we obtain 

$$\bar{E} = \frac{\zeta^2}{2} - \zeta \tag{7-14}$$

Now we have a simple expression for $\bar{E}$ as a function of $\zeta$. To obtain the minimum, 
we set the derivative of $\bar{E}$ to zero: 

$$\frac{d\bar{E}}{d\zeta} = 0 = \zeta - 1 \tag{7-15}$$

As we expected, $\zeta = 1$. Inserting this value for $\zeta$ into Eq. (7-14) gives 
$\bar{E} = -\frac{1}{2}$ a.u., which is identical with the lowest eigenvalue for the hydrogen atom. 
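
The same minimization can be carried out numerically. The sketch below assembles $\bar{E}(\zeta)$ from the two radial integrals in Eq. (7-13), using the standard result $\int_0^\infty r^n \exp(-ar)\,dr = n!/a^{n+1}$, and then locates the minimum with a golden-section search. The helper names (`radial_integral`, `energy_1s_trial`, `golden_section_min`) and the search interval are our own choices, not part of the text.

```python
import math

def radial_integral(n, a):
    """Integral of r**n * exp(-a*r) from 0 to infinity, equal to n!/a**(n+1)."""
    return math.factorial(n) / a ** (n + 1)

def energy_1s_trial(zeta):
    """Average energy from Eq. (7-13) for the trial function of Eq. (7-7)."""
    return 4 * zeta**3 * ((zeta - 1) * radial_integral(1, 2 * zeta)
                          - (zeta**2 / 2) * radial_integral(2, 2 * zeta))

def golden_section_min(f, lo, hi, tol=1e-8):
    """Locate the minimum of a unimodal function f on [lo, hi]."""
    g = (math.sqrt(5) - 1) / 2
    a, b = lo, hi
    while b - a > tol:
        c, d = b - g * (b - a), a + g * (b - a)
        if f(c) < f(d):
            b = d   # minimum lies in [a, d]
        else:
            a = c   # minimum lies in [c, b]
    return (a + b) / 2

zeta_best = golden_section_min(energy_1s_trial, 0.2, 3.0)
print(zeta_best, energy_1s_trial(zeta_best))
```

A search of this kind is unnecessary when, as here, the derivative can be written down, but it mirrors what must be done when $\bar{E}$ is available only numerically.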

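This example demonstrates that minimizing $\bar{E}$ for a trial function causes the function 
to become like the lowest eigenfunction for the system. But it is more realistic to 
examine a case where the trial function is incapable of becoming exactly identical with 
the lowest eigenfunction. Suppose we assumed a trial form (normalized) of 

$$\phi = \sqrt{\frac{\zeta^5}{3\pi}}\, r\exp(-\zeta r) \tag{7-16}$$

Proceeding as before, we first evaluate $\hat{H}\phi$: 

$$\hat{H}\phi = \left[-\frac{\zeta^2}{2} + \frac{2\zeta - 1}{r} - \frac{1}{r^2}\right]\phi \tag{7-17}$$

This leads to 

$$\bar{E} = \frac{\zeta^2}{6} - \frac{\zeta}{2} \tag{7-18}$$

so that 

$$\frac{d\bar{E}}{d\zeta} = \frac{\zeta}{3} - \frac{1}{2} = 0 \tag{7-19}$$

and $\bar{E}$ is a minimum when $\zeta = \frac{3}{2}$. Thus, our energy-optimized $\phi$ is 

$$\phi = \sqrt{\frac{3^5}{2^5 \cdot 3\pi}}\, r\exp\left(-\frac{3r}{2}\right) \tag{7-20}$$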

This is obviously not identical to the eigenfunction given by Eq. (7-6), but it must be 
expressible as a linear combination of hydrogen atom eigenfunctions, and the amount 
of $\psi_{1s}$ present should be quite large unless the trial form was unwisely chosen. Since 
$\phi$ also must contain contributions from higher energy eigenfunctions, it follows that 
$\bar{E}$ must be higher in energy than $-\frac{1}{2}$ a.u. We test this by inserting 
$\zeta = \frac{3}{2}$ into Eq. (7-18), obtaining an $\bar{E}$ of $-\frac{3}{8}$ a.u. ($-0.375$ a.u.). 
This value is above the lowest eigenvalue, but it is well below the second-lowest 
eigenvalue ($-\frac{1}{8}$ a.u.) associated with 2s, 2p eigenfunctions, and so we know that 
$\phi$ does indeed contain much 1s character. We can find out exactly how much 1s 
eigenfunction is contained in $\phi$ by calculating the overlap between $\phi$ and the 1s 
eigenfunction. That is, since the 1s function is orthogonal to all the other hydrogen 
atom eigenfunctions, 

$$\int \psi_{1s}^* \psi_j \, dv = \begin{cases} 0, & j \neq 1s \\ 1, & j = 1s \end{cases} \tag{7-21}$$

and since 

$$\phi = c_{1s}\psi_{1s} + \sum_{j \neq 1s} c_j \psi_j \tag{7-22}$$

it follows that 

$$\int \psi_{1s}^* \phi \, dv = c_{1s}\underbrace{\int \psi_{1s}^* \psi_{1s} \, dv}_{1} + \sum_{j \neq 1s} c_j \underbrace{\int \psi_{1s}^* \psi_j \, dv}_{0} = c_{1s} \tag{7-23}$$

Integrating $\int \psi_{1s}\phi \, dv$, where $\phi$ is given by Eq. (7-20), gives $c_{1s} = 0.9775$, so 
$\phi$ does indeed "contain" a large amount of $\psi_{1s}$. (If $c_{1s} = 1$, then $\psi_{1s}$ and $\phi$ 
are identical.) 
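
The overlap integral is simple enough to evaluate in closed form, and a short sketch makes the value easy to reproduce. The angular integration contributes $4\pi$, and the radial part is $\int_0^\infty r^3 \exp[-(1+\zeta)r]\,dr = 3!/(1+\zeta)^4$. The function name `overlap_with_1s` is our own.

```python
import math

def overlap_with_1s(zeta):
    """Overlap of psi_1s = exp(-r)/sqrt(pi) with sqrt(zeta**5/(3*pi)) * r * exp(-zeta*r)."""
    # 4*pi (angles) * 1/sqrt(pi) * sqrt(zeta**5/(3*pi)) simplifies to 4*sqrt(zeta**5/3)
    prefactor = 4.0 * math.sqrt(zeta**5 / 3.0)
    # radial integral of r**3 * exp(-(1 + zeta) * r) from 0 to infinity
    radial = math.factorial(3) / (1.0 + zeta) ** 4
    return prefactor * radial

c_1s = overlap_with_1s(1.5)   # trial function of Eq. (7-20)
weight_1s = c_1s ** 2         # fraction of 1s "character" in the trial function

print(f"c_1s = {c_1s:.4f}, c_1s**2 = {weight_1s:.4f}")
```

Since the squared coefficients of a normalized function sum to unity, $c_{1s}^2$ is the quantity that measures how much of the trial function is 1s.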

This suggests another way of trying to get a "best" approximate wavefunction. We 
could find the value of $\zeta$ that maximized the overlap between $\phi$ and $\psi_{1s}$. This 
maximizes $c_{1s}$ in Eq. (7-22). If one does this (Problem 7-6), one obtains 
$\zeta = \frac{5}{3}$, which corresponds to an overlap of 0.9826 and an $\bar{E}$ of $-0.370$ a.u. 
At first sight, this seems puzzling. $\phi(\zeta = \frac{5}{3})$ has a larger amount of $\psi_{1s}$ 
in it, but $\phi(\zeta = \frac{3}{2})$ has a lower average energy. 
But it is really not so unreasonable. Maximizing the value of $c_{1s}$ causes $\zeta$ to take on 
a certain value that may cause the coefficient for some high-energy state (say, 6s) to 
become relatively larger, producing a tendency to raise the average energy. On the 
other hand, minimizing $\bar{E}$ is a process that is implicitly concerned with what all the 
coefficients are doing. This allows for a different sort of compromise wherein $c_{1s}$ may 
be allowed to be a bit smaller if the associated energy loss is more than compensated 
for by a favorable shifting in values of higher coefficients (e.g., if $c_{2s}$ increases and 
$c_{6s}$ decreases). The purpose of this discussion is to emphasize that the variation method 
optimizes the trial function in a certain sense (best energy), but that other kinds of 
optimization are conceivable. The optimization that gives best overlap is not generally 
useful because it requires that we know the exact solution to begin with. If we vary 
$\zeta$ to obtain optimal agreement for $r$, or $V$, or $r^2$, we would find a different value of 
$\zeta$ appropriate for each property. In each case, however, we would need to know the 
correct value of $r$, etc., before starting. A great virtue of the energy variation method 
is that it does not require foreknowledge of the exact eigenvalue or eigenfunction. 
However, the function that gives the lowest value for $\bar{E}$ might not be especially good in 
describing other properties. This is demonstrated in Fig. 7-1 and Table 7-1. The figure 
indicates that the trial function differs from the exact function chiefly near the nucleus, 
where $r < 1$. This discrepancy shows up when we compare average values for various 
powers of $r$. The operators $r$, $r^2$, and $r^3$ become large when $r$ is large. Thus, these 
operators magnify $\psi^2\, dv$ at large $r$. Since the two functions are fairly similar at large 
$r$, the average values show fair agreement. But $r^{-1}$ and $r^{-2}$ become large when $r$ is 
small. Thus, the fact that the approximate function is too small at small $r$ shows up 
as a marked disagreement in the average value of $r^{-2}$, this average being much larger 
for the exact function (see Table 7-1). Any trial function that is known to be especially 
inaccurate in some region of space (e.g., at small $r$) will give unreliable average values 
for operators that are largest in that region of space (e.g., $r^{-2}$). 
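
Both comparisons in this paragraph can be reproduced from closed-form radial integrals. The sketch below evaluates the overlap and the energy [Eq. (7-18)] at the two values of $\zeta$ discussed, and then the average values of several powers of $r$ for the exact 1s function and for the trial function of Eq. (7-20). All function names are our own, and the integral formula $\int_0^\infty r^n \exp(-ar)\,dr = n!/a^{n+1}$ is assumed from a standard table.

```python
import math

def radial_integral(n, a):
    """Integral of r**n * exp(-a*r) from 0 to infinity: n!/a**(n+1)."""
    return math.factorial(n) / a ** (n + 1)

def trial_energy(zeta):
    """Average energy of the trial function r*exp(-zeta*r), Eq. (7-18)."""
    return zeta**2 / 6 - zeta / 2

def trial_overlap(zeta):
    """Overlap of the normalized trial function with the exact 1s function."""
    return 4 * math.sqrt(zeta**5 / 3) * radial_integral(3, 1 + zeta)

def mean_power_exact(k):
    """Average of r**k over the exact 1s density exp(-2r)/pi."""
    return 4 * radial_integral(k + 2, 2.0)

def mean_power_trial(k, zeta=1.5):
    """Average of r**k over the trial density (zeta**5/(3*pi)) r**2 exp(-2*zeta*r)."""
    return (4 * zeta**5 / 3) * radial_integral(k + 4, 2 * zeta)

for zeta in (3 / 2, 5 / 3):
    print(f"zeta = {zeta:.4f}: overlap = {trial_overlap(zeta):.4f}, "
          f"energy = {trial_energy(zeta):.4f}")

for k in (1, 2, 3, -1, -2):
    print(f"r^{k}: exact = {mean_power_exact(k):.2f}, trial = {mean_power_trial(k):.2f}")
```

The positive powers agree to within about 10 percent, while the $r^{-2}$ average differs by more than a factor of two, in line with Table 7-1.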

Choosing a trial form such as Eq. (7-16), which must vanish at $r = 0$, might seem 
foolish since we know that $\psi_{1s}$ does not vanish at $r = 0$. And if we were interested in 
electron density at the nucleus for, say, calculating the Fermi contact interaction, this 
would indeed be a self-defeating choice. But if our interest is in energies or other 
properties having operators that are large in regions where the trial function is not too 
deficient, this choice would serve. One often settles for a mathematically convenient 
form even though it is known to be inadequate in some way. Care must then be exercised, 
however, to avoid using that trial wavefunction in ways which emphasize its inadequacies. 


Figure 7-1 Plots of $\psi_{1s}$ and $\phi$ [Eq. (7-20)] versus $r$ (bohrs). 


TABLE 7-1 Comparison between Exact Values for Some Properties of (1s) Hydrogen and 
Values Calculated from the Function Eq. (7-20) 

Quantity                       Exact (1s) value (a.u.)    Trial function value (a.u.) 
$\bar{E}$                      $-1/2$                     $-3/8$ 
Electron density at nucleus    $1/\pi$                    0 
$r$                            1.5                        1.67 
$r^2$                          3.0                        3.33 
$r^3$                          7.5                        7.78 
$r^{-1}$                       1.0                        0.75 
$r^{-2}$                       2.0                        0.75 


7-3 Nonlinear Variation: The Helium Atom 


We mentioned in Chapter 5 that the ground-state wavefunction 1s(1)1s(2) for helium 
was much too contracted if the 1s functions were taken from the He$^+$ ion without 
modification. Physically, this arises because, in He$^+$, the single electron sees only a 
doubly positive nucleus, whereas in He each electron sees a doubly positive nucleus 
and another electron, so that in He the repulsion between electrons prevents them from 
spending as much time near the nucleus as in He$^+$. Somehow, the 1s functions should 
be modified to reflect this behavior. We will now show how the variation method may 
be used to accomplish this. 

The form of the hydrogenlike ion 1s solution is 

$$1s = \sqrt{\frac{Z^3}{\pi}}\exp(-Zr) \tag{7-24}$$

For He$^+$, $Z = 2$, but we have just seen that this gives a function that is too contracted. 
Smaller values of $Z$ would cause the function to die away more slowly with $r$. Therefore, 
it is reasonable to replace the atomic number $Z$ with a variable parameter $\zeta$ and find 
the value of $\zeta$ that gives the lowest average energy. Hence, we let 

$$1s'(1) = \sqrt{\frac{\zeta^3}{\pi}}\exp(-\zeta r_1) \tag{7-25}$$

and our trial wavefunction is$^1$ 

$$\phi(1, 2) = 1s'(1)1s'(2)\,(1/\sqrt{2})\left[\alpha(1)\beta(2) - \beta(1)\alpha(2)\right] \tag{7-26}$$

The average energy is [since $\phi(1, 2)$ is normalized] 

$$\bar{E} = \int\!\!\int \phi^*(1, 2)\hat{H}(1, 2)\phi(1, 2)\, d\tau(1)\, d\tau(2) \tag{7-27}$$

Since $\hat{H}(1, 2)$ contains no spin operators at our level of approximation, the integral 
separates into an integral over the space coordinates of both electrons and an integral 
over the spin coordinates of both electrons. The integration over spins gives a factor of 
unity. There remains 

$$\bar{E} = \int\!\!\int 1s'(1)1s'(2)\hat{H}(1, 2)1s'(1)1s'(2)\, dv(1)\, dv(2) \tag{7-28}$$

where 

$$\hat{H}(1, 2) = -\frac{1}{2}\nabla_1^2 - \frac{1}{2}\nabla_2^2 - \frac{2}{r_1} - \frac{2}{r_2} + \frac{1}{r_{12}} \tag{7-29}$$

and where the $\theta$ and $\phi$ parts of $\nabla^2$ can be ignored since $\phi(1, 2)$ is independent of 
these variables. The calculation is easier if we recognize that 

$$\hat{H}(1, 2) = \hat{H}_{\mathrm{He}^+}(1) + \hat{H}_{\mathrm{He}^+}(2) + \frac{1}{r_{12}} \tag{7-30}$$

1 In this trial function, $\zeta$ has the same value in each atomic orbital. This is not a necessary restriction. There is no 
physical reason for not choosing the more general trial function where orbitals with different $\zeta$ are used. Symmetry 
requires that such a function be written $2^{-1/2}[1s'(1)1s''(2) + 1s''(1)1s'(2)]\,2^{-1/2}[\alpha(1)\beta(2) - \beta(1)\alpha(2)]$. This 
type of function is called a "split shell" wavefunction. It gives a lower energy for He than does the function (7-26). 
However, for most quantum-chemical calculations split shells are not used, the gain in accuracy usually not being 
commensurate with the increased computational effort. 

This allows us to express Eq. (7-28) as the sum of three integrals, the first of these being 

$$\int\!\!\int 1s'(1)1s'(2)\hat{H}_{\mathrm{He}^+}(1)1s'(1)1s'(2)\, dv(1)\, dv(2) \tag{7-31}$$

Since the operator in the integrand operates only on coordinates of electron 1, we can 
separate this into a product of two integrals: 

$$\int 1s'(2)1s'(2)\, dv(2) \int 1s'(1)\hat{H}_{\mathrm{He}^+}(1)1s'(1)\, dv(1) \tag{7-32}$$

The $1s'$ functions are normalized, and so the first integral is unity. The second 
integral is almost identical to the integral in Eq. (7-8) and has the value $(\zeta^2/2) - 2\zeta$. 
Therefore, the first of the three integrals mentioned above equals $(\zeta^2/2) - 2\zeta$. The 
second of the three integrals is identical with Eq. (7-31) except that the operator acts 
on electron 2 instead of 1. This integral is evaluated in the same manner and gives 
the same result. The third integral, in which the operator is $1/r_{12}$, is more difficult to 
evaluate. This interesting and instructive problem constitutes a detour from the main 
sequence of ideas in this chapter and is therefore discussed in Appendix 3. We here 
simply take the result, $\frac{5}{8}\zeta$, and proceed with the variation calculation. 

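We now have an expression for $\bar{E}$ as a function of $\zeta$: 

$$\bar{E} = 2\left[(\zeta^2/2) - 2\zeta\right] + \frac{5}{8}\zeta = \zeta^2 - \frac{27}{8}\zeta \tag{7-33}$$

Minimizing $\bar{E}$ with respect to $\zeta$ gives 

$$\frac{d\bar{E}}{d\zeta} = 0 = 2\zeta - \frac{27}{8} \tag{7-34}$$

so that 

$$\zeta = \frac{27}{16} \tag{7-35}$$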

This value of $\zeta$ is smaller than the unmodified He$^+$ value of 2, as we anticipated. Let us 
see how much the average energy has been improved. According to Eq. (7-33), when 
$\zeta$ is 2, $\bar{E}$ is equal to $-2.75$ a.u. When $\zeta = \frac{27}{16}$, $\bar{E}$ equals $-2.848$ a.u., so 
the average energy has been lowered by approximately 0.1 a.u., or 2.7 eV, or 62 kcal/mole. 
(The exact nonrelativistic energy for He is $-2.903724377$ a.u.) Further analysis would show 
that, by decreasing $\zeta$, we have decreased the average kinetic energy (the less compressed 
wavefunction changes slope less rapidly), raised the nuclear-electron attraction energy 
from a large negative to a smaller negative value (the decreased attraction resulting 
from the electrons being farther from the nucleus, on the average), and decreased the 
interelectronic repulsion energy from a higher positive value to a lower one. The 
variational procedure has allowed the wavefunction to adjust to the best compromise it 
can achieve among these three factors. If $\zeta$ becomes less than $\frac{27}{16}$, the loss of 
nuclear-electron attraction is too great to be offset by the loss of interelectronic 
repulsion and kinetic energy. 
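
The three-way compromise can be made concrete by tabulating the separate contributions to Eq. (7-33). In the sketch below the one-electron value $(\zeta^2/2) - 2\zeta$ is split into its kinetic part $\zeta^2/2$ and its nuclear attraction part $-2\zeta$ (the standard result for a 1s function of exponent $\zeta$ about a nucleus of charge 2). The function names and the conversion factors (27.2114 eV and 627.51 kcal/mole per a.u.) are supplied by us rather than taken from the text.

```python
from fractions import Fraction

def helium_energy_terms(zeta):
    """Kinetic, nuclear attraction, and repulsion contributions to Eq. (7-33), in a.u."""
    kinetic = zeta**2                    # 2 electrons * (zeta**2 / 2)
    attraction = -4 * zeta               # 2 electrons * (-2 * zeta)
    repulsion = Fraction(5, 8) * zeta    # value of the 1/r12 integral
    return kinetic, attraction, repulsion

def helium_energy(zeta):
    """Average energy of the trial function 1s'(1)1s'(2), Eq. (7-33)."""
    return sum(helium_energy_terms(zeta))

zeta_opt = Fraction(27, 16)                       # Eq. (7-35)
lowering = helium_energy(2) - helium_energy(zeta_opt)
lowering_ev = float(lowering) * 27.2114           # assumed conversion: eV per a.u.
lowering_kcal = float(lowering) * 627.51          # assumed conversion: kcal/mole per a.u.

for zeta in (Fraction(2), zeta_opt):
    t, v_ne, v_ee = helium_energy_terms(zeta)
    print(float(zeta), float(t), float(v_ne), float(v_ee), float(helium_energy(zeta)))
```

Exact fractions are used so that the optimized energy comes out as $-(27/16)^2$ without rounding; the printed terms show the kinetic and repulsion energies falling and the attraction energy rising as $\zeta$ drops from 2 to 27/16.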

Varying a parameter in the argument of an exponential produces a nonlinear change 
in the function, and so calculations of the type described above are referred to as 
nonlinear variation calculations. Such calculations tend to become mathematically 
complicated and are not frequently used except for fairly simple systems. The fact 
that the hamiltonian operator is a linear operator [i.e., 
$\hat{H}(c_1\phi_1 + c_2\phi_2) = c_1\hat{H}\phi_1 + c_2\hat{H}\phi_2$] makes a linear variation procedure 
more convenient for most purposes. 


7-4 Linear Variation: The Polarizability of the Hydrogen Atom 


Suppose we wish to express a wavefunction $\psi$ (which may be approximate) as a linear 
combination of two known functions $\phi_1$ and $\phi_2$: 

$$\psi(c_1, c_2) = c_1\phi_1 + c_2\phi_2 \tag{7-36}$$

The question is, what values of $c_1$ and $c_2$ give a $\psi$ that best approximates the exact 
wavefunction for a particular system? The usual approach is to determine which values 
of $c_1$ and $c_2$ give the $\psi$ associated with the minimum average energy attainable. The 
technique for achieving this, called the linear variation method, is by far the most 
common type of quantum chemical calculation performed. 

An example of a problem that can be treated by this method is the polarizability of 
the hydrogen atom. The wavefunction for the unperturbed hydrogen atom in its ground 
state is spherically symmetrical. But, when a uniform z-directed external electric field 
of strength $F$ is imposed, the positive nucleus and the negative electron are attracted 
in opposite directions, which leads to an electronic distribution that is skewed with 
respect to the nucleus. The wavefunction describing this skewed distribution may be 
approximated by mixing with the unperturbed 1s function some $2p_z$ function: 
$\psi = c_1 1s + c_2 2p_z$. As indicated in Fig. 7-2, this produces a skewed wavefunction because 
the $2p_z$ function is of the same sign as the 1s function on one side of the nucleus and 
of the opposite sign on the other. We will work out the details of this example after 
developing the method for the general case. 

Let the generalized trial function $\psi$ be a linear combination of known functions 
$\phi_1, \phi_2, \ldots, \phi_n$. (This set of functions is called the basis set for the calculation.) 

$$\psi = c_1\phi_1 + c_2\phi_2 + \cdots + c_n\phi_n \tag{7-37}$$

where the coefficients $c_i$ are to be determined so that 

$$\bar{E} = \frac{\int \psi^* \hat{H} \psi \, d\tau}{\int \psi^* \psi \, d\tau} \tag{7-38}$$

is minimized. Substituting Eq. (7-37) into Eq. (7-38) gives 

$$\bar{E} = \frac{\int (c_1^*\phi_1^* + c_2^*\phi_2^* + \cdots + c_n^*\phi_n^*)\hat{H}(c_1\phi_1 + c_2\phi_2 + \cdots + c_n\phi_n)\, d\tau}{\int (c_1^*\phi_1^* + c_2^*\phi_2^* + \cdots + c_n^*\phi_n^*)(c_1\phi_1 + c_2\phi_2 + \cdots + c_n\phi_n)\, d\tau} = \frac{\mathrm{num}}{\mathrm{denom}} \tag{7-39}$$

Since we will be dealing with cases in which the $c$'s and $\phi$'s are real, we will 
temporarily omit the complex conjugate notation to simplify the derivation. At the minimum 
value of $\bar{E}$, 

$$\frac{\partial \bar{E}}{\partial c_1} = \frac{\partial \bar{E}}{\partial c_2} = \cdots = \frac{\partial \bar{E}}{\partial c_n} = 0 \tag{7-40}$$


Figure 7-2 Values of $\psi$ versus $z$ (a.u.) for 1s state of H atom (solid line) and for approximate 
wavefunction given by 0.982 1s $-$ 0.188 $2p_z$ (dashed line). The nucleus is at $z = 0$ for each case. 


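The partial derivative of Eq. (7-39) with respect to $c_1$ is 

$$\frac{\partial \bar{E}}{\partial c_1} = \frac{\int \phi_1 \hat{H}(c_1\phi_1 + \cdots + c_n\phi_n)\, d\tau}{\mathrm{denom}} + \frac{\int (c_1\phi_1 + \cdots + c_n\phi_n)\hat{H}\phi_1\, d\tau}{\mathrm{denom}} - (\mathrm{num})(\mathrm{denom})^{-2}\left[\int \phi_1(c_1\phi_1 + \cdots + c_n\phi_n)\, d\tau + \int (c_1\phi_1 + \cdots + c_n\phi_n)\phi_1\, d\tau\right] = 0 \tag{7-41}$$

Multiplying through by denom, recalling that num/denom equals $\bar{E}$, and rearranging 
(the two integrals containing $\hat{H}$ are equal because $\hat{H}$ is hermitian and the functions 
are real, so a common factor of 2 divides out), gives 

$$c_1\left[\int \phi_1\hat{H}\phi_1\, d\tau - \bar{E}\int \phi_1\phi_1\, d\tau\right] + c_2\left[\int \phi_1\hat{H}\phi_2\, d\tau - \bar{E}\int \phi_1\phi_2\, d\tau\right] + \cdots + c_n\left[\int \phi_1\hat{H}\phi_n\, d\tau - \bar{E}\int \phi_1\phi_n\, d\tau\right] = 0 \tag{7-42}$$

At this point, it is convenient to switch to an abbreviated notation: 

$$\int \phi_i \hat{H} \phi_j \, d\tau = H_{ij} \tag{7-43}$$

$$\int \phi_i \phi_j \, d\tau = S_{ij} \tag{7-44}$$

The integral $S_{ij}$ is normally called an overlap integral since its value is, in certain 
cases, an indication of the extent to which the two functions $\phi_i$ and $\phi_j$ occupy the same 
space. Use of this abbreviated notation produces, for Eq. (7-42), 

$$c_1(H_{11} - \bar{E}S_{11}) + c_2(H_{12} - \bar{E}S_{12}) + \cdots + c_n(H_{1n} - \bar{E}S_{1n}) = 0 \tag{7-45}$$

A similar treatment for $\partial \bar{E}/\partial c_i$ gives a similar equation: 

$$c_1(H_{i1} - \bar{E}S_{i1}) + c_2(H_{i2} - \bar{E}S_{i2}) + \cdots + c_n(H_{in} - \bar{E}S_{in}) = 0 \tag{7-46}$$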


Thus, requiring that $\partial \bar{E}/\partial c_i$ vanish for all coefficients produces $n$ homogeneous linear 
equations (homogeneous, all equal zero; linear, all $c_i$'s to first power). If one chooses 
a value for $\bar{E}$, there remain $n$ unknowns, the coefficients $c_i$. (The integrals $H_{ij}$ and 
$S_{ij}$ are presumably knowable since $\hat{H}$ and the functions $\phi_i$ are known.) Of course, one 
trivial solution for Eqs. (7-46) is always possible, namely, $c_1 = c_2 = \cdots = c_n = 0$. But 
this corresponds to $\psi = 0$, a case of no physical interest. Are there nontrivial solutions 
as well? In quantum chemical calculations, nontrivial solutions usually exist only for 
certain discrete values of $\bar{E}$. This provides the approach for solving the problem. 
First, find those values of $\bar{E}$ for which nontrivial coefficients exist. Second, substitute 
into Eqs. (7-46) whichever of these values of $\bar{E}$ one is interested in and solve for the 
coefficients. (Each value of $\bar{E}$ has its own associated set of coefficients.) But how do 
we find these particular values of $\bar{E}$ that yield nontrivial solutions to Eqs. (7-46)? The 
answer is given in Appendix 2, where it is shown that the condition which must be met 
by the coefficients of a set of linear homogeneous equations in order that nontrivial 
solutions exist is that their determinant vanish. Notice that, in the standard treatment 
given in Appendix 2, the coefficients are known and $x$, $y$, and $z$ are unknown. Here, 
however, the coefficients $c_i$ are unknown, and $H_{ij}$ and $S_{ij}$ are known. Therefore, it is 
the determinant of the $H$'s, $S$'s and $\bar{E}$ in Eqs. (7-46) that must equal zero: 

$$\begin{vmatrix} H_{11} - \bar{E}S_{11} & H_{12} - \bar{E}S_{12} & \cdots & H_{1n} - \bar{E}S_{1n} \\ H_{21} - \bar{E}S_{21} & H_{22} - \bar{E}S_{22} & \cdots & H_{2n} - \bar{E}S_{2n} \\ \vdots & \vdots & & \vdots \\ H_{n1} - \bar{E}S_{n1} & H_{n2} - \bar{E}S_{n2} & \cdots & H_{nn} - \bar{E}S_{nn} \end{vmatrix} = 0 \tag{7-47}$$

Expansion of this determinant gives a single equation containing the unknown $\bar{E}$. Any 
value of $\bar{E}$ satisfying this equation is associated with a nontrivial set of coefficients. 
The lowest of these values of $\bar{E}$ is the minimum average energy achievable by variation 
of the coefficients. Substitution of this value of $\bar{E}$ back into Eqs. (7-46) produces 
$n$ simultaneous equations for the $n$ coefficients. Equation (7-47) is referred to as 
the secular equation, and the determinant on the left-hand side is called the secular 
determinant. 
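
For two basis functions the whole procedure fits in a few lines. The sketch below expands the 2 x 2 secular determinant into a quadratic in the energy, solves it, and then recovers normalized coefficients from the first of the homogeneous equations. It assumes real integrals with $H_{21} = H_{12}$ and $S_{21} = S_{12}$; the function names and the numerical values of the integrals are hypothetical, chosen only to exercise the code.

```python
import math

def secular_roots_2x2(H11, H12, H22, S11=1.0, S12=0.0, S22=1.0):
    """Roots of det(H - E*S) = 0 for a real symmetric 2x2 problem, lowest first."""
    a = S11 * S22 - S12**2                        # coefficient of E**2 (positive)
    b = -(H11 * S22 + H22 * S11 - 2 * H12 * S12)  # coefficient of E
    c = H11 * H22 - H12**2                        # constant term
    disc = math.sqrt(b * b - 4 * a * c)
    return (-b - disc) / (2 * a), (-b + disc) / (2 * a)

def coefficients(E, H11, H12, S11=1.0, S12=0.0, S22=1.0):
    """Solve c1*(H11 - E*S11) + c2*(H12 - E*S12) = 0 and normalize the result.

    Fails (division by zero) in the special case where both factors vanish.
    """
    c1, c2 = (H12 - E * S12), -(H11 - E * S11)
    norm = math.sqrt(c1 * c1 * S11 + 2 * c1 * c2 * S12 + c2 * c2 * S22)
    return c1 / norm, c2 / norm

# Hypothetical integrals (a.u.) for a nonorthogonal pair of basis functions.
H11, H12, H22, S12 = -1.0, -0.2, -0.5, 0.1
E_low, E_high = secular_roots_2x2(H11, H12, H22, S12=S12)
c1, c2 = coefficients(E_low, H11, H12, S12=S12)
print(E_low, E_high, c1, c2)
```

For larger basis sets the determinant is not expanded by hand; the same problem is instead posed as a generalized matrix eigenvalue problem and handed to a numerical library.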

This method is best illustrated by example, and we will now proceed with the problem 
of a hydrogen atom in a z-directed uniform electric field of strength $F$ a.u. As 
mentioned earlier, a suitable choice of functions to mix together to approximate the 
accurate wavefunction is the 1s and $2p_z$ hydrogenlike functions. The choice of two 
basis functions leads to a 2 x 2 secular determinantal equation: 

$$\begin{vmatrix} H_{11} - \bar{E}S_{11} & H_{12} - \bar{E}S_{12} \\ H_{21} - \bar{E}S_{21} & H_{22} - \bar{E}S_{22} \end{vmatrix} = 0 \tag{7-48}$$

If we arbitrarily associate the 1s function with the index 1 and the $2p_z$ function with 
index 2 (consistent with $\psi = c_1 1s + c_2 2p_z$), then the terms in the determinant are 
(returning to general complex conjugate notation): 

$$\begin{aligned} H_{11} &= \int 1s^* \hat{H} 1s \, d\tau & H_{12} &= \int 1s^* \hat{H} 2p_z \, d\tau \\ H_{21} &= \int 2p_z^* \hat{H} 1s \, d\tau & H_{22} &= \int 2p_z^* \hat{H} 2p_z \, d\tau \end{aligned} \tag{7-49}$$

(The electron label has been omitted since there is only one electron.) The corresponding 
$S$ integrals are obtained from these if $\hat{H}$ is omitted in each case. The hamiltonian 
operator is just that for a hydrogen atom with an additional term to account for the 
z-directed field: 

$$\hat{H} = -\frac{1}{2}\nabla^2 - (1/r) - Fr\cos\theta \tag{7-50}$$

[The energy of a charge $-e$ in a uniform electric field of strength $F$ and direction $z$ is 
$-eFz$. In atomic units, one unit of field strength is $e/a_0^2 = 5.142 \times 10^{11}$ V/m. The unit 
of charge in atomic units is $e$, so this symbol does not appear explicitly in Eq. (7-50). 
Also, the identity $z = r\cos\theta$ has been used.] This may also be written 

$$\hat{H} = \hat{H}_{\mathrm{hyd}} - Fr\cos\theta \tag{7-51}$$

where $\hat{H}_{\mathrm{hyd}}$ is the hamiltonian for the unperturbed hydrogen atom. 

The secular determinant contains four $H$-type and four $S$-type terms. However, 
evaluating these eight integrals turns out to be much easier than one might expect. 
In the first place, $S_{12} = S_{21}$ since these integrals differ only in the order of the two 
functions in the integrand, and the functions commute. Also, because $\hat{H}$ is hermitian, 
it follows immediately that $H_{21} = H_{12}^*$. This leaves us with three $S$ terms and three $H$ 
terms to evaluate. The three $S$ terms are simple. Because the hydrogenlike functions 
are orthonormal, $S_{11}$ and $S_{22}$ equal unity, and $S_{12}$ vanishes. These points have already 
reduced the secular determinantal equation to 

$$\begin{vmatrix} H_{11} - \bar{E} & H_{12} \\ H_{12}^* & H_{22} - \bar{E} \end{vmatrix} = 0 \tag{7-52}$$

Consider next the term $H_{11}$. This may be written as 

$$H_{11} = \int 1s^* \hat{H}_{\mathrm{hyd}} 1s \, d\tau - \int 1s^* (Fr\cos\theta) 1s \, d\tau \tag{7-53}$$

But the 1s function is an eigenfunction of $\hat{H}_{\mathrm{hyd}}$ with eigenvalue $-\frac{1}{2}$ a.u. Therefore, the 
first integral on the right-hand side of Eq. (7-53) is 

$$\int 1s^* \hat{H}_{\mathrm{hyd}} 1s \, d\tau = -\frac{1}{2}\int 1s^* 1s \, d\tau = -\frac{1}{2}\ \mathrm{a.u.} \tag{7-54}$$

The second term on the right-hand side of Eq. (7-53) is zero by symmetry since $1s^* 1s$ 
is symmetric for reflection in the $xy$ plane while $r\cos\theta$ ($= z$) is antisymmetric. Thus, 
$H_{11} = -\frac{1}{2}$ a.u. Similarly, $H_{22} = -\frac{1}{8}$ a.u. [Recall that the eigenvalues of hydrogen 
are equal to $-1/(2n^2)$, and here $n = 2$.] All that remains is $H_{12}$: 

$$H_{12} = \int 1s^* \hat{H}_{\mathrm{hyd}} 2p_z \, d\tau - F\int 1s^* (r\cos\theta) 2p_z \, d\tau \tag{7-55}$$

The first term on the right-hand side is easily shown to be zero: 

$$\int 1s^* \hat{H}_{\mathrm{hyd}} 2p_z \, d\tau = \int 1s^* \left(-\frac{1}{8}\right) 2p_z \, d\tau = 0 \tag{7-56}$$

Here we employ the fact that $2p_z$ is an eigenfunction of $\hat{H}_{\text{hyd}}$ and then the fact that 1s
and $2p_z$ AOs are orthogonal. The second term on the right-hand side must be written
out in full and integrated by "brute force." It is remarkable that, of the eight terms
originally considered, only one needs to be done by detailed integration. Proceeding,
we substitute formulas for 1s and $2p_z$ in this last integral to obtain

$$-F \iiint (\pi)^{-1/2}\exp(-r)\,[r\cos\theta]\,(32\pi)^{-1/2}\, r\exp(-r/2)\cos\theta\,(r^2\sin\theta)\, dr\, d\theta\, d\phi \tag{7-57}$$


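We consider the integration over spin to have been carried out already, giving a factor
of unity. Integrating over $\phi$ to obtain $2\pi$, and regrouping terms gives

$$-\frac{2\pi F}{4\sqrt{2}\,\pi} \int_0^\infty r^4 \exp(-3r/2)\, dr \int_0^\pi \cos^2\theta \sin\theta\, d\theta \tag{7-58}$$

$$= -\frac{F}{2\sqrt{2}} \cdot \frac{4!}{(3/2)^5} \cdot \frac{2}{3} = -\frac{2^{15/2}}{3^5}\, F\ \text{a.u.} \tag{7-59}$$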
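The arithmetic in Eqs. (7-58) and (7-59) can be checked with a few lines of Python. This is only a sketch: the function name `h12` is ours, and the radial integral uses the standard result $\int_0^\infty r^n e^{-ar}\,dr = n!/a^{n+1}$ rather than numerical quadrature.

```python
import math

def h12(field):
    """H12 = -F <1s| r cos(theta) |2pz> in atomic units, following Eqs. (7-57) to (7-59)."""
    # 1s and 2pz normalization constants times the 2*pi from the phi integration.
    prefactor = 2.0 * math.pi / (math.sqrt(math.pi) * math.sqrt(32.0 * math.pi))
    # Radial part: integral of r^4 exp(-3r/2) from 0 to infinity = 4! / (3/2)^5.
    radial = math.factorial(4) / 1.5 ** 5
    # Angular part: integral of cos^2(theta) sin(theta) from 0 to pi = 2/3.
    angular = 2.0 / 3.0
    return -field * prefactor * radial * angular

closed_form = -(2.0 ** 7.5) / 3.0 ** 5      # -2^(15/2) / 3^5, per unit field
print(h12(1.0), closed_form)                # the two numbers should agree
print(h12(0.1))                             # compare with 0.074493 in Eq. (7-65)
```

The value for F = 0.1 a.u. is the off-diagonal element that enters the numerical secular equations later in this section.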


This completes the task of evaluating the terms in the secular determinant.² The final
result is (in atomic units)


$$
\begin{vmatrix} -\frac{1}{2} - E & -2^{15/2}F/3^5 \\ -2^{15/2}F/3^5 & -\frac{1}{8} - E \end{vmatrix} = 0 \tag{7-60}
$$


which expands to

$$\frac{1}{16} + \frac{5}{8}E + E^2 - \frac{2^{15}F^2}{3^{10}} = 0 \tag{7-61}$$

This is quadratic in E having roots

$$E = -\frac{5}{16} \pm \frac{1}{2}\left(\frac{9}{64} + \frac{2^{17}F^2}{3^{10}}\right)^{1/2} \tag{7-62}$$

² Since $H_{12}$ is real, it is clear that $H_{21} = H_{12}$.


When no external field is present, F = 0 and the roots are just $-\frac{1}{2}$ and $-\frac{1}{8}$ a.u., the
1s and $2p_z$ eigenvalues for the unperturbed hydrogen atom. As F increases from zero,
the roots change, as indicated in Fig. 7-3. We see that, for a given field strength, there
are only two values of E that will cause the determinant to vanish. If either of these
two values of E is substituted into the simultaneous equations related to Eq. (7-52),
then nontrivial values for $c_1$ and $c_2$ can be found. Thus, at F = 0.1 a.u., E = −0.51425
a.u. and E = −0.1107 a.u. are values of $\bar{E}$ for which $\partial\bar{E}/\partial c_1$ and $\partial\bar{E}/\partial c_2$ both
vanish. The former is the minimum, the latter the maximum in the curve of $\bar{E}$ versus
$c_1$ (Fig. 7-4). (The normality requirement results in the two c's being dependent, and
so $\bar{E}$ may be plotted against either of them.) Since the variation principle tells us that
$\bar{E} \geq E_{\text{lowest exact}}$, we can say immediately that the energy of the hydrogen atom in a
uniform electric field of 0.1 a.u. is −0.51425 a.u. or lower. That is, −0.51425 a.u. is
an upper bound to the true energy.
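The dependence of the two roots on field strength, Eq. (7-62), is easy to tabulate. The following sketch (the function name `secular_roots` is ours) evaluates the closed-form roots at a few field strengths, including the F = 0.1 a.u. case discussed above.

```python
import math

def secular_roots(field):
    """Lower and upper roots of Eq. (7-62), in a.u., for field strength F in a.u."""
    half_gap = 0.5 * math.sqrt(9.0 / 64.0 + 2.0 ** 17 * field ** 2 / 3.0 ** 10)
    return -5.0 / 16.0 - half_gap, -5.0 / 16.0 + half_gap

for field in (0.0, 0.05, 0.1):
    lower, upper = secular_roots(field)
    print(f"F = {field:.2f}   E_lower = {lower:.5f}   E_upper = {upper:.5f}")
```

At F = 0 the two roots reduce to the unperturbed 1s and 2p eigenvalues, and the gap between them widens as F grows.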

Now that we have the value of the lowest E, we can solve for $c_1$ and $c_2$ and obtain
the approximate ground state wavefunction. The homogeneous equations related to the
determinant in Eq. (7-52) are

$$c_1(H_{11} - E) + c_2 H_{12} = 0 \tag{7-63}$$
$$c_1 H_{12} + c_2(H_{22} - E) = 0 \tag{7-64}$$


Substituting −0.51425 for E, and inserting the values for $H_{11}$, $H_{22}$, and $H_{12}$ found
earlier gives (when F = 0.1 a.u.)

$$0.01425c_1 - 0.074493c_2 = 0 \tag{7-65}$$
$$-0.074493c_1 + 0.38925c_2 = 0 \tag{7-66}$$

Equation (7-65) gives

$$c_1 = 5.2275c_2 \tag{7-67}$$

If we substitute this expression for $c_1$ into Eq. (7-66) we get

$$-0.3892c_2 + 0.3892c_2 = 0 \tag{7-68}$$


This is useless for evaluating $c_2$. It is one of the properties of such a set of homogeneous
equations that the last unused equation is useless for determining coefficients. This
arises because an equation like (7-63) still equals zero when $c_1$ and $c_2$ are both multiplied
by the same arbitrary constant. Therefore, these equations are inherently capable of
telling us the ratio of $c_1$ and $c_2$ only, and not their absolute values. We shall determine
absolute values by invoking the requirement that $\psi$ be normalized. In this case, this
means that

$$c_1^2 + c_2^2 = 1 \tag{7-69}$$
or
$$(5.2275c_2)^2 + c_2^2 = 1 \tag{7-70}$$
which gives
$$c_2 = \pm 0.18789 \tag{7-71}$$

Figure 7-3 ► Average energies for a hydrogen atom in a uniform electric field of strength F as
given by a linear variation calculation using a 1s, $2p_z$ basis. (- - -) Results from accurate calculations.


If we arbitrarily choose the positive root for $c_2$, it follows from Eq. (7-67) that
$c_1 = 0.98219$.

Figure 7-4 ► $\bar{E}$ versus $c_1$ for a hydrogen atom in a uniform electric field of strength 0.1 a.u.


Thus, when F = 0.1 a.u., the linear variation method using a 1s, $2p_z$ basis set gives
an upper bound to the energy of −0.51425 a.u. and a corresponding approximate
wavefunction of

$$\psi = 0.98219\, 1s + 0.18789\, 2p_z \tag{7-72}$$


As mentioned earlier, the admixture of 2pz with 1s produces the skewed charge distri- 
bution shown in Fig. 7-2. 

It is important to note that the extent of mixing between 1s and $2p_z$ depends partly on
the size of the off-diagonal determinantal element $H_{12}$. When $H_{12}$ is zero (no external
field), no mixing occurs. As $H_{12}$ increases, mixing increases. Generally speaking,
the larger the size of the off-diagonal element connecting two basis functions in the
secular determinant, the greater the degree of mixing of these basis functions in the
final solution, other factors being equal. $H_{ij}$ is sometimes referred to as the interaction
element between basis functions i and j.

If we carried through the same procedure using the maximum E of −0.1107 a.u.,
we would obtain the approximate wavefunction

$$\psi' = 0.98219\, 2p_z - 0.18789\, 1s \tag{7-73}$$

This wavefunction is orthogonal to $\psi$. It may be proved that the second root,
E = −0.1107, is an upper bound for the energy of the second-lowest state of the
hydrogen atom in the field. However, $\psi'$ is probably not too good an approximation
to the exact wavefunction for that state. This is partly because the wavefunction $\psi'$
is one that maximizes E. Hence, there is no particular tendency for the procedure to
isolate the second-lowest state from the infinite manifold of states. Also, our basis set
was chosen with an eye toward its appropriateness for approximating the lowest state.
The true second-lowest state might be expected to contain significant amounts of the
2s AO, which is not included in this basis.

The values of E versus $c_1$ are plotted in Fig. 7-4. The values of E that we obtained
by the variation procedure correspond to the extrema in this figure. The low-energy 
extreme corresponds to a wavefunction that shifts negative charge in the direction it 
is attracted by the field. The high-energy extreme corresponds to a wavefunction that 
shifts charge in the opposite direction. (When a calculation is performed over a more 
extensive basis to produce more than two roots, the highest and lowest roots correspond, 
respectively, to the maximum and minimum, the other roots to saddle points, on the 
energy hypersurface.) 
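The curve of Fig. 7-4 can be regenerated directly from the three matrix elements. The sketch below assumes the positive root for $c_2$, so that $c_2 = (1 - c_1^2)^{1/2}$, and locates the extrema by a simple grid scan; the names `average_energy`, `c1_min`, and `c1_max` are ours.

```python
import math

H11, H22 = -0.5, -0.125
H12 = -(2.0 ** 7.5) / 3.0 ** 5 * 0.1      # Eq. (7-59) with F = 0.1 a.u.

def average_energy(c1):
    """Average energy of c1*1s + c2*2pz with c2 fixed by normalization (positive root)."""
    c2 = math.sqrt(1.0 - c1 * c1)
    return c1 * c1 * H11 + c2 * c2 * H22 + 2.0 * c1 * c2 * H12

grid = [i / 10000.0 for i in range(-10000, 10001)]    # c1 from -1 to +1
c1_min = min(grid, key=average_energy)
c1_max = max(grid, key=average_energy)
print(c1_min, average_energy(c1_min))     # minimum: compare Eq. (7-72) and the lower root
print(c1_max, average_energy(c1_max))     # maximum: compare Eq. (7-73) and the upper root
```

With $c_2$ held positive, the maximum appears at a negative $c_1$, which is the same function as Eq. (7-73) up to an overall sign.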

The detailed treatment just completed is rather involved, and so we now summarize
the main points. Step 1 involved selection of a basis set of functions which is capable
of approximating the exact solution. Step 2 was the construction of the secular
determinant, including evaluation of all the $H_{ij}$- and $S_{ij}$-type integrals. Step 3 was
the conversion of the determinantal equation into its equivalent equation in powers of
E and solution for the roots E. Step 4 was the substitution of an E of interest into
the simultaneous equations that are related to the secular determinant and solution for
$c_1/c_2$. Finally, we used the normality requirement to arrive at convenient values for
$c_1$ and $c_2$.
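Steps 3 and 4 of this summary can be sketched for any real two-function basis. The helper names below (`two_by_two_roots`, `coefficients`) are ours, and the code assumes normalized basis functions with an optional overlap $S_{12}$ (zero for the 1s, $2p_z$ basis used here).

```python
import math

def two_by_two_roots(h11, h22, h12, s12=0.0):
    """Roots of (H11 - E)(H22 - E) - (H12 - E*S12)^2 = 0, lowest first."""
    a = 1.0 - s12 ** 2
    b = -(h11 + h22 - 2.0 * h12 * s12)
    c = h11 * h22 - h12 ** 2
    disc = math.sqrt(b * b - 4.0 * a * c)
    return (-b - disc) / (2.0 * a), (-b + disc) / (2.0 * a)

def coefficients(h11, h12, energy, s12=0.0):
    """Normalized (c1, c2) from c1(H11 - E) + c2(H12 - E*S12) = 0, taking c2 > 0.
    Not usable when H11 equals E, i.e., when the two functions do not mix."""
    ratio = -(h12 - energy * s12) / (h11 - energy)             # c1 / c2
    c2 = 1.0 / math.sqrt(ratio ** 2 + 1.0 + 2.0 * ratio * s12)
    return ratio * c2, c2

h12 = -(2.0 ** 7.5) / 3.0 ** 5 * 0.1                           # F = 0.1 a.u.
e_low, e_high = two_by_two_roots(-0.5, -0.125, h12)
print(e_low, e_high)
print(coefficients(-0.5, h12, e_low))                          # compare Eq. (7-72)
```

Because unrounded matrix elements are carried throughout, the coefficients may differ from Eqs. (7-67) to (7-72) in the fourth decimal place.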

There are many ways one could increase the flexibility of the trial function in an 
effort to increase the accuracy of the calculation. By adding additional basis functions, 
one would stay within the linear variation framework, merely increasing the size of the 
secular determinant. If these additional basis functions are of appropriate symmetries, 
they will cause the minimum energy root to be lowered further and will mix into the 
corresponding wavefunction to make it a better approximation to the lowest-energy 
eigenfunction for the system. Also, the additional functions will increase the number 
of roots E, thereby providing upper bounds for the energies of the third-, fourth-, etc. 
lowest states of the system. Another possibility is to allow nonlinear variation of the 
1s and $2p_z$ orbital exponents, in combination with linear variation. This would be more
involved than the calculation we have shown here, but could easily be accomplished 
with the aid of a computer. 


EXAMPLE 7-1 Suppose the variational process just described were performed
using $\psi(c_1, c_2, c_3) = c_1\, 1s + c_2\, 2p_0 + c_3\, 3d_0$. Would all three of these AOs be present
in the lowest-energy solution?


SOLUTION ► We already know that 1s and $2p_0$ will be mixed. They differ in reflection
symmetry in just the right manner to provide a wavefunction skewed in the z direction, and this is
manifested as a nonzero interaction element $H_{12}$. $3d_0$, however, is, like 1s, symmetric for reflection
through the x, y plane. Therefore, it cannot skew 1s in the z direction, and its interaction element
with 1s, $H_{13}$, equals zero (by symmetry). Therefore, it is tempting to think that $3d_0$ will not
contribute. However, because $3d_0$ and $2p_0$ have opposite reflection symmetries through the x, y
plane, they do interact, $H_{23}$ is not zero, and so all three basis functions show up in the lowest-energy
solution. 1s and $3d_0$ are indirectly linked because they are each directly linked to $2p_0$. Physically,
one can argue that a $2p_0$ AO that has been polarized by admixture with $3d_0$ can better polarize the
1s AO than can the pure $2p_0$ AO. (One could say that $3d_0$ is brought in "on the coat-tails" of $2p_0$.)
Note also that 1s has zero angular momentum along the z axis before the electric field is turned
on. The z-directed field distorts the ground state, but has no effect on the z component of angular
momentum. Therefore, only eigenfunctions having $m_l = 0$ can contribute to the polarized ground
state. ◄
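The "coat-tails" argument of Example 7-1 can be illustrated with a small numerical sketch. The diagonal elements and $H_{12}$ are those of the text at F = 0.1 a.u. ($H_{33} = -1/18$ a.u. is the unperturbed n = 3 eigenvalue), but the value used for $H_{23}$ is an arbitrary, hypothetical placeholder chosen only to be nonzero; it is not taken from the text. The function name `lowest_state` is ours.

```python
import math

def lowest_state(a, c, e, b, d, iterations=200):
    """Lowest root E and normalized (c1, c2, c3) of the secular problem

        | a-E   b    0  |
        |  b   c-E   d  |  = 0
        |  0    d   e-E |

    for an orthonormal basis with H13 = 0, assuming a < c, a < e, and b != 0."""
    def det(energy):
        return ((a - energy) * ((c - energy) * (e - energy) - d * d)
                - b * b * (e - energy))

    low = a - abs(b) - abs(d) - 1.0   # below every root (Gershgorin bound), so det > 0
    high = a                          # det(a) = -b^2 (e - a) < 0
    for _ in range(iterations):       # bisection on the sign of the determinant
        mid = 0.5 * (low + high)
        if det(mid) > 0.0:
            low = mid
        else:
            high = mid
    energy = 0.5 * (low + high)

    # Rows 1 and 3 of the simultaneous equations give c1 and c3 in terms of c2.
    c1 = -b / (a - energy)
    c3 = -d / (e - energy)
    norm = math.sqrt(c1 * c1 + 1.0 + c3 * c3)
    return energy, (c1 / norm, 1.0 / norm, c3 / norm)

H12 = -0.0744936                      # from Eq. (7-59) with F = 0.1 a.u.
H23_PLACEHOLDER = -0.2                # hypothetical value, for illustration only
print(lowest_state(-0.5, -0.125, -1.0 / 18.0, H12, 0.0))              # 3d0 decoupled
print(lowest_state(-0.5, -0.125, -1.0 / 18.0, H12, H23_PLACEHOLDER))  # 3d0 coupled to 2p0
```

With $H_{23} = 0$ the $3d_0$ coefficient vanishes and the two-function result is recovered. With any nonzero $H_{23}$ the $3d_0$ coefficient becomes nonzero and the lowest root drops, even though $H_{13}$ is zero throughout.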


7-5 Linear Combination of Atomic Orbitals: The $\mathrm{H}_2^+$ Molecule-Ion


We are now ready to consider using the linear variation method on molecular systems.
We begin with the simplest case, $\mathrm{H}_2^+$. This molecule-ion has enough symmetry so that
we could guess many important features of the solution without calculation. However,
to demonstrate the method, we shall first simply plunge ahead mathematically, and
discuss symmetry later.

The $\mathrm{H}_2^+$ system consists of two protons separated by a variable distance R, and a
single electron (see Fig. 7-5). The hamiltonian for this molecule is, in atomic units

$$\hat{H}(\mathbf{r}_A, \mathbf{r}_B, \mathbf{r}_1) = -\frac{1}{2}\left[\nabla_1^2 + \frac{\nabla_A^2}{1836} + \frac{\nabla_B^2}{1836}\right] - \frac{1}{r_A} - \frac{1}{r_B} + \frac{1}{R} \tag{7-74}$$

where $r_A$ and $r_B$ are the distances of the electron from protons A and B.


Since all the particles in the system are capable of motion, the exact eigenfunction
of $\hat{H}$ will be a function of the coordinates of the electron and the protons. However,
the protons are each 1836 times as heavy as the electron, and in states of chemical
interest their velocity is much smaller than that of the electron. This means that, to a
very good degree of approximation, the electron can respond instantly to changes in
internuclear separation. In other words, whenever the nuclei are separated by a given
distance R, no matter how they got there, the motion of the electron will always be
described in the same way by $\psi$. This means that we can separate the electronic and
nuclear motions with little loss of accuracy.³ Given an internuclear separation, we can
solve for the electronic wavefunction by ignoring nuclear motion [i.e., omitting $\nabla_A^2$ and
$\nabla_B^2$ in Eq. (7-74)]. This gives us a hamiltonian for the electronic energy and nuclear
repulsion energy of the system:

$$\hat{H}(\mathbf{r}_A, \mathbf{r}_B, \mathbf{r}_1) = -\frac{1}{2}\nabla_1^2 - \frac{1}{r_A} - \frac{1}{r_B} + \frac{1}{R} \tag{7-75}$$

Figure 7-5 ► The $\mathrm{H}_2^+$ molecule-ion. A and B are protons. The electron is numbered "1."

³ However, there are times when coupling of electronic and nuclear motions becomes important.

For a given internuclear separation R, the internuclear repulsion 1/R is a constant, and
we can omit it and merely add it on again after we have found the electronic energy. If
we let $\hat{H}_{\text{elec}}$ stand for the first three terms on the right-hand side of Eq. (7-75), we can
write

$$\hat{H}_{\text{elec}}\,\psi_{\text{elec}}(\mathbf{r}_1) = E_{\text{elec}}\,\psi_{\text{elec}}(\mathbf{r}_1) \tag{7-76}$$
and
$$E_{\text{elec}} + E_{\text{nuc rep}} = E_{\text{elec}} + 1/R \tag{7-77}$$


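Solving Eq. (7-76) for $E_{\text{elec}}$ for every value of R allows us to plot the electronic and
also the total energy of the system as a function of R. But $E_{\text{elec}}(R) + 1/R$ is just the
potential energy for nuclear motion. Therefore, this quantity can be inserted as the
potential in the hamiltonian operator for nuclear motion:

$$\hat{H}_{\text{nuc}} = -\frac{1}{2}\left[\frac{\nabla_A^2}{1836} + \frac{\nabla_B^2}{1836}\right] + E_{\text{elec}}(R) + \frac{1}{R} \tag{7-78}$$

$$\hat{H}_{\text{nuc}}(\mathbf{r}_A, \mathbf{r}_B)\,\psi_{\text{nuc}}(\mathbf{r}_A, \mathbf{r}_B) = E_{\text{nuc}}\,\psi_{\text{nuc}}(\mathbf{r}_A, \mathbf{r}_B) \tag{7-79}$$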


The eigenfunctions of Eq. (7-79) describe the translational, vibrational, and rotational 
states of the molecule. Note that the eigenvalues of the hamiltonian for nuclear motion 
are total energies for the system because they contain the electronic energy in their 
potential parts. Hence, 


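$$E_{\text{tot}} = E_{\text{nuc}} \tag{7-80}$$
But

$$\psi_{\text{tot}}(\mathbf{r}_A, \mathbf{r}_B, \mathbf{r}_1) = \psi_{\text{elec}}(\mathbf{r}_1)\,\psi_{\text{nuc}}(\mathbf{r}_A, \mathbf{r}_B) \tag{7-81}$$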


This approximation, that the electronic wavefunction depends only on the positions
of nuclei and not on their momenta, is called the Born-Oppenheimer approximation.⁴
Only to the extent that this approximation holds true is it valid, for example, to sep- 
arate electronic and vibrational wavefunctions and treat various vibrational states as 
a subset existing in conjunction with a given electronic state. We will assume the 
Born-Oppenheimer approximation to be valid in all cases treated in this book. 

Making the Born-Oppenheimer approximation for $\mathrm{H}_2^+$, we seek to solve for the
electronic eigenfunctions and eigenvalues with the nuclei fixed at various separation
distances R. We already know these solutions for the two extremes of R. When the two
nuclei are widely separated, the lowest-energy state is a 1s hydrogen atom and a distant
proton. (Since there is a choice about which nucleus is "the distant proton," there are
really two degenerate lowest-energy states.) When R = 0, the system becomes $\mathrm{He}^+$.
These two extremes are commonly referred to as the separated-atom and united-atom
limits, respectively.

⁴ It is analogous to the concept of reversibility in thermodynamics: The piston moves so slowly in the cylinder
that the gas can always maintain equilibrium, so pressure depends only on piston position; the nuclei move so
slowly in a molecule that the electrons can always maintain their optimum motion at each R, so electronic energy
depends only on nuclear position.

In carrying out a linear variation calculation on $\mathrm{H}_2^+$, our first problem is choice of
basis. In the separated-atom limit, the appropriate basis for the ground state would be a
1s atomic orbital (AO) on each proton. Then, regardless of which nucleus the electron
resided at, the basis could describe the wavefunction correctly. The appropriate basis
at the united-atom limit is a hydrogenlike 1s wavefunction with Z = 2. At intermediate
values of R, choice of an appropriate basis is less obvious. One possible choice is
a large number of hydrogenlike orbitals or, alternatively, Slater-type orbitals (STOs),
all centered at the molecular midpoint. Such a basis is capable of approximating the
exact wavefunction to a high degree of accuracy, provided a sufficiently large number
of basis functions is used.⁵ Calculations using such a basis are called single-center
expansions. A different basis, and one that is much more popular among chemists,
is the separated-atom basis: a 1s hydrogen AO centered on each nucleus. At finite
values of R, this basis can produce only an approximation to the true wavefunction.
One way to improve this approximation is to allow additional AOs on each nucleus, 2s,
2p, 3s, etc., thereby increasing the mathematical flexibility of the basis. If we restrict
our basis to AOs that are occupied in the separated-atom limit ground state (the 1s AOs
in this case), then we are performing what is called a minimal basis set calculation. For
now, we will use a minimal basis set.

The wavefunction that we produce by linear variation will extend over the whole
$\mathrm{H}_2^+$ molecule, and its square will tell us how the electron density is distributed in the
molecule. Hence, the one-electron molecular wavefunction is referred to as a molecular
orbital (MO) just as the one-electron atomic wavefunction is referred to as an atomic
orbital (AO). With a basis set of the type we have selected, the MOs are expressed as
linear combinations of AOs. For this reason, this kind of calculation is referred to as
a minimal basis set linear combination of atomic orbitals-molecular orbital (LCAO-MO)
calculation.

Our second problem, now that we have selected a basis, is construction of the secular
determinant. Since we have only two basis functions ($1s_A$, $1s_B$), we expect a 2 x 2
determinant:

$$
\begin{vmatrix} H_{AA} - \bar{E}S_{AA} & H_{AB} - \bar{E}S_{AB} \\ H_{BA} - \bar{E}S_{BA} & H_{BB} - \bar{E}S_{BB} \end{vmatrix} = 0 \tag{7-82}
$$

Here we have used the notation developed earlier, where

$$H_{AB} = \int 1s_A^*(1)\, \hat{H}(1)\, 1s_B(1)\, d\tau(1) \tag{7-83}$$


etc. $\bar{E}$ of Eq. (7-82) is the average value of the energy. Henceforth, the bar is omitted.
If we take our basis functions to be normalized, $S_{AA} = S_{BB} = 1$. Since our basis
functions and hamiltonian are all real, their integrals will be real. Therefore, $S_{AB} = S_{BA}$
and $H_{AB} = H_{BA}^* = H_{BA}$. Since the hamiltonian is invariant to an interchange of the
labels A and B, it follows that $H_{AA} = H_{BB}$. ($H_{AA}$ is the energy of an electron when
it is in a 1s AO on one side of the molecule, $H_{BB}$ when it is on the other side.) This
leaves us with

$$
\begin{vmatrix} H_{AA} - E & H_{AB} - ES_{AB} \\ H_{AB} - ES_{AB} & H_{AA} - E \end{vmatrix} = 0 \tag{7-84}
$$

⁵ The hydrogenlike orbitals are a complete set if the continuum functions are included. Hence, this set can
allow one to approach arbitrarily close to the exact eigenfunction and eigenvalue. The Slater-type orbitals do not
constitute a complete set.


For Eq. (7-84) to be satisfied, the product of the diagonal terms must equal that of the
off-diagonal terms, which means that

$$H_{AA} - E = \pm(H_{AB} - ES_{AB}) \tag{7-85}$$

This gives two values for E:

$$E_\pm = \frac{H_{AA} \pm H_{AB}}{1 \pm S_{AB}} \tag{7-86}$$


To arrive at numerical values for $E_+$ and $E_-$ requires that we choose a value for R
and explicitly evaluate $H_{AA}$, $H_{AB}$, and $S_{AB}$. We know in advance that $S_{AB}$ increases
monotonically from zero at R = ∞ to unity at R = 0 because $1s_A$ and $1s_B$ are each
normalized and everywhere positive. $H_{AA}$ is the average energy of an electron in a 1s
AO on nucleus A, subject also to an attraction by nucleus B. Hence, $H_{AA}$ should be
lower than the energy of the isolated H atom ($-\frac{1}{2}$ a.u.) whenever R is finite. $H_{AB}$ is
easily expanded to

$$H_{AB} = \int 1s_A \left(-\frac{1}{2}\nabla^2 - \frac{1}{r_B}\right) 1s_B\, dv + \int 1s_A \left(-\frac{1}{r_A}\right) 1s_B\, dv \tag{7-87}$$


The operator in the first integrand is simply the hamiltonian operator for a hydrogen
atom centered at nucleus B. This operator operates on $1s_B$ to give $-\frac{1}{2}\,1s_B$. Hence,
the first integral becomes simply $-\frac{1}{2}S_{AB}$. The second integral in Eq. (7-87) gives the
attraction between a nucleus and the "overlap charge." Thus, $H_{AB}$ is zero at R = ∞ and
negative for finite R. The formulas for these terms are (after nontrivial mathematical
evaluation)

$$S_{AB} = \int 1s_A\, 1s_B\, dv = \exp(-R)\left[1 + R + R^2/3\right] \tag{7-88}$$

$$H_{AA} = \int 1s_A\, \hat{H}_{\text{elec}}\, 1s_A\, dv = -\frac{1}{2} - (1/R)\left[1 - e^{-2R}(1 + R)\right] \tag{7-89}$$

$$H_{AB} = \int 1s_A\, \hat{H}_{\text{elec}}\, 1s_B\, dv = -S_{AB}/2 - e^{-R}(1 + R) \tag{7-90}$$


When R = 2 a.u., $S_{AB} = 0.586$, $H_{AA} = -0.972$ a.u. and $H_{AB} = -0.699$ a.u. Inserting
these values into Eq. (7-86) gives $E_+ = -1.054$ a.u. and $E_- = -0.661$ a.u. These
are electronic energies. Internuclear repulsion energy ($+\frac{1}{2}$ a.u.) can be added to these
values to give −0.554 a.u. and −0.161 a.u., respectively.
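The numbers quoted for R = 2 a.u. follow directly from Eqs. (7-86) and (7-88) to (7-90). A minimal sketch (the function names are ours) that reproduces them:

```python
import math

def overlap(r):
    """S_AB of Eq. (7-88) for two 1s AOs a distance r apart (a.u.)."""
    return math.exp(-r) * (1.0 + r + r * r / 3.0)

def h_aa(r):
    """H_AA of Eq. (7-89)."""
    return -0.5 - (1.0 - math.exp(-2.0 * r) * (1.0 + r)) / r

def h_ab(r):
    """H_AB of Eq. (7-90)."""
    return -0.5 * overlap(r) - math.exp(-r) * (1.0 + r)

def electronic_energies(r):
    """E_plus and E_minus of Eq. (7-86), without internuclear repulsion."""
    s = overlap(r)
    return (h_aa(r) + h_ab(r)) / (1.0 + s), (h_aa(r) - h_ab(r)) / (1.0 - s)

R = 2.0
e_plus, e_minus = electronic_energies(R)
print(f"S_AB = {overlap(R):.3f}  H_AA = {h_aa(R):.3f}  H_AB = {h_ab(R):.3f}")
print(f"E+ = {e_plus:.3f}  E- = {e_minus:.3f}")
print(f"E+ + 1/R = {e_plus + 1.0 / R:.3f}  E- + 1/R = {e_minus + 1.0 / R:.3f}")
```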

The ways in which $H_{AA}$, $H_{AB}$, $S_{AB}$, and 1/R contribute to the energy are illustrated
for R = 2 a.u. in Fig. 7-6. $H_{AA}$ is lower than the energy of an isolated H atom because
the electron experiences additional nuclear attraction at R = 2. The effect of the $H_{AB}$
interaction element is to split the energy into two levels equally spaced above and below
$H_{AA}$. The $S_{AB}$ term has the effect of partially negating this splitting. The internuclear
repulsion energy 1/R merely raises each level by $\frac{1}{2}$ a.u. The lower energy, $E_+ + 1/R$,
has a final value that is lower than the separated-atom energy of $-\frac{1}{2}$ a.u. Since the exact
energy at R = 2 must be as low or lower than our value of −0.554 a.u., we can conclude
that the $\mathrm{H}_2^+$ molecule-ion has a state that is stable, with respect to dissociation into
H + H⁺, by at least 0.054 a.u., or 1.47 eV, or 33.9 kcal/mole, neglecting vibrational
energy effects.

Figure 7-6 ► Contributions to the energy of $\mathrm{H}_2^+$ at R = 2 a.u. in a minimal basis LCAO-MO
calculation.

The data depicted in Fig. 7-6 are sometimes presented in the abbreviated form of 
Fig. 7-7. The energy levels for the pertinent AOs of the separated atoms are indicated on 
the left and right, and the final energies (either electronic or electronic plus internuclear) 
are shown in the center. 
Figure 7-7 ► Separated atom energies and energies at an intermediate R for $\mathrm{H}_2^+$.

Figure 7-8 ► $E_\pm + 1/R$ (in a.u.) versus R for $\mathrm{H}_2^+$. (solid line) Calculation described in text. (- - -) Exact
calculation.


The behavior of these energies as a function of R is plotted in Fig. 7-8. Included 
for comparison are the exact energies for the two lowest-energy states of $\mathrm{H}_2^+$. Only
the lower of these shows stability with respect to molecular dissociation. Both energy 
levels approach infinity asymptotically as R approaches zero because of internuclear 
repulsion. (The zero of energy corresponds to complete separation of the protons and 
electron.) 
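The approximate curves of Fig. 7-8 can be tabulated from Eqs. (7-86) and (7-88) to (7-90). The sketch below (names ours) scans R on a coarse grid and reports where the minimal-basis $E_+ + 1/R$ curve is lowest. It does not reproduce the exact curves, which require a separate calculation.

```python
import math

def total_energies(r):
    """(E_plus + 1/R, E_minus + 1/R) in a.u. from Eqs. (7-86), (7-88)-(7-90)."""
    s = math.exp(-r) * (1.0 + r + r * r / 3.0)
    haa = -0.5 - (1.0 - math.exp(-2.0 * r) * (1.0 + r)) / r
    hab = -0.5 * s - math.exp(-r) * (1.0 + r)
    return (haa + hab) / (1.0 + s) + 1.0 / r, (haa - hab) / (1.0 - s) + 1.0 / r

distances = [0.5 + 0.01 * i for i in range(951)]          # R from 0.5 to 10.0 a.u.
r_min = min(distances, key=lambda r: total_energies(r)[0])
e_min = total_energies(r_min)[0]
print(f"lowest E+ + 1/R = {e_min:.4f} a.u. at R = {r_min:.2f} a.u.")
for r in (1.0, 2.0, 3.0, 5.0, 10.0):
    bonding, antibonding = total_energies(r)
    print(f"R = {r:5.2f}   E+ + 1/R = {bonding:8.4f}   E- + 1/R = {antibonding:8.4f}")
```

The minimum of the lower curve lies somewhat below the R = 2 value quoted above, so the minimal-basis estimate of the binding energy is slightly larger than 0.054 a.u. when R is optimized.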

Having found the roots $E_\pm$ for the secular determinant, we can now solve for the
coefficients which describe the approximate wavefunctions in terms of our basis set.
Let us first find the approximate wavefunction corresponding to the lower energy. To do
this, we substitute the expression for $E_+$ [Eq. (7-86)] into the simultaneous equations
associated with the secular determinant (7-84):

$$c_A(H_{AA} - E_+) + c_B(H_{AB} - E_+S_{AB}) = 0 \tag{7-91}$$
$$c_A(H_{AB} - E_+S_{AB}) + c_B(H_{AA} - E_+) = 0 \tag{7-92}$$

Equation (7-91) leads to

$$c_A\left[H_{AA} - \frac{H_{AA} + H_{AB}}{1 + S_{AB}}\right] = -c_B\left[H_{AB} - \frac{(H_{AA} + H_{AB})S_{AB}}{1 + S_{AB}}\right] \tag{7-93}$$

which ultimately gives

$$c_A = c_B \tag{7-94}$$

The same procedure for $E_-$ produces the result

$$c_A = -c_B \tag{7-95}$$


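The normality requirement is

$$\int \psi_\pm^2\, dv = 1 = \int (c_A 1s_A + c_B 1s_B)^2\, dv = c_A^2 \overbrace{\int 1s_A^2\, dv}^{1} + c_B^2 \overbrace{\int 1s_B^2\, dv}^{1} + 2c_Ac_B \overbrace{\int 1s_A 1s_B\, dv}^{S_{AB}} \tag{7-96}$$

so that

$$c_A^2 + c_B^2 + 2c_Ac_BS_{AB} = 1 \tag{7-97}$$

For $c_A = c_B$, this gives

$$c_A = 1/[2(1 + S_{AB})]^{1/2} = c_B \tag{7-98}$$

For $c_A = -c_B$,

$$c_A = 1/[2(1 - S_{AB})]^{1/2} = -c_B \tag{7-99}$$

Therefore, the LCAO-MO wavefunction corresponding to the lower energy $E_+$ is

$$\psi_+ = \frac{1}{\sqrt{2(1 + S_{AB})}}\,(1s_A + 1s_B) \tag{7-100}$$

The higher-energy solution is

$$\psi_- = \frac{1}{\sqrt{2(1 - S_{AB})}}\,(1s_A - 1s_B) \tag{7-101}$$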


Just as was true for AOs, there are a number of ways to display these MOs pictorially.
One possibility is to plot the value of $\psi_\pm$ or $\psi_\pm^2$ along a ray passing through both nuclei.
(Other rays could also be chosen if we were especially interested in other regions.)
Another approach is to plot contours of $\psi$ or $\psi^2$ on a plane containing the internuclear
axis. Still another way is to sketch a three dimensional view of a surface of constant
value of $\psi$ or $\psi^2$ containing about 90-95% of the wavefunction or the electron charge.
Finally, one can plot the value of $\psi$ as distance above or below the plane containing
the internuclear axis. All of these schemes are shown in Fig. 7-9 for $\psi_+$ and $\psi_-$.
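The first of these displays, the value of $\psi_\pm$ along the internuclear axis, is simple to generate from Eqs. (7-100) and (7-101). The sketch below assumes R = 2 a.u., places the nuclei at z = ±R/2, and uses the normalized hydrogen 1s function $\pi^{-1/2}\exp(-r)$; the function names are ours.

```python
import math

R = 2.0                                           # internuclear distance, a.u.
S = math.exp(-R) * (1.0 + R + R * R / 3.0)        # overlap, Eq. (7-88)

def one_s(distance):
    """Normalized hydrogen 1s AO at a given distance from its nucleus."""
    return math.exp(-distance) / math.sqrt(math.pi)

def psi(z, sign):
    """psi_plus (sign = +1) or psi_minus (sign = -1) at position z on the axis.
    Nuclei A and B sit at z = -R/2 and z = +R/2."""
    norm = 1.0 / math.sqrt(2.0 * (1.0 + sign * S))
    return norm * (one_s(abs(z + R / 2.0)) + sign * one_s(abs(z - R / 2.0)))

for z in (-2.0, -1.0, -0.5, 0.0, 0.5, 1.0, 2.0):
    print(f"z = {z:5.2f}   psi+ = {psi(z, +1):8.4f}   psi- = {psi(z, -1):8.4f}")
```

The printed values show $\psi_+$ unchanged and $\psi_-$ changing sign under z → −z, with $\psi_-$ vanishing at the midpoint.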
Figure 7-9 ► (a) Plot of $\psi_+$ along the z axis [Eq. (7-100)]. (b) Plot of $\psi_-$ along the z axis
[Eq. (7-101)].


The wavefunctions $\psi_+$ and $\psi_-$ may be seen from Fig. 7-9 to be, respectively, symmetric
and antisymmetric for inversion through the molecular midpoint. [They are
commonly called gerade (German for even) and ungerade, respectively.] This would
be expected for nondegenerate eigenfunctions of the hamiltonian, since it is invariant
to inversion. But $\psi_+$ and $\psi_-$ are not eigenfunctions. They are only approximations
to eigenfunctions. How is it that they show the proper symmetry characteristics of
eigenfunctions? The reason is that the symmetry of the $\mathrm{H}_2^+$ molecule is manifested as
a symmetry in our secular determinant of Eq. (7-84). Note that the determinant is
symmetric for reflection across either diagonal. The symmetry for reflection through the
principal diagonal (which runs from upper left to lower right) is due to the hermiticity
of $\hat{H}$,⁶ and is always present in the secular determinant for any molecule regardless of
symmetry. Symmetry for reflection through the other diagonal is due to the fact that
the hamiltonian is invariant to inversion and also to the fact that the basis functions at
the two ends of the molecule are identical. When the AOs in a basis set are interchanged
(times ±1) by a symmetry operation of the molecule, the basis set is said to be balanced
for that operation. Thus, a $1s_A$ and $1s_B$ basis is balanced for inversion in $\mathrm{H}_2^+$, but a
$1s_A$ and $2s_B$ basis is not. Whenever a symmetry-balanced basis is used for a molecule,
the symmetry of the molecule is manifested in the secular determinant and ultimately
leads to approximate MOs that show the proper symmetry characteristics.

⁶ Hermiticity requires that $H_{ij} = H_{ji}^*$. If $H_{ij}$ is imaginary, the determinant will be antisymmetric for reflection
through the principal diagonal.

Figure 7-9 ► (Continued) (c) Contour diagram of $\psi_+$. (d) Contour diagram of $\psi_-$.

Since the eigenfunctions for $\mathrm{H}_2^+$ must be gerade or ungerade, and since we started
with the simple balanced basis $1s_A$ and $1s_B$, it should be evident that it is unnecessary
to go through the linear variation procedure for this case. With such a simple basis
set, there is only one possible gerade linear combination of AOs, namely $1s_A + 1s_B$.
Similarly, $1s_A - 1s_B$ is the only possible ungerade combination. Therefore, we could
have used symmetry to guess our solutions at the outset. Usually, however, we are not so
limited in our basis. We shall see that, while symmetry is useful in such circumstances,
it does not suffice to produce the variationally best solution.
Figure 7-9 ► (Continued) (e) Three-dimensional sketch of contour envelope for $\psi_+$ and
(f) for $\psi_-$.


According to the theorem mentioned earlier (but not proved), the nth lowest root
of a linear variation calculation for a state function must lie above the nth lowest
exact eigenvalue for the system. However, the two states we are dealing with have
different symmetries. In such a case, a more powerful boundedness theorem holds,
one that holds even if we are not using a linear variation procedure. To prove this,
we first recognize that every $\mathrm{H}_2^+$ eigenfunction is either gerade (i.e., symmetric for
inversion) or ungerade. Since the lowest-energy approximate wavefunction $\psi_+$ is
gerade, it must be expressible as a linear combination of these gerade eigenfunctions.
Hence, its average energy $E_+$ cannot be lower than the lowest eigenvalue for the
gerade eigenfunctions. Similarly, $E_-$ cannot be lower than the lowest eigenvalue for
the ungerade eigenfunctions, and so we have a separate lower bound for the average
energy of trial functions of each symmetry type. For this reason, our approximate
energies in Fig. 7-8 must lie above the exact energies for both states. If we were to
make further efforts to lower the average energy of the ungerade function, even by
going outside the linear variation procedure, we could never fall below the exact energy
for the lowest-energy ungerade state unless we somehow allowed our trial function to
change symmetry. This means that a lowest average energy criterion can be used in
attempting to find the lowest-energy state of each symmetry type for a system, by either
linear or nonlinear variation methods.
Figure 7-9 ► (Continued) (g) Value of $\psi_+$ and (h) of $\psi_-$ versus position in plane containing the
nuclei at points a and b.


EXAMPLE 7-2 Suppose one were to do a variational calculation on the hydrogen
atom using the unnormalized trial wavefunction $\phi = \exp(-\alpha r^2)\cos\theta$, with $\alpha$ being
varied. What lower bound could we expect for the average energy?

SOLUTION ► This function has a node in the x, y plane (where $\theta = \pi/2$), and this nodal plane
exists regardless of the value of $\alpha$. Therefore, our trial function is like the $2p_0$ AO in symmetry. It
will be represented by a linear combination of $p_0$ AOs, a combination that changes as $\alpha$ is varied.
The average value of energy cannot be lower than the lowest eigenvalue in the set, which is −1/8 a.u.
(for the $2p_0$ AO). ◄
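The bound in Example 7-2 can be checked numerically. The closed-form average energy used below, $\bar{E}(\alpha) = \frac{11}{2}\alpha - 2(2\alpha/\pi)^{1/2}$ a.u., is not given in the text; it is our own result from standard Gaussian integrals, treating the trial function as a radial factor $\exp(-\alpha r^2)$ with l = 1 angular dependence (radial kinetic term $3\alpha/2$, centrifugal term $4\alpha$, nuclear attraction $-2(2\alpha/\pi)^{1/2}$). Treat it as an assumption of this sketch.

```python
import math

def average_energy(alpha):
    """Average energy (a.u.) of exp(-alpha r^2) cos(theta) for the hydrogen atom.
    Closed form derived for this sketch, not taken from the text."""
    kinetic = 1.5 * alpha + 4.0 * alpha            # radial part + l(l+1)/(2 r^2) with l = 1
    potential = -2.0 * math.sqrt(2.0 * alpha / math.pi)
    return kinetic + potential

alphas = [0.0005 * i for i in range(1, 2001)]      # alpha from 0.0005 to 1.0
best_alpha = min(alphas, key=average_energy)
best_energy = average_energy(best_alpha)
print(f"best alpha = {best_alpha:.4f}, lowest average energy = {best_energy:.5f} a.u.")
print("stays above the 2p0 eigenvalue of -0.125 a.u.:", best_energy >= -0.125)
```

Under these assumptions the optimized energy stays above −1/8 a.u., as the symmetry argument requires, no matter how $\alpha$ is varied.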


Inspection of Fig. 7-9 shows that the charge distribution for the state described by
$\psi_+$ is augmented at the molecular midpoint compared to the charge due to unperturbed
atoms. This state is also the one that gives $\mathrm{H}_2^+$ stability at finite R. Because this MO
puts electronic charge into the bond region and stabilizes the molecule, it is commonly
called a bonding MO. In the state described by $\psi_-$, charge is shifted out of the bond
region and the molecule is unstable at finite R, and so $\psi_-$ is called an antibonding MO.

Because the potential in $\mathrm{H}_2^+$ (or in any linear molecule) is independent of $\phi$, the
angle about the internuclear axis, the $\phi$ dependence of the wavefunctions is always of
the form

$$\Phi(\phi) = (1/\sqrt{2\pi})\exp(im\phi), \qquad m = 0, \pm 1, \pm 2, \ldots \tag{7-102}$$


This fact may be arrived at in two ways. One way is to write down the Schrödinger
equation for $\mathrm{H}_2^+$ using spherical polar coordinates or elliptical coordinates. ($\phi$ is a
coordinate in each of these coordinate systems.) Then one attempts to separate
coordinates and finds that the $\phi$ coordinate is indeed separable from the others and yields
the equation

$$\frac{d^2\Phi(\phi)}{d\phi^2} = -m^2\,\Phi(\phi) \tag{7-103}$$


The acceptable solutions of this are the functions (7-102). The other approach is to note
that, since the potential in $\hat{H}$ has no $\phi$ dependence, $\hat{H}$ commutes with $\hat{L}_z$, the angular
momentum operator, so that the eigenfunctions of $\hat{H}$ are simultaneously eigenfunctions
of $\hat{L}_z$. We know the eigenfunctions of $\hat{L}_z$ are the functions (7-102), and thus, we know
that these functions must also give the $\phi$ dependence of the eigenfunctions of $\hat{H}$.
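Both properties of the functions (7-102) used here, that they satisfy Eq. (7-103) and that they are normalized and mutually orthogonal over 0 to 2π, can be verified numerically. The sketch below uses a central finite difference and a uniform-grid sum; the helper names are ours.

```python
import cmath
import math

def phi_m(m, angle):
    """Phi_m(phi) = (2 pi)^(-1/2) exp(i m phi), Eq. (7-102)."""
    return cmath.exp(1j * m * angle) / math.sqrt(2.0 * math.pi)

def second_derivative(m, angle, step=1e-4):
    """Central finite-difference estimate of d^2 Phi_m / d phi^2."""
    return (phi_m(m, angle + step) - 2.0 * phi_m(m, angle)
            + phi_m(m, angle - step)) / step ** 2

def overlap(m1, m2, points=2000):
    """Integral of conj(Phi_m1) Phi_m2 over 0 to 2 pi on a uniform periodic grid."""
    width = 2.0 * math.pi / points
    return sum(phi_m(m1, k * width).conjugate() * phi_m(m2, k * width)
               for k in range(points)) * width

m, angle = 2, 0.7
print(second_derivative(m, angle) / phi_m(m, angle))   # close to -m**2 = -4
print(abs(overlap(2, 2)), abs(overlap(1, -1)))         # close to 1 and 0
```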

Two important conclusions emerge. First, each nondegenerate $\mathrm{H}_2^+$ wavefunction
must have a $\phi$ dependence given by one of the functions (7-102). This tells us something
about the shapes of the wavefunctions. Second, the nondegenerate $\mathrm{H}_2^+$ wavefunctions
are eigenfunctions of $\hat{L}_z$, which means that an electron in any one of these states
has a definite, unvarying (i.e., sharp) component of angular momentum of value $m\hbar$ in
mks units (m a.u.) along the internuclear axis. It is thus useful to know the m value
associated with a given one-electron wavefunction. A standard notation is used, which
is analogous to the atomic orbital notation wherein s, p, d, f, correspond to l values of
0, 1, 2, 3, respectively. The corresponding Greek lower-case letters σ, π, δ, φ indicate
values of |m| of 0, 1, 2, 3, respectively, in one-electron orbitals of linear molecules.
Thus, for the case at hand, $\psi_+$ and $\psi_-$ are both σ MOs because they are cylindrically
symmetrical (i.e., no $\phi$ dependence), which requires that m be zero. Because $\psi_+$ is a
gerade function, it is symbolized $\sigma_g$. $\psi_-$, then, is a $\sigma_u$ MO. (A simple way to determine
whether an MO is σ, π, or δ is to imagine viewing it (in its real form) end-on (i.e., along
a projection of the internuclear axis). If the MO from this view looks like an s AO,
it is a σ MO. If it looks like a p AO, it is a π MO. A δ MO has the four-leaf-clover
appearance of a d AO.)

Let us now examine the dependence of our LCAO-MO results on our original choice
of basis. The 1s AOs we have used are capable of giving the exact energy when
R = ∞, but become increasingly inadequate as R decreases. As a result, Fig. 7-8
shows that, for both states, the approximate energy deviates more and more from the
exact energy as R decreases. At R = 0, our σg function becomes a single 1s H atom
(Z = 1) AO centered on a doubly positive nucleus. Yet we know that the lowest-energy
eigenfunction for that situation is a single 1s He+ (Z = 2) AO. An obvious
way to improve our wavefunction, then, is to allow the 1s basis functions to change
their orbital exponents as R changes. This adds a nonlinear variation, and the
calculation is more complicated. It is very easily performed with the aid of a computer,
however, and we summarize the results in Figs. 7-10 and 7-11. The internuclear
repulsion has been omitted from the energies in Fig. 7-10. The lowest approximate energy
curve is now in perfect agreement with the exact electronic energy both at R = 0 and
R = ∞, and shows improved, though still not perfect, agreement at intermediate R.
The R dependence of the orbital exponent for this wavefunction (Fig. 7-11) varies
smoothly from 1 at R = ∞ to 2 at R = 0, as expected. In contrast, the σu function
fails to reach a well-defined energy at R = 0 because the function becomes
indeterminate at that point. [At R = 0, 1sA − 1sB becomes (1sA − 1sA) = 0.] However,
Figs. 7-10 and 7-11 indicate that the exact energy E− is −1/2 a.u. at R = 0,
corresponding to the n = 2 level of He+, and that the orbital exponent in our 1s basis
functions approaches 0.4 in the effort to approximate this state function at small R.
To understand this behavior, we must once again consider the symmetries of these
functions.
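The two limiting exponents quoted above can be checked with the standard one-center variational result: for a normalized 1s-type function exp(−ζr) around a nucleus of charge Z, the kinetic energy is ζ²/2 and the nuclear attraction is −Zζ (atomic units). The following is a minimal sketch under that assumption; the function names and the scan grid are illustrative choices, not part of the text's treatment.

```python
def energy_1s(zeta, Z):
    """Energy (a.u.) of a normalized exp(-zeta*r) function around a charge Z."""
    return 0.5 * zeta**2 - Z * zeta


def best_zeta(Z, lo=0.1, hi=4.0, steps=3901):
    """Scan zeta on a uniform grid and return the value giving the lowest energy."""
    grid = [lo + i * (hi - lo) / (steps - 1) for i in range(steps)]
    return min(grid, key=lambda z: energy_1s(z, Z))


# R -> infinity: electron on one proton (Z = 1); R -> 0: united atom He+ (Z = 2)
for Z in (1, 2):
    z = best_zeta(Z)
    print(Z, round(z, 3), round(energy_1s(z, Z), 3))

# Fixed zeta = 1 placed on the united-atom nucleus (Z = 2)
print(energy_1s(1.0, 2))
```

With ζ held at 1 the united-atom energy is −1.5 a.u., whereas letting ζ relax to 2 recovers the exact He+ value of −2 a.u., which is the behavior the variable-exponent curve shows at R = 0.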

We have already seen that the nondegenerate eigenfunctions of H2+ must be either
gerade or ungerade, and we note in Fig. 7-10 that the energy curve for the gerade state 
is continuous as is the one for the ungerade state. In other words, as we move along 
a given curve, we are always referring to a wavefunction of the same symmetry. This 
continuity of symmetry along an energy curve is central to many applications of quan- 
tum chemistry. The reason for continuity of symmetry can be seen by considering a 
molecule having some element of symmetry, and having a nondegenerate wavefunction 
or molecular orbital (which must be symmetric or antisymmetric with respect to the 
symmetry operations of the molecule). If we change the molecule infinitesimally (with- 
out destroying its symmetry), we expect the wavefunction to change infinitesimally also. 


[Plot for Figure 7-10: electronic energy (a.u.) versus R (a.u.).]

Figure 7-10 » E+ and E− for H2+ from (- - -) exact, (---) variable ζ, and (solid line) fixed ζ (ζ = 1)
treatments.

Figure 7-11 » Values of ζ minimizing E+ and E− as a function of R.


In particular it should not change symmetry because this is generally not an infinitesi- 
mal change. (To change symmetry requires adding or removing nodes, changing signs 
in parts of the function. Such changes have more than infinitesimal effects on ψ and
on kinetic and potential parts of the energy.) The entire curve can be traversed by an 
infinite number of such infinitesimal but symmetry conserving steps. 

The continuity of symmetry enables the σg state of H2+ to correlate with an s-type
AO of He+ as R goes to zero. This correlation is symmetry allowed because the
s-type AOs of He+ have the proper symmetry characteristics: they are gerade and
have no dependence on angle about the axis that is the internuclear axis when R > 0
(see Fig. 7-12). In contrast, the σu MO cannot correlate with an s-type AO. It must
correlate with an AO that is cylindrically symmetrical about the old internuclear axis
but is antisymmetric for inversion. A p-type AO pointing along the old internuclear
axis (called a pσ AO) satisfies these requirements.


[Sketches for Figure 7-12: σg functions drawn along the old internuclear axis, ending in the
united-atom 1s AO, and σu functions ending in the united-atom 2p AO pointing along that axis.]

Figure 7-12 » Sketches demonstrating how separated atom functions can be related to united-atom
functions through symmetry invariance. These functions are not drawn to a common scale.
These symmetry requirements help us understand the σu curves of Figs. 7-10 and
7-11. The exact energy for E− goes to −1/2 a.u. at R = 0 because the σu state of H2+
correlates with a 2p AO of He+. Our basis set is incapable of reproducing a 2p AO at
R = 0, and so the calculated energy curve fails to rejoin the exact curve at R = 0. At
small R, our basis set is attempting to approximate the two lobes of an evolving 2p
AO. Apparently the orbital exponent that best enables the basis to accomplish this is
around 0.4.


7-6 Molecular Orbitals of Homonuclear Diatomic Molecules


We have already seen how one produces the ground configurations for many-electron
atoms by placing pairs of electrons of opposite spin into AOs, starting
with the lowest-energy AO and working up. Subsequent manipulation of this
product function to produce proper space and spin symmetry yields the approximate
wavefunction. Precisely the same procedure is used for molecules. Thus,
the electronic configurations for H2+, H2, and H2− are 1σg, (1σg)², and (1σg)²1σu,
respectively, and the approximate wavefunction for H2 is provided by the Slater
determinant |1σg(1) 1σ̄g(2)|. When we come to consider heavier homonuclear
diatomic molecules, such as O2, we must place electrons in higher energy MOs.
Such MOs are still provided by the minimal basis set, which now includes 1s, 2s,
and three 2p AOs on each atom since these AOs are occupied in the separated
atoms. We now consider the natures of the additional MOs produced by this larger
basis.

We begin by making a change to a basis set that is mathematically equivalent to the
starting set but is more convenient for discussing and analyzing the problem. This new
set is the set of symmetry orbitals (SOs) (1sA ± 1sB), (2sA ± 2sB), etc. From our
original ten AOs, we thus produce ten SOs. These may be normalized, if desired. Each of
these SOs has definite symmetry. The SOs built from 2s AOs must be of σg and σu
symmetry since the 2s AOs act like 1s AOs for all the symmetry operations of the molecule.
The 2p AOs pointing along the internuclear axis have cylindrical symmetry and hence
also give rise to a σg and a σu SO. (We will take the internuclear axis to be coincident
with the z axis, and so these SOs are constructed from 2pσ (or 2pz) AOs.) The functions
(2p±1,A ± 2p±1,B) are π SOs because |m| = 1. Since it is difficult to visualize complex
functions, however, the usual practice is to take linear combinations of the complex
π functions to produce a corresponding set of real functions. (This is completely
analogous to forming real px and py AOs from complex p+1 and p−1 AOs.) Thus, we obtain
(2pxA ± 2pxB) and (2pyA ± 2pyB), which are not eigenfunctions for the L̂z operator
anymore, but are still given the symbol π. From Fig. 7-13, we see that the positive
combinations give ungerade π SOs. This is just the opposite of the case for σ-type SOs.
We conclude from Fig. 7-13 that σg and πu SOs will tend to place charge into the bond
and hence contribute to bonding, whereas σu and πg SOs will contribute antibonding
character.

Even though these SOs are only a basis set, the reader may nevertheless recognize
that conversion to this symmetrized basis goes a long way toward producing our ultimate
MOs. Indeed, our (1sA ± 1sB) SOs, if normalized, are the same as the MOs we obtained
[Sketches for Figure 7-13: symmetry orbitals built from s- and p-type AOs on A and B, panels (a) and (b).]

Figure 7-13 » Symmetry orbitals constructed from s- and p-type AOs. (a) Sketches according to
an idealized convention that ignores overlap between AOs on A and B. (b) Effects of overlap. Note
that the pσA − pσB combination is bonding. This depends on our having chosen a common z axis for
both atoms. Sometimes the z axes are chosen to point from each atom toward the other. In that case,
pσA − pσB becomes antibonding.


for H2+. The essential advantage of a symmetrized basis set is that it simplifies the
secular determinant and makes it easier to understand and describe the mixing of the
basis functions by the hamiltonian. For instance, since our MOs must have pure σ,
π, δ, ... and also g or u symmetry, and since our SOs are already of pure symmetry,
we expect no further mixing to occur between SOs of different symmetry in forming
MOs. This suggests that the interaction element Hij between SOs φi and φj of different
symmetry should vanish. This is easily proved by noting that Ĥ is symmetric for
all symmetry operations of the molecule, and, if φi and φj differ in symmetry for
some operation, their product is antisymmetric for that operation, and therefore φi*Ĥφj
is antisymmetric and its integral vanishes. Similarly, Sij vanishes, and Hij − ESij
vanishes except in positions connecting basis functions of identical symmetry. As a
result, our secular determinant over SOs has the form (7-104), where the notation σg[1s]
indicates a σg SO made from 1sA, 1sB AOs, etc. By placing basis functions of like
symmetry together, we have emphasized the block-diagonal form of our determinant,
all elements in the nonshaded areas being zero by symmetry. (The πu[2px] and πu[2py]
do not interact because of symmetry disagreements for reflection in the xz and yz
planes.) Each of the nonzero blocks of (7-104) is a separate determinant (which is
just a number), and the value of determinant (7-104) is simply the product of these six
smaller determinants. Hence, if any one of these small determinants is zero, the large
determinant is zero, thereby satisfying our determinantal equation. Therefore, each of
these small determinants may be employed in a separate determinantal equation, and
the problem is said to be partitioned into six smaller problems. It follows immediately
that we can get mixing among the three σg SOs to produce three σg MOs and likewise
for the σu set, and that the π SOs can undergo no further mixing and hence are already
MOs. (Notice that SOs in one block are not linked by off-diagonal elements either
directly or indirectly, unlike the case described in Example (7-1).)


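\[
\begin{array}{c|cccccccccc}
 & \sigma_g[1s] & \sigma_g[2s] & \sigma_g[2p] & \sigma_u[1s] & \sigma_u[2s] & \sigma_u[2p] & \pi_g[2p_x] & \pi_g[2p_y] & \pi_u[2p_x] & \pi_u[2p_y] \\ \hline
\sigma_g[1s] & \times & \times & \times & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\
\sigma_g[2s] & \times & \times & \times & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\
\sigma_g[2p] & \times & \times & \times & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\
\sigma_u[1s] & 0 & 0 & 0 & \times & \times & \times & 0 & 0 & 0 & 0 \\
\sigma_u[2s] & 0 & 0 & 0 & \times & \times & \times & 0 & 0 & 0 & 0 \\
\sigma_u[2p] & 0 & 0 & 0 & \times & \times & \times & 0 & 0 & 0 & 0 \\
\pi_g[2p_x] & 0 & 0 & 0 & 0 & 0 & 0 & \times & 0 & 0 & 0 \\
\pi_g[2p_y] & 0 & 0 & 0 & 0 & 0 & 0 & 0 & \times & 0 & 0 \\
\pi_u[2p_x] & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & \times & 0 \\
\pi_u[2p_y] & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & \times
\end{array}
\tag{7-104}
\]

Here × marks the elements Hij − ESij in the shaded blocks, which are not required to vanish by
symmetry; all other elements are zero.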
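The partitioning argument rests on the fact that the determinant of a block-diagonal matrix equals the product of the determinants of its blocks. The sketch below illustrates this with made-up integer entries standing in for Hij − ESij at some trial E; the numbers and helper names are purely illustrative and are not taken from any calculation.

```python
def det(m):
    """Determinant by cofactor expansion along the first row (adequate for small matrices)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, a in enumerate(m[0]):
        if a == 0:
            continue  # zero entries contribute nothing
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * a * det(minor)
    return total


def block_diag(blocks):
    """Assemble square blocks along the diagonal of an otherwise zero matrix."""
    n = sum(len(b) for b in blocks)
    full = [[0] * n for _ in range(n)]
    offset = 0
    for b in blocks:
        for i, row in enumerate(b):
            for j, v in enumerate(row):
                full[offset + i][offset + j] = v
        offset += len(b)
    return full


# Hypothetical entries: two 3x3 sigma blocks and four 1x1 pi blocks
sigma_g = [[2, 1, 0], [1, 3, 1], [0, 1, 4]]
sigma_u = [[5, 2, 1], [2, 6, 1], [1, 1, 7]]
blocks = [sigma_g, sigma_u, [[3]], [[3]], [[8]], [[8]]]

full = block_diag(blocks)
product_of_blocks = 1
for b in blocks:
    product_of_blocks *= det(b)

print(det(full), product_of_blocks)
```

Because the two printed numbers agree, setting any one small determinant to zero makes the full 10 × 10 determinant vanish, which is why each block can be solved as a separate problem.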


What will be the nature of the lowest-energy σg MO? It will not be pure σg[1s]
because admixture of σg[2s] and σg[2p] SOs can produce charge shifts that will lower
the energy. But the σg[2s] and σg[2p] SO energies are much higher than the σg[1s]
(mainly because the 2s and 2p AOs are higher in energy in the atoms, and the atomic
contributions still dominate in the molecule). Any energy decrease to be gained by
charge shifting must be weighed against the energy increase due to the mixing in of
such high energy components. The former very quickly become overbalanced by the
latter, so the lowest energy σg MO is almost pure σg[1s] SO, the σg[2s] and σg[2p] SOs
coming in only very slightly. This exemplifies an important general feature of quantum
chemical calculations: mixing between basis orbitals tends to be small if they have
widely different energies in the system. Thus, we now have two factors governing the
extent of mixing of functions φi and φj: the size of the interaction element Hij, and
the difference in energy between them in the system (Hii − Hjj).
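The dependence of mixing on the energy gap can be made concrete with a two-function model. The sketch below assumes an orthonormal pair of basis functions and uses the lower root of the 2 × 2 secular equation; the matrix elements are hypothetical numbers in atomic units, chosen only to show the trend.

```python
import math


def lower_mo(h11, h22, h12):
    """Lowest root and normalized coefficients (c1, c2) of a 2x2 secular problem
    over an orthonormal basis."""
    e = 0.5 * (h11 + h22 - math.sqrt(4 * h12**2 + (h11 - h22) ** 2))
    ratio = (e - h11) / h12  # c2/c1 from (H11 - E) c1 + H12 c2 = 0
    c1 = 1 / math.sqrt(1 + ratio**2)
    return e, c1, ratio * c1


# Fixed interaction element, increasing gap between the diagonal elements
for gap in (0.0, 0.5, 2.0, 10.0):
    e, c1, c2 = lower_mo(-gap, 0.0, -0.2)
    print(gap, round(e, 4), round(c2**2, 4))  # c2**2 = weight of the higher function
```

With zero gap the two functions mix equally; as the gap grows at fixed H12, the weight of the higher-energy function in the lower MO falls toward zero, which is the situation of σg[2s] and σg[2p] in the lowest σg MO.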

A label we can use for this lowest-energy MO that avoids implying that it is identical
to the σg[1s] SO is 1σg. This stands for “the lowest-energy σg MO.” Because of the low
energy of the 1s AOs, the next-lowest MO is almost pure σu[1s], and we label it 1σu.

When considering the remaining two σg MOs, we can expect substantial mixing
between σg[2s] and σg[2p] SOs because these functions are not very different in energy.
In the hydrogen atom, the 2s and 2p AOs are degenerate, and, as we move along the
periodic table, they become split farther and farther apart in energy. Therefore, we
might expect to find the greatest mixing for B2 and C2, and to find less mixing for O2
and F2. Figure 7-14 is a schematic diagram of the MO energy levels we should expect
for F2. Here the 2σg MO is primarily the σg[2s] SO and the 3σg MO is mainly the σg[2p]
[Diagram for Figure 7-14: level groups labeled “Mostly 2p,” “Mostly 2s,” and “Almost pure 1s.”]

Figure 7-14 » Schematic of MO energy level order for F2. The vertical axis is not an accurate
energy scale. For instance, the 1σ levels are very much lower in energy relative to the other levels
than is suggested by the drawing. (- - -) indicates that the orbital is unoccupied in the ground state.


SO. Similarly 2σu and 3σu are mainly σu[2s] and σu[2p], respectively. The 1π and 3σ
MOs are degenerate at the separated atom limit, where they are all 2p AOs. As the
atoms come together and interact, the π levels split apart less than the σ levels because
the 2px and 2py AOs approach each other side to side, whereas the 2pz AOs approach
end to end. The latter mode produces larger overlap and leads to larger interaction
elements and greater splitting. (Because of the symmetry of the molecule, the 1πu MOs
are always degenerate, and they may be mixed together in any way. In particular, we
can regard them as being 1πux and 1πuy or 1πu,+1 and 1πu,−1 with equal validity. The
same situation holds for the 1πg pair.) From the ordering of energy levels in Fig. 7-14
we obtain for F2 the configuration


\[
\mathrm{F_2}:\ (1\sigma_g)^2 (1\sigma_u)^2 (2\sigma_g)^2 (2\sigma_u)^2 (3\sigma_g)^2 (1\pi_u)^4 (1\pi_g)^4 \tag{7-105}
\]


Now we will consider what happens for lighter molecules. Recall that here the 2s
and 2p AOs are closer together in energy. This allows greater mixing between the SOs
containing these AOs and produces increased energy level splitting. Thus, the σg[2s]
and σg[2p] SOs mix together more, and the resulting splitting causes an additional
lowering in energy of the 2σg level and an increase for the 3σg level, compared to the
F2 case. In a similar way the 2σu and 3σu levels are lowered and raised, respectively, by
increased mixing between σu[2s] and σu[2p] SOs. The resulting energy level pattern
for C2 is shown in Fig. 7-15. Note that the extra splitting has pushed the 3σg level above
the 1πu level. No other change in the orbital energy ordering has occurred. As a result,
C2 has the configuration
[Diagram for Figure 7-15: level groups labeled “Mostly 2p and 2s,” “Mostly 2p,” “Mostly 2s and 2p,”
and “Almost pure 1s.”]

Figure 7-15 » A schematic diagram of the MO energy level order for C2. The 2s and 2pσ symmetry
functions mix more strongly in producing 2σ and 3σ MOs than is the case in F2. The level shifts
discussed in the text are indicated by dashed lines and arrows. (-- - --) represent F2 levels. Note
the inversion of the order of the 1πu and 3σg levels. The orbital ordering is deduced from spectra.
(See Mulliken [1].) (- - -) represent energies of orbitals not occupied by electrons. Note that there
is no well-defined energy ordinate for this figure, and no accurate relationship between absolute values
of orbital energies within a molecule or between molecules is implied.


\[
\mathrm{C_2}:\ (1\sigma_g)^2 (1\sigma_u)^2 (2\sigma_g)^2 (2\sigma_u)^2 (1\pi_u)^4 \tag{7-106}
\]
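The bookkeeping behind these configurations is simple enough to automate. The sketch below fills MOs in a given energy order and counts bonding minus antibonding electrons, taking σg and πu as bonding and σu and πg as antibonding, as argued above. The ASCII labels, the inclusion of 3σu as the next empty level, and the function name are illustrative choices.

```python
# (label, capacity, bonding?) in the order sketched for F2 (Fig. 7-14)
ORDER_F2 = [
    ("1sigma_g", 2, True), ("1sigma_u", 2, False),
    ("2sigma_g", 2, True), ("2sigma_u", 2, False),
    ("3sigma_g", 2, True), ("1pi_u", 4, True),
    ("1pi_g", 4, False), ("3sigma_u", 2, False),
]
# Order sketched for C2 (Fig. 7-15): 1pi_u lies below 3sigma_g
ORDER_C2 = ORDER_F2[:4] + [ORDER_F2[5], ORDER_F2[4]] + ORDER_F2[6:]


def fill(order, n_electrons):
    """Fill levels from the bottom; return (configuration, net bonding electrons)."""
    parts, net = [], 0
    for label, capacity, bonding in order:
        n = min(capacity, n_electrons)
        if n == 0:
            break
        parts.append(f"({label}){n}")
        net += n if bonding else -n
        n_electrons -= n
    return " ".join(parts), net


print(fill(ORDER_C2, 12))  # C2
print(fill(ORDER_C2, 14))  # N2
print(fill(ORDER_F2, 18))  # F2
```

The net counts obtained this way (4 for C2, 6 for N2, 2 for F2) are the "net number of bonding electrons" entries used when molecular properties are compared across the series.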


The above discussion is an effort to rationalize orbital energies obtained from cal- 
culations or deduced from molecular spectra. We have not yet described the details of 
how one goes about carrying out MO calculations on these molecules, nor will we in 
this chapter. However, even in the absence of precise numerical results, it is possible for
such qualitative arguments to be very useful in understanding and predicting features 
of a wide range of chemical reactions. 

The molecular configurations obtained from the patterns of Figs. 7-14 and 7-15
predict molecular properties that are in strikingly good qualitative agreement with
experimental observations. The data in Table 7-2 show that, when we go from H2+
to H2, adding a second electron to a bonding MO, the bond length decreases and the
dissociation energy increases. Adding a third electron (He2+) causes partial occupation
of the antibonding 1σu MO and causes the bond length to increase and the dissociation
energy to decrease. The four electron molecule He2 is not observed, as is consistent
with its configuration. The relative inertness of N2 as a chemical reactant becomes
understandable from the fact that it has six more “bonding electrons” than antibonding
electrons, giving a net of three bonds: one σ and two π bonds. For N2 to react, it
TABLE 7-2 » Some Properties of Homonuclear Diatomic Molecules and Ions in their Ground
Electronic States

                                      Net number    Binding       Equilibrium
                                      of bonding    energy,       internuclear
Molecule   MO configuration           electrons     De (eV)       separation, Re (Å)   Termᶜ
H2+        1σg                        1             2.7928        1.06                 ²Σg⁺
H2         1σg²                       2             4.747745      0.7414               ¹Σg⁺
H2−        1σg²1σu                    1             1.7ᵃ          0.8                  ²Σu⁺
He2+       1σg²1σu                    1             2.5           1.08                 ²Σu⁺
He2        1σg²1σu²                   0             0.001ᵇ        2.88                 ¹Σg⁺
He2−       [He2]2σg                   1             No data                            ²Σg⁺
Li2+       [He2]2σg                   1             1.29          3.14                 ²Σg⁺
Li2        [He2]2σg²                  2             1.05          2.673                ¹Σg⁺
Li2−       [He2]2σg²2σu               1             ~1.3(?)       3.2                  ²Σu⁺
Be2+       [He2]2σg²2σu               1             No definitive data                 ²Σu⁺
Be2        [He2]2σg²2σu²              0             0.1           2.49                 ¹Σg⁺
Be2−       [Be2]1πu                   1             ~0.3          2.4                  ²Πu
B2+        [Be2]1πu                   1             1.8           -                    ²Πu
B2         [Be2]1πu² (?)              2             ~3            1.589                ³Σg⁻
B2−        [Be2]1πu³                  3             No data                            ²Πu
C2+        [Be2]1πu³                  3             5.3           1.301                ²Πu
C2         [Be2]1πu⁴                  4             6.36          1.2425               ¹Σg⁺
C2−        [Be2]1πu⁴3σg               5             8.6           -                    ²Σg⁺
N2+        [Be2]1πu⁴3σg               5             8.86          1.116                ²Σg⁺
N2         [Be2]1πu⁴3σg²              6             9.90          1.098                ¹Σg⁺
N2−        [Be2]1πu⁴3σg²1πg           5             ~8.3          -                    ²Πg
O2+        [Be2]3σg²1πu⁴1πg           5             6.7796        1.1171               ²Πg
O2         [Be2]3σg²1πu⁴1πg²          4             5.2132        1.2075               ³Σg⁻
O2−        [Be2]3σg²1πu⁴1πg³          3             4.14          1.32                 ²Πg
F2+        [Be2]3σg²1πu⁴1πg³          3             3.39          1.32                 ²Πg
F2         [Be2]3σg²1πu⁴1πg⁴          2             1.65          1.42                 ¹Σg⁺
F2−        [Be2]3σg²1πu⁴1πg⁴3σu       1             ~1.3          1.9                  ²Σu⁺
Ne2+       [Be2]3σg²1πu⁴1πg⁴3σu       1             ~1.1          1.7                  ²Σu⁺
Ne2        [Be2]3σg²1πu⁴1πg⁴3σu²      0             0.003ᵇ        3.09                 (¹Σg⁺)

ᵃ This state is unstable with respect to loss of an electron, but is stable with respect to dissociation into an atom
and a negative ion.

ᵇ From Hirschfelder et al. [2]. It may be shown that any two neutral atoms will have some range of R where
the attractive part of the van der Waals' interaction dominates. For He2, this minimum is so shallow and the
nuclei so light that a stable state (including vibrations) probably cannot exist. For Ne2, a stable state should
exist. The data for He2 and Ne2 are calculated from considerations of intermolecular forces.

ᶜ The term symbol corresponds to the configuration of column 2.
is necessary to supply enough energy to at least partially break these bonds prior to
forming new bonds. The reversal of the orbital energy order between 1πu and 3σg can
be seen to occur between N2 and O2.

The configurations of Table 7-2 indicate that some molecules will have closed shells, 
whereas others will have one or more unpaired electrons. For example, O2 has a
configuration in which the degenerate 1πg level “contains” two electrons. Several states
can be produced from such a configuration, just as several states could be produced from
the 1s2s configuration for He. Hund's first rule has been found to hold for molecules,
and so the state of highest multiplicity is lowest in energy. For n unpaired electrons,
the highest multiplicity achievable is n + 1. For O2, this is three, and we expect the
ground state of the O2 molecule to be a triplet. This is conveniently demonstrated to be 
so by observing that liquid oxygen is attracted into the gap between poles of a magnet, 
consistent with the existence of uncanceled electron spin magnetic moments. 

Close perusal of Table 7-2 indicates that some of the data are not in accord with 
the simple qualitative ideas just presented. For example, Li2+ is more strongly bonded
than is Li2, even though the former has fewer bonding electrons. However, the Li2+
ion-molecule is longer than Li2. H2− is less strongly bound than isoelectronic He2+, yet
it has a shorter equilibrium internuclear separation. Irregularities such as these require 
more detailed treatment. However, one of the useful characteristics of a qualitative 
approach is that it enables us to recognize cases that deviate from our expectations and 
therefore warrant further study. 

In Figs. 7-14 and 7-15, we saw how MOs are related to SOs for the separated 
atoms. Let us now consider how the separated atom SOs correlate with the united 
atom AOs. Recall that these orbitals are correlated by requiring them to be of identical 
symmetry. In Fig. 7-16 some of the possible SOs and united-atom AOs, together with 
their symmetry labels, are shown. Note that the σg SOs can correlate with s or dσ AOs,
σu SOs with pσ AOs, πu SOs with pπ AOs, and πg SOs with dπ AOs. This gives us all
the information we need except for resolving the ambiguities within a given symmetry
type. For instance, which of the 1s, 2s, 3s, 3dσ, ... in the AOs correlates with which of
the σg[1s], σg[2s], σg[2p], σg[3s], ... in the SOs? This question is resolved by use of the
noncrossing rule, which states that, in correlation diagrams, energy levels associated
with orbitals or states of the same symmetry will not cross. This requires that we match
up the lowest-energy united-atom AO of a given symmetry with the lowest energy SO
of that symmetry, and so on up the ladder. This leads to the diagram in Fig. 7-17. The
line interconnecting the 1s AO of the united atom with the 1s AOs of the separated
atoms refers, at intermediate R, to the 1σg MO. We have already seen that this MO may
contain contributions from σg[2s] and σg[2p] SOs. Thus, the correlation diagram tells
us what orbitals the 1σg MO “turns into” at the limits of R, but does not imply that, at
other R values, this MO is comprised totally of 1s AOs.
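The matching procedure implied by the noncrossing rule amounts to pairing two energy-ordered lists within each symmetry type. The sketch below does exactly that; the energy orderings written into the two dictionaries (for example, 3s below 3dσ in the united atom) are assumptions made for illustration, and the ASCII labels are ours.

```python
# Lowest-energy first within each symmetry type (assumed orderings)
UNITED_ATOM = {
    "sigma_g": ["1s", "2s", "3s", "3d_sigma"],
    "sigma_u": ["2p_sigma", "3p_sigma", "4p_sigma"],
    "pi_u": ["2p_pi", "3p_pi"],
    "pi_g": ["3d_pi"],
}
SEPARATED = {
    "sigma_g": ["sigma_g[1s]", "sigma_g[2s]", "sigma_g[2p]", "sigma_g[3s]"],
    "sigma_u": ["sigma_u[1s]", "sigma_u[2s]", "sigma_u[2p]"],
    "pi_u": ["pi_u[2p]"],
    "pi_g": ["pi_g[2p]"],
}


def correlate(united, separated):
    """Pair the k-th lowest SO with the k-th lowest united-atom AO of the same symmetry."""
    pairs = {}
    for symmetry, sos in separated.items():
        for so, ao in zip(sos, united.get(symmetry, [])):
            pairs[so] = ao
    return pairs


for so, ao in correlate(UNITED_ATOM, SEPARATED).items():
    print(f"{so:12s} <-> {ao}")
```

Under these orderings σg[2p] is paired with the united-atom 3s AO and σu[1s] with 2pσ, so the ungerade combination of 1s AOs is the one that ends up in a higher shell of the united atom.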

Study of this correlation diagram reveals that the antibonding MOs (σu and πg) are the
MOs that correlate with higher energy united-atom AOs and hence favor the separated
atoms in terms of energy. This results from the fact that antibonding MOs have a nodal
plane bisecting the bond which is preserved as we proceed to the united atom, yielding
a united atom AO having one more node than the separated AOs we started with. Thus,
1sσu goes to p, 2pπg goes to d, etc. The MO is said to be “promoted.”

We have now seen several ways to explain the effects of an orbital. We may focus 
on energies, and note that bonding and antibonding MOs correlate with low-energy 
and high-energy united-atom orbitals, respectively. Or, as we saw earlier, we can focus 
Figure 7-16 » (a) Symmetry orbitals for homonuclear diatomic molecules. (b) United-atom AOs
characterized by symmetry with respect to z axis. 


on charge distributions and their attractions for nuclei, and note that bonding MOs 
concentrate charge in the bond region, attracting the nuclei together, whereas anti- 
bonding MOs shift charge outside the bond, attracting the nuclei apart. Alternatively, 
we can recognize that bonding MOs result from AOs on each atom coming together 
in phase, resulting in positive overlap, while antibonding MOs result from opposite 
phases coming together to give negative overlap. 
Figure 7-17 » Correlation diagram between separated-atom orbitals and united-atom orbitals for
homonuclear diatomic molecules. Energy ordinate and internuclear separation abscissa are only
suggestive. No absolute values are implied by the sketch. [Note: For H2+, σg2p correlates with
3dσg, σg3s with 3sσg. This arises because the separability of the H2+ hamiltonian (in the
Born-Oppenheimer approximation) leads to an additional quantum number for this molecule. In essence,
the H2+ wavefunction in elliptical coordinates may be written ψ = L(λ)M(μ)e^{imφ}. The function L
may have nodal surfaces of elliptical shape. M may have nodes of hyperbolic shape. In the correlation
diagram for H2+, it is necessary that ellipsoidal nodes correlate with spherical nodes in the united
atom, while hyperboloid nodes correlate with hyperboloid nodes (which may be planar). Sketching
σg2p and σg3s and comparing them with 3sσg and 3dσg (i.e., 3d(3z²−r²)) makes clear how this “nodal
control” results in what, at first sight, appears to be a violation of the noncrossing rule. For H2+,
modifications for higher-energy states will also be required. For example, 4sσg will correlate with
σg4s, not σg3p.]
Three common labeling conventions are used in Fig. 7-17. A level may be labeled
with reference to the separated atom AOs to which it correlates. The separated atom
AO symbol is placed to the right of the MO symmetry symbol (e.g., σg2s). Note the
absence of square brackets, which we used to symbolize the SO (2sA ± 2sB). The
symbol σg2s means “the MO of σg symmetry that correlates with 2s AOs at R = ∞.” An
alternative label indicates the united-atom orbital with which the MO correlates. Here
the AO label is placed to the left of the symmetry symbol (e.g., 3pσu). The u and g
subscripts in the united-atom notation are redundant and are often omitted. However, they
are helpful in drawing correlation diagrams. Finally, the MOs may be simply numbered
in their energy order within each symmetry type, as mentioned earlier (e.g., 2σg).


EXAMPLE 7-3 A symmetry orbital is produced by taking 3dyz,a − 3dyz,b, where
a and b are points on the z axis. Sketch the situation. What is the sign of the overlap
between these AOs? Is this SO bonding or antibonding? What is the symmetry
symbol for this SO?

SOLUTION » The sketch would show two four-leafed AOs, each like the 3dyz AO shown in
Fig. 7-16, except that the phase signs of one would be minus those in the matching lobes of the
other. These AOs are positioned side by side, and their phases are such that the inner lobes pointing
towards each other are in phase. Hence the overlap is positive and the SO is bonding. Looking at
this SO along the z axis, it appears like a py AO, so it is π. Inversion causes interchange of lobes
of opposite sign, so it has u symmetry. Its symbol, then, is πu.
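The conclusions of this example can be checked numerically with a crude model. The sketch below uses unnormalized functions of the form y(z − zc)exp(−r) as stand-ins for the 3dyz AOs, places them at z = −4 and z = +4 a.u., and sums on a coarse grid; the exponent, the separation, and the grid are arbitrary illustrative choices, and the sign of the overlap is reported only for this well-separated geometry.

```python
import math

ZA, ZB = -4.0, 4.0  # assumed positions of centers a and b on the z axis (a.u.)


def d_yz(x, y, z, zc):
    """Unnormalized real d_yz-like function centered at (0, 0, zc), exponent 1."""
    dz = z - zc
    r = math.sqrt(x * x + y * y + dz * dz)
    return y * dz * math.exp(-r)


def so(x, y, z):
    """The symmetry orbital d_yz(a) - d_yz(b)."""
    return d_yz(x, y, z, ZA) - d_yz(x, y, z, ZB)


# Grid sum approximating the overlap of d_yz(a) with the sign-reversed d_yz(b)
step = 0.5
pts = [-10.0 + step * i for i in range(41)]
overlap = step**3 * sum(
    d_yz(x, y, z, ZA) * (-d_yz(x, y, z, ZB))
    for x in pts for y in pts for z in pts
)

p = (0.3, 0.7, 0.2)
print("overlap > 0:", overlap > 0)
print("inversion:", so(*p), so(-p[0], -p[1], -p[2]))        # equal magnitude, opposite sign
print("reflect y -> -y:", so(*p), so(p[0], -p[1], p[2]))     # sign change, as for a p_y AO
```

The sign change under inversion corresponds to the u label, and the sign change on reflecting y (with no change on reflecting x) is the end-on p_y appearance that identifies the SO as π.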


Term symbols for electronic states of homonuclear diatomic molecules are much like
those in atoms. The main symbol gives information about the component of electronic
angular momentum along the z axis. If ML = 0, ±1, ±2, etc., then the main symbol is
Σ, Π, Δ, etc. ML is the sum of ml values for the electrons. When identifying ml, we
assume that the π MOs are complex, with ml = +1 and −1 rather than being mixed
to give real πx and πy MOs. The main symbol is decorated with a superscript at left,
giving spin multiplicity, and a g or u subscript at right for overall inversion symmetry.
As an example, C2+, with configuration 1σg²1σu²2σg²2σu²1πu³, has ML = ±1 (zero for all
σ electrons and either +1, +1, −1 or +1, −1, −1 for the three π electrons) so its main
symbol is Π. The inversion symmetry is u because there is an odd number of electrons
in ungerade MOs. There is one unpaired electron, so the multiplicity is 2. The term
symbol is ²Πu (“doublet-pi-you”).

Σ terms are given, in addition, a + or − superscript at the right, indicating whether the
wavefunction is symmetric or antisymmetric for reflection through a plane containing
the nuclei. Such a reflection has the effect of reversing the direction of φ: Clockwise
motion around the internuclear axis becomes counterclockwise when viewed in a mirror
containing that axis. This transforms exp(iφ) and exp(−iφ) into each other, so that π+
and π− turn into (minus) each other. To achieve a Σ state in the first place requires that
ml values sum to 0. This happens for occupied equivalent π MOs only if π+ and π−
are equally occupied, which means that both have one electron or both are full. The
full case must yield ¹Σg⁺, i.e., must be symmetric for every operation, because each
MO occurs twice in the electron list, and even antisymmetric functions give symmetric
results when multiplied by themselves. For the half-filled case there are only two
possibilities: π+(1)π−(2) ± π−(1)π+(2). The “plus” case is symmetric for electron
exchange, hence goes with the antisymmetric spin function αβ − βα, hence is a singlet
state. Reflection causes π+ ↔ π−, but this returns the same function, so this case goes
with a ¹Σ⁺ term. Similar reasoning shows that the “minus” case goes with a ³Σ⁻ term.
Hund's rule predicts the latter term to lie below the former in energy.


EXAMPLE 7-4 What term symbols represent possible excited states of N2 produced
by promoting an electron from the highest occupied MO to the lowest unoccupied
MO?

SOLUTION » When an electron is promoted, the possible resulting 1πu, 1πg configurations can
be pictured as follows:

[Diagram: the possible occupations of the 1πu and 1πg MOs after promotion, with each MO labeled
by ml = +1 or −1.]

The excited molecule has two half-filled MOs, for which the electron spins can be opposite (singlet)
or the same (triplet). The sum of electron orbital momenta will remain zero (giving Σ) if the electron
is excited from ml = −1 to −1 or from +1 to +1, but will change to +2 or −2 (giving Δ) if changed
from −1 to +1 or from +1 to −1. In all cases, the excited state has an odd number of electrons in
the MOs of ungerade symmetry, so all terms will have a u subscript. The term symbols, then, are,
respectively, ¹Σu, ³Σu, ³Δu, ¹Δu.
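The counting in this solution can be written out as a short enumeration. The sketch below follows the solution's picture of a 1πu → 1πg promotion: it loops over the ml value the electron leaves and the ml value it enters, takes |ML| from the change, allows singlet and triplet spin coupling, and attaches the u label. The ASCII term labels are ours, and the + or − superscript on Σ terms is not worked out here.

```python
from itertools import product

LETTER = {0: "Sigma", 1: "Pi", 2: "Delta"}


def excited_terms():
    """Terms from moving one electron out of a filled pi_u pair into an empty pi_g pair."""
    terms = set()
    for m_from, m_to in product((+1, -1), repeat=2):
        # The filled pi_u shell has ML = 0; the promotion changes ML by m_to - m_from
        big_lambda = abs(m_to - m_from)
        for multiplicity in (1, 3):  # two half-filled MOs: spins opposite or parallel
            # Three electrons remain in pi_u: an odd number in ungerade MOs gives u
            terms.add(f"{multiplicity}{LETTER[big_lambda]}_u")
    return sorted(terms)


print(excited_terms())
```

The four labels produced are the singlet and triplet Σu and Δu terms listed in the solution.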


These conventions for term symbols apply for any linear molecule having inversion 
symmetry (e.g., HCCH, CO2). For linear molecules lacking inversion symmetry (CO,
HCN) all is the same except that there is no g-u symbol. 

The noncrossing rule mentioned above is an important aid in constructing correlation 
diagrams for many processes. It is called a rule rather than a law because it can only be 
shown to be highly improbable, not impossible, for two levels of the same symmetry to 
cross. Thus, imagine that we have a molecule with some variable parameter λ and also
with a symmetry operation R which is not lost as λ varies. For example, λ might be the
H-O-H angle in water, and R could be reflection through the plane bisecting the H-O-H
angle. Suppose that we have a complete set of basis functions and that, at each value
of λ, we manage to express exactly all but two of the eigenfunctions for the molecule.
This uses up all but two dimensions of our function space, leaving us, at each value of λ,
with two eigenfunctions to determine and two functions in terms of which to express
them. (These functions change with λ, but the above argument has nevertheless served
to reduce our problem to two dimensions.) Now let the two functions remaining from
our original basis be mixed to become orthonormal and also individually either
symmetric or antisymmetric for R. We label these symmetrized basis functions χ1 and χ2.
Because we began with a complete basis, it must be possible to express the as yet
undetermined wavefunctions ψ1 and ψ2 exactly as linear combinations of χ1 and χ2
at each value of λ. Furthermore, if ψ1 and ψ2 are, say, both antisymmetric for R, it
is necessary that χ1 and χ2 also both be antisymmetric. If ψ1 and ψ2 have opposite
symmetries, however, χ1 and χ2 also have opposite symmetries. (In the latter case, χ1
and χ2 can only mix to produce unsymmetric functions, and so we know that χ1 and χ2
are already identical with ψ1 and ψ2.) To determine the mixing coefficients and state
energies for ψ1 and ψ2, we solve the 2 × 2 secular equation over the basis χ1, χ2:


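\[
\begin{vmatrix} H_{11} - E & H_{12} \\ H_{12} & H_{22} - E \end{vmatrix} = 0 \tag{7-107}
\]

The roots are

\[
E_{\pm} = \tfrac{1}{2}\left\{ H_{11} + H_{22} \pm \left[ 4H_{12}^{2} + (H_{11} - H_{22})^{2} \right]^{1/2} \right\} \tag{7-108}
\]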


The crossing of energy levels for ψ_1 and ψ_2 requires that, at some value of λ, E_+ equals
E_−. From Eq. (7-108), we see that this requires that the term in square brackets vanish,
which requires that both H_12 and H_11 − H_22 vanish. Now, if χ_1 and χ_2 (and hence ψ_1
and ψ_2) have opposite symmetries for R, H_12 vanishes for all values of λ, and the
curves will cross whenever H_11 equals H_22. But if χ_1 and χ_2 (and hence ψ_1 and ψ_2)
have the same symmetry for R, H_12 is not generally zero. In this situation, the curve
crossing requires that both H_12 and H_11 − H_22 happen to pass through zero at the same
value of λ. This simultaneous occurrence of two functions passing through zero is so
unlikely that it is safe to assume it will not happen.
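
The argument can be checked numerically with a minimal sketch. The linear dependence of H_11 and H_22 on λ, the numerical values, and the function names below are illustrative assumptions, not taken from the text. The sketch evaluates the two roots of the 2 × 2 secular equation along a path on which H_11 and H_22 cross, once with H_12 = 0 (opposite symmetries) and once with H_12 ≠ 0 (same symmetry).

```python
import math

def two_state_energies(h11, h22, h12):
    """Return (E_minus, E_plus), the two roots of the 2x2 secular equation."""
    root = math.sqrt(4.0 * h12**2 + (h11 - h22)**2)
    return 0.5 * (h11 + h22 - root), 0.5 * (h11 + h22 + root)

def min_gap(h12, steps=201):
    """Smallest E_plus - E_minus along a hypothetical path in lambda.

    Assumed model: H11 = -1 + lambda and H22 = -1 - lambda, so the diagonal
    elements cross at lambda = 0; H12 is held constant along the path.
    """
    gaps = []
    for i in range(steps):
        lam = -1.0 + 2.0 * i / (steps - 1)  # lambda runs from -1 to +1
        e_minus, e_plus = two_state_energies(-1.0 + lam, -1.0 - lam, h12)
        gaps.append(e_plus - e_minus)
    return min(gaps)

gap_opposite_symmetry = min_gap(h12=0.0)  # H12 vanishes by symmetry: curves cross
gap_same_symmetry = min_gap(h12=0.1)      # H12 nonzero: crossing is avoided
print(gap_opposite_symmetry, gap_same_symmetry)
```

In this model the gap closes only when H_12 is zero; with a nonzero H_12 the smallest separation is 2|H_12|, reached where H_11 equals H_22.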

If the molecule possesses several elements of symmetry, H_12 will vanish at all λ if
ψ_1 and ψ_2 disagree in symmetry for any one of them, so the noncrossing rule applies
only to states having wavefunctions of identical symmetry for all symmetry operations
of the molecule.

A similar treatment for orbitals and orbital energy levels is possible, and the non- 
crossing rule applies for orbital energies as well as for state energies. 


7-7 Basis Set Choice and the Variational Wavefunction


One of the places where human decision can affect the outcome of a variational
calculation is in the choice of basis. Some insight into the ways this choice affects the
ultimate results is necessary if one is to make a wise choice of basis, or recognize which
calculated results are "physically real" and which are artifacts of basis choice.

One question we can ask is this: Is a minimal basis set equally appropriate for 
calculating an MO wavefunction for, say, B2 as F2? In each case we use 10 AOs 
and 2 spin functions producing a total of 20 spin MOs. With B2, however, we have 
10 electrons to go into these spin MOs, and in F2 we have 18 electrons. In all but 
the crudest MO calculations, the total energy is minimized in a manner that depends 
on the natures of only the occupied MOs. In effect, then, the calculation for B2 produces 
the 10 “best” spin MOs from a basis set of 20 spin-AOs, whereas that for F2 produces 
the 18 best MOs from a different basis set of 20 spin-AOs. In a sense, then, the basis 
for F2 is less flexible than that for B2. Of course, the use of separated atom orbitals is a
conscious effort to choose that basis that best spans the same function space as the best 
MOs. To the extent that this strategy is successful, the above problem is obviated (i.e., if 
both sets are perfect, additional flexibility is useless). The strategy is not completely 
successful, however, and comparison of results of minimal basis set calculations down 
a series of molecules such as B2, C2, N2, O2, and F2 may be partially hampered by this
ill-defined inequivalence in basis set adequacy. In contrast, comparison of calculated
results in a series of molecules such as C_nH_(2n+2) is much less likely to suffer from this
particular problem because the minimal basis set grows with increasing n in a way to 
keep pace with the number of electrons. 
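
The counting behind this comparison can be sketched as follows. The function name and the dictionary of atomic numbers are illustrative, and the fraction of spin MOs that is occupied is only a rough indicator of how much variational freedom is left over, not a quantity defined in the text.

```python
# Minimal basis for atoms B through F: 1s, 2s, 2px, 2py, 2pz -> 5 AOs per atom.
AOS_PER_ATOM = 5
ATOMIC_NUMBER = {"B": 5, "C": 6, "N": 7, "O": 8, "F": 9}

def occupied_fraction(symbol):
    """Electrons, spin MOs, and occupied fraction for the homonuclear diatomic."""
    spin_mos = 2 * (2 * AOS_PER_ATOM)     # two atoms, two spin functions per AO
    electrons = 2 * ATOMIC_NUMBER[symbol]
    return electrons, spin_mos, electrons / spin_mos

for symbol in ATOMIC_NUMBER:
    electrons, spin_mos, fraction = occupied_fraction(symbol)
    print(f"{symbol}2: {electrons} electrons in {spin_mos} spin MOs ({fraction:.0%} occupied)")
```

The occupied fraction rises from one half for B2 to nine tenths for F2, which is the sense in which the same-sized minimal basis leaves less flexibility for F2.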

Let us now briefly consider how basis sets might vary in adequacy for different 
states of a given molecule. We will compare the wavefunction for the ground state of 
a molecule with the wavefunction for a Rydberg state. Rydberg states are so named 
because their spectral lines progress toward the ionization limit in a manner similar to the 
spectral pattern for hydrogenlike ions (called a Rydberg series).⁷ Hence, the Rydberg
states of molecules are in some way like excited states of the hydrogen atom. This can
be understood by visualizing an excited state for, say, N2 wherein one electron is, on
the average, very far away from the rest of the molecule, which is now an N2⁺ "core."
As the excited electron moves to orbitals farther and farther out, the N2⁺ core becomes
effectively almost like a point positive charge. As a result, the coupling between the 
angular momentum of this orbital and the internuclear axis grows progressively weaker, 
so that the motion of the Rydberg electron becomes more and more independent of 
orientation of the core. It is not surprising that a hydrogen-like AO centered in the bond 
becomes more and more appropriate as a basis for describing this orbital. In contrast, 
such a "single-center" basis normally requires many terms to accurately describe MOs 
in ground or non-Rydberg excited states. Thus, for a Rydberg state of N2, one would 
do well to choose a basis set of AOs located on the nuclei to describe the MOs of the 
N2⁺ core, and to use an AO (or several AOs) centered between the nuclei to describe
the orbital for the Rydberg electron. 
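
The hydrogenlike pattern referred to here can be illustrated with a short sketch. It uses the standard hydrogenlike-ion energy formula E_n = −Z²/(2n²) in atomic units; the function name and the range of n are arbitrary choices for illustration.

```python
def hydrogenlike_energy(n, z=1):
    """Energy (a.u.) of level n for a one-electron ion of nuclear charge z."""
    return -z**2 / (2.0 * n**2)

levels = [hydrogenlike_energy(n) for n in range(1, 8)]
gaps = [levels[i + 1] - levels[i] for i in range(len(levels) - 1)]

for n, energy in enumerate(levels, start=1):
    print(f"n = {n}: E = {energy:.5f} a.u.")
# Successive gaps shrink as the levels pile up below the ionization limit (E = 0).
print([round(gap, 5) for gap in gaps])
```

A Rydberg electron far from a compact positive core sees nearly the same 1/r attraction, which is why its levels converge on the ionization limit in the same way.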

Thus far we have kept the discussion within the framework of homonuclear diatomic 
molecules. When we come to heteronuclear diatomics, for example, CO, we lose 
inversion symmetry and we can no longer symmetry balance our basis. This means 
that a given basis may be more inadequate for representing the wavefunction on one 
end of the molecule than on the other. As a result, the electronic charge will be shifted 
toward the end where the basis set is best able to minimize the energy. This charge 
shift is an artifact of basis set imbalance, but, since we have no way to evaluate this 
imbalance, it is difficult to tell how much it affects our results. Mulliken has published 
some calculations on the HF molecule that illustrate this problem in a striking way. 
Table 7-3 is a list of total energies and dipole moments calculated for HF using a variety 
of basis sets. 

The first column of data arises from a minimal basis set of STOs (1s_H, 1s_F, 2s_F, 2pσ_F,
2pπx_F, 2pπy_F) with orbital exponents evaluated from Slater's rules for atoms. The
second column results if the orbital exponents are allowed to vary independently
to minimize the molecular energy. The basis set for the third column is obtained
by augmenting the previous basis with additional STOs centered on the H nucleus
(2s_H, 2pσ_H, 2pπx_H, 2pπy_H). Finally, the fourth column results from use of a basis set
that has been augmented (over the minimal basis) at both nuclei in a way thought to be
appropriately balanced. As the basis set grows increasingly flexible, the average energy
becomes lower, but the expectation value for the dipole moment does not converge
uniformly toward the observed value. In particular, by augmenting the basis on hydrogen
only, we create a very unbalanced basis, which causes charge to shift too much toward
the hydrogen end of the molecule.


⁷ See A. B. F. Duncan [3].



TABLE 7-3 » Energies and Dipole Moments for Hydrogen Fluoride Calculated by the Variation
Method Using Different Basis Setsᵃ

             Min STO,      Min STO,      Min STO F; Aug.      Aug. STO
             Slater ζ      best ζ        STO H (very          F and H
                                         unbalanced)          (balanced)      Exp
E (a.u.)     −99.4785      −99.5361      −99.6576             −100.0580       −100.527
μ(H⁺F⁻)      0.85 D        1.44 D        0.92 D               1.98 D          1.82 Dᵇ

ᵃ See Mulliken [4].
ᵇ Data from Weiss [5].
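
The contrast between the two rows of Table 7-3 can be made explicit with a few lines of arithmetic. The sketch below simply re-uses the tabulated values; the variable names and column labels are illustrative.

```python
# (label, energy in a.u., dipole moment in debye), in the column order of Table 7-3
CALCULATED = [
    ("Min STO, Slater zeta",     -99.4785, 0.85),
    ("Min STO, best zeta",       -99.5361, 1.44),
    ("Aug. STO on H only",       -99.6576, 0.92),
    ("Aug. STO on F and H",     -100.0580, 1.98),
]
E_EXP, MU_EXP = -100.527, 1.82

energies = [energy for _, energy, _ in CALCULATED]
dipole_errors = [round(abs(mu - MU_EXP), 2) for _, _, mu in CALCULATED]

# The energy falls with every enlargement of the basis and stays above E_EXP.
energy_is_monotonic = all(b < a for a, b in zip(energies, energies[1:]))
# The dipole error does not fall monotonically: it grows again for the unbalanced basis.
dipole_is_monotonic = all(b < a for a, b in zip(dipole_errors, dipole_errors[1:]))

for (label, energy, _), err in zip(CALCULATED, dipole_errors):
    print(f"{label:22s} E - E_exp = {energy - E_EXP:7.4f} a.u.   |mu - mu_exp| = {err:.2f} D")
print(energy_is_monotonic, dipole_is_monotonic)
```

The energy improves steadily, as the variation principle guarantees for nested basis sets, while the dipole moment error rises from 0.38 D to 0.90 D when only the hydrogen basis is augmented.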


These problems with basis set adequacy are difficult to overcome completely. For- 
tunately, with a certain amount of experience, insight, and caution, it is nevertheless 
possible to carry out variational calculations and interpret their results to obtain reliable 
and useful information. 


7-8 Beyond the Orbital Approximation


Most of our discussion of the variation method has been restricted to calculations within 
the orbital approximation. To avoid leaving an inaccurate impression of the capabilities 
of the variation method, we shall briefly describe some calculations on some small 
(two-electron) systems where the method can be employed to its fullest capabilities. 
These calculations are listed in Table 7-4. 

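The calculation on He by Kinoshita expresses the spatial part of the wavefunction as

$$\psi(ks, kt, ku) = e^{-ks/2} \sum_{\substack{l,m,n=0 \\ n\ \mathrm{even}}} c_{l,m,n}\,(ks)^{l-m}\,(ku)^{m-n}\,(kt)^{n} \qquad (7\text{-}109)$$

where

$$s = r_1 + r_2, \qquad u = r_{12}, \qquad t = -r_1 + r_2 \qquad (7\text{-}110)$$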


and k and c_(l,m,n) are variable parameters. The exponential term causes the wavefunction
to vanish as either electron goes to infinite r, and the terms in the sum build up
a polynomial in one- and two-electron coordinates, reminiscent of the form of
eigenfunctions for the harmonic oscillator and the hydrogenlike ion. Kinoshita carried out
his calculation to as many as 39 terms, obtaining an energy that he estimated to differ
from the exact result by no more than 1.2 × 10⁻⁶ a.u. A subsequent calculation by
Pekeris, using a related approach, required solving a secular determinant of order 1078
and yielded an energy estimated to be accurate to 1.0 × 10⁻⁹ a.u. Applying corrections
for coupling between electronic and nuclear motions, and also for relativistic effects,
Pekeris arrived at a theoretical value for the ionization energy of He of 198310.687 cm⁻¹
compared to the experimental value of 198310.85 ± 0.15 cm⁻¹.

TABLE 7-4 » Results of Some Very Accurate Variational Calculations on
Two-Electron Systems

                                                  Estimated maximum
                          Minimized               energy error
System                    energyᵃ (a.u.)          E − E_exact (a.u.)
He (ground singlet)ᵇ      −2.9037225              0.0000012
He (ground singlet)ᶜ      −2.903724375            0.000000001
He (lowest triplet)       −2.17522937822          0.00000000001
H2 (ground singlet)ᵈ      −1.17447498302ᵉ         0.000000001

ᵃ Uncorrected for nuclear motion and relativistic effects.
ᵇ From Kinoshita [6].
ᶜ From Pekeris [7].
ᵈ From Kolos and Wolniewicz [8].
ᵉ At R = 1.401078 a.u.


In the 35 years since Pekeris' work was reported, even more extensive calculations
have been reported. For example, Drake et al. [9] have calculated an upper bound for 
the ground state of helium having 22 significant figures using a wavefunction having 
2358 terms. 

Extremely accurate variational calculations have been carried out on H2 by Kolos and
Wolniewicz. They used elliptic coordinates and an r_12 coordinate and expressed their
wavefunction as an expansion in powers of these coordinates, analogous in spirit to the
Kinoshita wavefunction described above. Their most accurate wavefunctions contain
100 terms and are calculated for a range of R values. After including corrections for
relativistic effects and nuclear motion, Kolos and Wolniewicz arrived at a theoretical
value for the dissociation energy in H2 of 36117.4 cm⁻¹ compared to what was then
the best experimental value, 36113.6 ± 0.5 cm⁻¹. Subsequent redetermination of the
experimental value gave 36117.3 ± 1.0 cm⁻¹.⁸

The dissociation energy, D_0, is the energy required to separate a molecule into its
constituent atoms, starting with a molecule in its lowest vibrational state. The binding
energy, D_e, is the energy for the corresponding process if we omit the vibrational
energy of the molecule (see Fig. 7-18). These quantities are often much more sensitive
measures of the accuracies of calculations than are total energies. The reason for this is
easily understood when we recognize that the binding energy is a fairly small difference
between two large numbers: the total energy of the molecule and the total energy of
the separated atoms. Unless our errors in these two large energies are equal, the residual
error is magnified (in terms of percentage) when we take the difference. Thus, the best
total energy for H2 in a certain orbital approximation is −1.133629 a.u., which is 96.5%
of the exact total energy (−1.174475 a.u., Table 7-4). However, the corresponding binding
energy is −0.133629 a.u., which is 76.6% of the correct value. The need to calculate
accurate binding energies is sometimes referred to as the need to achieve "chemical accuracy."
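
The magnification of error can be reproduced with a short calculation. This sketch assumes that the exact reference is the H2 energy listed in Table 7-4 and that the separated atoms contribute 2 × (−0.5 a.u.); the variable names are illustrative.

```python
E_ATOMS = -1.0             # two separated H atoms, 2 x (-0.5 a.u.)
E_EXACT = -1.17447498302   # H2 ground state, Table 7-4 (assumed exact reference)
E_ORBITAL = -1.133629      # orbital-approximation total energy quoted in the text

def binding(e_total):
    """Energy of the molecule relative to the separated atoms (a.u.)."""
    return e_total - E_ATOMS

total_fraction = E_ORBITAL / E_EXACT
binding_fraction = binding(E_ORBITAL) / binding(E_EXACT)

print(f"fraction of total energy recovered:   {total_fraction:.3f}")
print(f"fraction of binding energy recovered: {binding_fraction:.3f}")
```

The same absolute error of about 0.041 a.u. amounts to a few percent of the total energy but to nearly a quarter of the binding energy.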

The variational calculations cited above are among the most accurate performed,
and they give an indication of the capabilities of the method. Properties other than
energy predicted from such wavefunctions are also very accurate. For example, Pekeris'
best wavefunction for the first triplet state of helium gives an electron density at the
nucleus of 33.18416 electrons per cubic bohr compared with the experimental value
33.18388 ± 0.00023 deduced from hyperfine splitting. For most systems of chemical
interest, calculations of this sort become much too impractical to be considered. For
this reason the orbital approximation, with all its limitations, is used in most
quantum-chemical calculations on systems having more than two electrons.


⁸ See Herzberg [10].

Figure 7-18 » Schematic drawing showing the distinction between D_0, the dissociation energy
from the lowest vibrational level, and D_e, the binding energy, which does not take vibrational energy
into account. The zero of energy is the energy of the separated atoms.


7-8.A Problems 


7-1. Given the following two functions, f(x) and g(x), for the range 0 < x < L:


Figure P7-1 » [sketches of (a) f(x) and (b) g(x)]

Can these functions be expressed accurately as linear combinations of particle-
in-a-box eigenfunctions (box walls at x = 0, L)? Indicate your reasoning. If yes, 
what is the expression for the first two coefficients in the expansion? Can you 
evaluate any of these by inspection? 


7-2. Consider a particle in a box with a biased potential that is higher at x = L than
at x = 0. An approximate solution for the ground state could be φ = √0.9 ψ_1 +
√0.1 ψ_2, where ψ_1 and ψ_2 are the first and second eigenfunctions for the unbiased
box. (a) Make a rough sketch of φ, showing how it skews the particle distribution.
(b) What is the average kinetic energy for φ, in terms of h, m, and L?


7-3. The normalized function φ = (2/45π)^(1/2) r² exp(−r) can be expanded in terms of
hydrogen atom eigenfunctions:

$$\phi = c_1\psi_{1s} + c_2\psi_{2s} + c_3\psi_{2p_0} + \cdots$$

where ψ_1s = (1/√π) exp(−r) and ψ_2p0 = (1/√(32π)) r exp(−r/2) cos θ. Evaluate
c_1 and c_3.


7-4. Given the approximate wavefunction for the lowest state of a particle in a
one-dimensional box (Fig. P7-4):

Figure P7-4 » [sketch of φ versus x, with the x axis marked at 0, L/2, and L]

$$\phi = \begin{cases} \sqrt{3/L}\,(2x/L), & 0 \le x \le L/2 \\ \sqrt{3/L}\,[2(L-x)/L], & L/2 \le x \le L \\ 0, & x < 0,\ x > L \end{cases}$$

a) Resolve φ into the box eigenfunctions. That is, evaluate c_n in the expression

$$\phi = \sum_{n=1}^{\infty} c_n \psi_n,$$

where

$$\psi_n = \sqrt{2/L}\,\sin(n\pi x/L), \quad 0 \le x \le L; \qquad \psi_n = 0, \quad x < 0,\ x > L$$

b) Using the coefficients from part (a), compare the value of φ at x = L/2 with
the values one obtains from

$$\phi_{\mathrm{approx}} = \sum_{n=1}^{m} c_n \psi_n, \quad \text{with } m = 1, 3, 5, 7, \text{ and } 9$$

c) Use the coefficients from part (a) to obtain an expression for E appropriate
for φ. Estimate the value of the infinite series and thereby estimate E. Compare
this value to E_exact.


7-5. Let φ = exp(−αr²) be a trial function (not normalized) for the ground state of
the hydrogen atom. Use the variation method to determine the minimum energy
attainable from this form by variation of α. Find the average value of r and the
most probable value of r for this wavefunction. Compare these r values and the
average energy with the exact values.


7-6. Let φ(α) = (α⁵/3π)^(1/2) r exp(−αr) be a trial function for the ground state of the
hydrogen atom:

a) Verify that the variation method gives α = 3/2, E = −3/8 a.u.

b) Verify that φ(3/2) has an overlap of 0.9775 with the 1s function.

c) Find the value of α that maximizes the overlap of φ with the 1s function and
determine the average energy of this new φ.


7-7. A normalized approximate wavefunction φ for the hydrogen atom is projected
into its components and found to be φ = (1/√2)ψ_1s + (1/√4)ψ_2s +
(1/√8)ψ_3s + c_4s ψ_4s. There are no higher terms.

a) What is the value of c_4s?
b) What is the value of the average energy?


7-8. A normalized, spherically symmetric variationally optimized function for the
hydrogen atom is analyzed by projecting out coefficients c_1s, c_2s, c_2p0, etc. It is
found that c_1s is equal to √0.80 and c_2s = √0.15.

a) What is the maximum possible value for c_2p0? Explain your reasoning.
b) What is the minimum possible value for E that could correspond to this
function, based on the above data? Explain your reasoning.


7-9. A normalized trial wavefunction of the form (in a.u.)

$$\phi = \left[\frac{(2\zeta)^{7}}{4\pi\,6!}\right]^{1/2} r^{2} \exp(-\zeta r)$$

is variationally optimized to give the lowest possible energy for the hydrogen
atom. The results part way along this process are ∫ φ Ĥ φ dv = ζ²/10 − ζ/3.

a) Complete the variational process to obtain the optimum ζ and the minimized
average energy.

b) Calculate the value of c_1 in the expression φ_opt = c_1 ψ_1s + c_2 ψ_2s + ···.

c) Produce a new, normalized function χ that is orthogonal to ψ_1s but is otherwise
as similar as possible to φ_opt. (Use the unexpanded symbolic terms ψ_1s
and φ_opt in your answer.)

d) Suppose you are told that the average energy associated with your new function
χ is −0.133 a.u. Do you find this reasonable? Explain your answer.


7-10. φ = (α⁵/3π)^(1/2) r exp(−αr) is a normalized trial function for the hydrogen atom.
The energy is minimized when α = 3/2. Calculate the average value of the
potential energy predicted by this function if α = 3/2.
7-11. Prove that optimized trial function (7-20) must contain contributions from
continuum wavefunctions.


7-12. Compare the orbital exponent for a 1s AO in He as found by the variation method
[Eq. (7-35)] with that given by Slater's rules (Chapter 5).


7-13. A different trial function for calculating the polarizability of the hydrogen atom
in a uniform electric field of strength F is

$$\psi_{\mathrm{trial}} = \psi_{1s}(c_1 + c_2 z)$$

This is somewhat similar to the example in the text, since zψ_1s gives a p-like
function, but not exactly the 2p_z eigenfunction.

a) Use this form to find an expression for the minimum E as a function of F.
What value of E does this give for F = 0.1 a.u.? Can you suggest why this
trial function is superior to the one used in the text?

b) The polarizability is defined to be α in the expression

$$E = -\tfrac{1}{2} - \tfrac{1}{2}\alpha F^{2}$$

What value of α do you obtain? [Exact α = 4.5 a.u.] 1 a.u. of field strength is
equal to e/a_0². Deduce the value of 1 a.u. of polarizability.


7-14. Which hydrogen atom state should be more polarizable, the 1s or 2s? [Consider
the factors that determine the extent of mixing between basis functions.] Explain
your reasoning.


7-15. φ_a and φ_b are chosen to be a normalized set of basis functions for an LCAO
wavefunction for a one-electron homonuclear diatomic system. It is found that
the values for the integrals involving these functions are


IL = −2 a.u., ILE a.u.,


^ 1
f ioa − −]a.u., ID


Find an upper bound for the exact lowest electronic energy for this system. Find
the corresponding LCAO normalized approximate wavefunction.


7-16. φ_a and φ_b are chosen as the normalized basis functions for an LCAO wavefunction
for a one-electron, heteronuclear, diatomic molecule. It is found that the
values for some integrals involving these functions are


I^ dv = −2 a.u., | ede - a.u.,
1


^ 1
f doa --52u. f &o- -


where Ĥ is the molecular hamiltonian. Set up the secular determinantal equation
and find the lowest electronic energy that can be computed from an LCAO
wavefunction c_a φ_a + c_b φ_b. Find c_a and c_b such that E is minimized and the
wavefunction is normalized.
7-17. A possible basis function for representing the 1σ_g wavefunction of H2⁺ is a 1s-like
AO, (ζ³/π)^(1/2) exp(−ζr), located at the bond center. Assuming an internuclear
separation of 2 a.u., find the ζ value that minimizes E. Is this basis function
adequate to predict a bound H2⁺ molecule? [Use Appendix 3 to help you develop
your formulas.]


7-18. Without referring to the text, and by inspection, what is the united atom limit for
the 1σ_u molecular orbital's energy (in a.u.) for H2⁺?


7-19. Show that, at R = ∞, the ψ_+ and ψ_− wavefunctions for H2⁺ are capable of
describing a state wherein the electron is in a 1s orbital on atom A.


7-20. Evaluate Eqs. (7-89) and (7-90) at R = 0 to show that H_AA = H_AB at this point.


7-21. Examining Eq. (7-86), and letting H_AB = kH_AA, what relationship between k
and S_AB is necessary if the σ_g MO is to be lower in energy than the σ_u MO?
[Assume that H_AA is negative, and that k and S_AB are positive.]


7-22. Consider the one-electron molecule-ion HeH²⁺:

a) Write down the hamiltonian operator (nonrelativistic, Born-Oppenheimer
approximation) for the electronic energy in atomic units for this system.

b) Calculate the electronic energies for the lowest energy state of this system in
the separated atom and united atom limits.


7-23. For a homonuclear diatomic molecule aligned as shown in Fig. P7-23, characterize
each of the following MOs as σ, π, δ, and g or u, and bonding or antibonding.


a) 2py, ES 2py,
b) 2pz, T 2pz,
c) 3d + 3d
d) 3d,,, + 3dyy,
e) 3d,z, − 3dxz,


Figure P7-23 »


7-24. Characterize each of the following atomic orbitals with the symbols σ, π, δ, and
also g or u. Let the z axis be the reference axis for angular momentum.


Is 2p; 3p, 3d,
2s 2p, 3d» 3d,


7-25. Indicate whether you expect each of the following homonuclear diatomic MOs
to be bonding or antibonding. Sketch the MO in each case: (a) σ_u (b) π_u (c) δ_g
7-26. A homonuclear diatomic molecule MO of π_u symmetry is to be expressed as a
linear combination of AOs centered on the nuclei, which lie on the z axis. Which
of the AOs in the following list can contribute to the MO?

1s   2s   2p_x   2p_y   2p_z   3s   3p_x
3p_y   3p_z   3d_xy   3d_xz   3d_yz   3d_z²   3d_(x²−y²)


7-27. Use sketches and symmetry arguments to decide which of the following integrals
vanish for diatomic molecules (the x, y, and z axes are shown in Fig. P7-23):

a) ∫ 2p_z,a 1s_b dv
b) ∫ 2p_y,a 1s_b dv
c) ∫ 2p_z,a 2p_y,b dv
d) ∫ 2p_y,a 3d_yz,b dv
e) ∫ 2p_z,a 3d_yz,b dv
f) ∫ 1s_a Ĥ 2p_y,a dv
g) ∫ 1s_a Ĥ 2p_z,a dv


7-28. Show that, for reflection through a plane containing the nuclei, if the MO δ_(x²−y²)
is symmetric, then the MO δ_xy is antisymmetric. Show that the same is true for
π MOs constructed from d_xz and d_yz AOs. (The internuclear axis is assumed to
lie along the z coordinate.)


7-29. Assuming the internuclear axis to lie along the z coordinate, what are the possible
Mz quantum numbers for an MO constructed from 3d; − 32 ?


7-30. In a homonuclear diatomic correlation diagram, what MO symmetry symbols
(σ, π, δ, g, u) could correlate with each of the united atom AOs listed below?
Assume z to be the "old" internuclear axis. Indicate for each case whether this
united atom orbital is the terminus for a bonding or an antibonding MO. (a) 2p_z
(b) 2p_x (c) 3d_z² (d) 3d_xy


7-31. A homonuclear diatomic system has the ground-state MO configuration
lo tle, 2a, 20030. lala:

a) What is the net number of bonding electrons?

b) What spin multiplicity would you expect for the ground state?

c) What would you expect the effect to be on the dissociation energy of this
molecule of ionization (1) from the 1π_u MO? (2) from the 3σ_g MO?

d) Upon ionization (one-electron) from the 1π_g level, what would be the spin
multiplicity of the resulting ion?

e) To what type of united atom AO does the π_u MO correlate?


7-32. a) Without referring to the text, write out the ground state configuration for 05
using MO symmetry symbols (1σ_g, etc.)

b) What is the net number of bonding electrons?

c) How does the dissociation energy for this ion compare to that for O2?

d) What is the ground state term symbol for this ion?

e) Which occupied MOs may contain contributions from 2p, AOs, assuming z
to be the internuclear axis?
7-33. The reduced symmetry of heteronuclear (compared to homonuclear) diatomic
molecules results in their having a different correlation diagram. Set up a
correlation diagram for heteronuclear diatomics. Be sure to indicate that the energy
levels of each type of AO are not identical for the separated atoms. Comparing
correlation diagrams for homonuclear and heteronuclear molecules, does it seem
reasonable that He2 is unstable, whereas the isoelectronic LiH and LiHe⁺ are
stable molecules?


7-34. Following are some Slater orbital coefficients for some MOs of F2 calculated
by Ransil [11] (the 2pσ STOs are defined according to z axes pointing from each
atom toward the other):

1σ_g:  c_1s,A = c_1s,B = 0.70483          2σ_g:  c_1s,A = c_1s,B = 0.17327
       c_2s,A = c_2s,B = 0.00912                 c_2s,A = c_2s,B = −0.67160
       c_2pσ,A = c_2pσ,B = −0.00022              c_2pσ,A = c_2pσ,B = −0.08540

We see that the 1σ_g MO is almost entirely made from 1s AOs on A and B.
However, the 2σ_g MO contains what appears to be an anomalously large amount
of 1s AO. This turns out to be an artifact of the fact that Slater-type 2s orbitals
are not orthogonal to 1s AOs on the same center. For F2, the STO 1s, 2s overlap
is 0.2377. Use this fact to construct a new orbital, 2s′, that is orthogonal to 1s.
Express the 2σ_g MO of Ransil in terms of the basis functions 1s, 2s′, and 2pσ on
centers A and B. You should find the 1s coefficients much reduced.


7-35. R_e for H2⁺ equals 2.00 a.u. At this distance, E_elec = −1.1026 a.u. What is the
value of D_e for H2⁺?


Multiple Choice Questions 


(Try to answer these without referring to the text.) 


1. A homonuclear diatomic MO is given by φ = 2p_z,a + 2p_z,b, where the z axis is the
same as the internuclear axis. Which one of the following statements about φ is
correct?


a) φ is an antibonding MO, symbolized σ_u.
b) φ is a bonding MO, symbolized π_u.
c) φ is an antibonding MO, symbolized π_g.
d) φ is a bonding MO, symbolized σ_g.
e) φ is an antibonding MO, symbolized π_u.


2. For the three species N2, N2⁺, N2⁻, which one of the following orders for the bond
energy (i.e., bond strength) is most reasonable?


a) N22 Nj »N;
b) Nj 2No-N;
c) Ny >N2 >N}
d) N; >N} >No
e) Only N2 forms a bond; N2⁺ and N2⁻ do not.

3. According to the LCAO-MO model, which one of the following second period
diatomic molecules has a double bond in the ground electronic state?


a) Li2
b) Be2
c) B2
d) C2
e) N2


4. Which one of the following statements concerning H2⁺ is false?


a) The nondegenerate LCAO-MOs (without spin) must be either symmetric or anti- 
symmetric for inversion. 

b) The lowest energy MO (without spin) of the molecule is antisymmetric for 
inversion. 

c) The MOs transform into the AOs of the helium ion as the two nuclei are fused 
together. 

d) The ground state has a multiplicity of two. 

e) The Born-Oppenheimer approximation permits finding the solution for the purely 
electronic wave function at each value of the internuclear distance. 


5. Which one of the following statements is false for bonding MOs formed from linear 
combinations of AOs on atoms a and b? 


a) Only AOs that have nonzero overlap can form bonding MOs. 

b) Only AOs that have similar energies can form strongly bonding MOs. 

c) Bonding MOs cannot have a nodal plane perpendicular to the internuclear axis 
and midway between a and b. 

d) A p AO on b can combine with a p AO on a to form σ, π, or δ MOs.

e) A maximum of three bonding MOs can be formed from 2p AOs on a and b.


Chapter 8


The Simple Hückel Method 
and Applications 


8-1 The Importance of Symmetry


Our discussions of the particle in a box, the harmonic oscillator, the hydrogen atom, and 
homonuclear diatomic molecules have all included emphasis on the role that symmetry 
plays in determining the qualitative nature of the eigenfunctions. When we encounter 
larger systems, detailed and accurate solutions become much more difficult to perform 
and interpret, but symmetry continues to exert strong control over the solutions. 

In this chapter, we will describe a rather simple quantum chemical method that was 
formulated in the early 1930s by E. Hückel. One of the strengths of this method is 
that, by virtue of its crudeness and simplicity, the effects of symmetry and topology on 
molecular characteristics are easily seen. Also, the simplicity of the model makes it an 
excellent pedagogical tool for illustrating many quantum chemical concepts, such as 
bond order, electron densities, and orbital energies. Finally, the method and some of 
its variants continue to be useful for certain research applications. Indeed, it is difficult 
to argue against the proposition that every graduate student of organic and inorganic 
chemistry should be acquainted with the Hückel molecular orbital (HMO) method. 


8-2 The Assumption of σ-π Separability


The simple Hückel method was devised to treat electrons in unsaturated molecules like 
ethylene and benzene. By 1930 it was recognized that unsaturated hydrocarbons are 
chemically more reactive than are alkanes, and that their spectroscopic and thermody- 
namic properties are different too. The available evidence suggested the existence of 
loosely held electrons in unsaturated molecules. 

We have already seen that, when atoms combine to form a linear molecule, we
can distinguish between MOs of type σ, π, δ, ... depending on whether the MOs are
associated with an m quantum number of 0, ±1, ±2, ... Thus, in acetylene (C2H2), the
minimal basis set of AOs on carbon and hydrogen leads to σ and π MOs. Let us imagine
that our acetylene molecule is aligned along the z Cartesian axis. Then the p_x π-type
AOs on the carbons are antisymmetric for reflection through a plane containing the
molecular axis and the y axis. Similarly, the p_y π-type AOs are antisymmetric for
reflection through a plane containing the molecular axis and the x axis. The p_z AOs,
which are σ-type functions, are symmetric for reflection through any plane containing
the molecular axis. It has become standard practice to carry over the σ-π terminology
to planar (but nonlinear) molecules, where m is no longer a "good" quantum number.
In this expanded usage, a π orbital is one that is antisymmetric for reflection through
the plane of the molecule, a σ orbital being symmetric for that reflection.

Hückel found that, by treating only the π electrons explicitly, it is possible to reproduce
theoretically many of the observed properties of unsaturated molecules such as
the uniform C-C bond lengths of benzene, the high-energy barrier to internal rotation
about double bonds, and the unusual chemical stability of benzene. Subsequent work
by a large number of investigators has revealed many other useful correlations between
experiment and this simple HMO method for π electrons.

Treating only the π electrons explicitly and ignoring the σ electrons is clearly an
approximation, yet it appears to work surprisingly well. Physically, Hückel's approximation
may be viewed as one that has the π electrons moving in a potential field due
to the nuclei and a "σ core," which is assumed to be frozen as the π electrons move
about. Mathematically, the σ-π separability approximation is

$$E_{\mathrm{tot}} = E_{\sigma} + E_{\pi} \qquad (8\text{-}1)$$

where E_tot is taken to be the electronic energy E_el plus the internuclear repulsion
energy V_nn.

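Let us consider the implications of Eq. (8-1). We have already seen (Chapter 5)
that a sum of energies is consistent with a sum of hamiltonians and a product-type
wavefunction. This means that, if Eq. (8-1) is true, the wavefunction of our planar
molecule should be of the form (see Problem 8-1)

$$\psi(1, \ldots, n) = \psi_{\pi}(1, \ldots, k)\,\psi_{\sigma}(k+1, \ldots, n) \qquad (8\text{-}2)$$

and our hamiltonian should be separable into π and σ parts:

$$\hat{H}(1, 2, \ldots, n) = \hat{H}_{\pi}(1, 2, \ldots, k) + \hat{H}_{\sigma}(k+1, \ldots, n) \qquad (8\text{-}3)$$

Equations (8-2) and (8-3) lead immediately to Eq. (8-1):

$$E = \frac{\int \psi_{\pi}^{*}\psi_{\sigma}^{*}\,(\hat{H}_{\pi} + \hat{H}_{\sigma})\,\psi_{\pi}\psi_{\sigma}\, d\tau(1, \ldots, n)}{\int \psi_{\pi}^{*}\psi_{\sigma}^{*}\,\psi_{\pi}\psi_{\sigma}\, d\tau(1, \ldots, n)}
= \frac{\int \psi_{\pi}^{*}\hat{H}_{\pi}\psi_{\pi}\, d\tau(1, \ldots, k)}{\int \psi_{\pi}^{*}\psi_{\pi}\, d\tau(1, \ldots, k)} + \frac{\int \psi_{\sigma}^{*}\hat{H}_{\sigma}\psi_{\sigma}\, d\tau(k+1, \ldots, n)}{\int \psi_{\sigma}^{*}\psi_{\sigma}\, d\tau(k+1, \ldots, n)}
= E_{\pi} + E_{\sigma} \qquad (8\text{-}4)$$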

If these equations were valid, one could ignore Yo and legitimately minimize E; by 
varying Yr, But the equations are not valid because it is impossible to rigorously satisfy 
Eq. (8-3). We cannot define A, and Hz so that they individually depend completely 
on separate groups of electrons and still sum to the correct total hamiltonian. Writing 
these operators explicitly gives 


$$\hat{H}_\pi(1,\ldots,k) = -\frac{1}{2}\sum_{i=1}^{k}\nabla_i^2 + \sum_{i=1}^{k} V_{ne}(i) + \frac{1}{2}\sum_{i=1}^{k}\sum_{j=1,\,j\neq i}^{k}\frac{1}{r_{ij}} \tag{8-5}$$

$$\hat{H}_\sigma(k+1,\ldots,n) = -\frac{1}{2}\sum_{i=k+1}^{n}\nabla_i^2 + \sum_{i=k+1}^{n} V_{ne}(i) + \frac{1}{2}\sum_{i=k+1}^{n}\sum_{j=k+1,\,j\neq i}^{n}\frac{1}{r_{ij}} \tag{8-6}$$

where $V_{ne}(i)$ represents the attraction between electron i and all the nuclei. These 
hamiltonians do indeed depend on the separate groups of electrons, but they leave out 
the operators for repulsion between σ and π electrons: 


$$\sum_{i=1}^{k}\sum_{j=k+1}^{n}\frac{1}{r_{ij}} \tag{8-7}$$


In short, the σ and π electrons really do interact with each other, and the fact that the 
HMO method does not explicitly include such interactions must be kept in mind when 
we consider the applicability of the method to certain problems. Some account of σ-π 
interactions is included implicitly in the method, as we shall see shortly. 


8-3 The Independent π-Electron Assumption 


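The HMO method assumes further that the wavefunction $\psi_\pi$ is a product of one-electron 
functions and that the hamiltonian $\hat{H}_\pi$ is a sum of one-electron operators. Thus, for 
n π electrons, 

$$\psi_\pi(1,2,\ldots,n) = \phi_i(1)\,\phi_j(2)\cdots\phi_l(n) \tag{8-8}$$

$$\hat{H}_\pi(1,2,\ldots,n) = \hat{H}_\pi(1) + \hat{H}_\pi(2) + \cdots + \hat{H}_\pi(n) \tag{8-9}$$

and 

$$\frac{\int \phi_i^*(1)\,\hat{H}_\pi(1)\,\phi_i(1)\,d\tau(1)}{\int \phi_i^*(1)\,\phi_i(1)\,d\tau(1)} = E_i \tag{8-10}$$

It follows that the total π energy $E_\pi$ is a sum of one-electron energies: 

$$E_\pi = E_i + E_j + \cdots + E_l \tag{8-11}$$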


This means that the π electrons are being treated as though they are independent of each 
other, since $E_i$ depends only on $\phi_i$ and is not influenced by the presence or absence of 
an electron in $\phi_j$. However, this cannot be correct because π electrons in fact interact 
strongly with each other. Once again, such interactions will be roughly accounted for 
in an implicit way by the HMO method. 

The implicit inclusion of interelectronic interactions is possible because we never 
actually write down a detailed expression for the π one-electron hamiltonian operator 
$\hat{H}_\pi(i)$. (We cannot write it down because it results from a π-σ separability assumption 
and an independent π-electron assumption, and both assumptions are incorrect.) $\hat{H}_\pi(i)$ 
is considered to be an "effective" one-electron operator, that is, an operator that somehow 
includes the important physical interactions of the problem so that it can lead to a reasonably 
correct energy value $E_i$. A key point is that the HMO method ultimately evaluates 
$E_i$ via parameters that are evaluated by appeal to experiment. Hence, it is a semiempirical 
method. Since the experimental numbers must include effects resulting from 
all the interelectronic interactions, it follows that these effects are implicitly included 
to some extent in the HMO method through its parameters. 

It was pointed out in Chapter 5 that, when the independent electron approximation 
[Eqs. (8-8)-(8-11)] is taken, all states belonging to the same configuration become 
degenerate. In other words, considerations of space-spin symmetry do not affect the 
energy in that approximation. Therefore, the HMO method can make no explicit use of 
spin orbitals or Slater determinants, and so $\psi_\pi$ is normally taken to be a single product 
function as in Eq. (8-8). The Pauli principle is provided for by assigning no more than 
two electrons to a single MO. 


EXAMPLE 8-1 If O2 were treated by the HMO method, what would be the form of 
the wavefunction and energy for the ground state? 

SOLUTION » The ground state configuration for O2 is 
$1\sigma_g^2\,1\sigma_u^2\,2\sigma_g^2\,2\sigma_u^2\,3\sigma_g^2\,1\pi_{u,x}^2\,1\pi_{u,y}^2\,1\pi_{g,x}\,1\pi_{g,y}$, 
where we have shown the degenerate members of π levels explicitly and in their 
real forms. The HMO wavefunction is simply a product of the pi MOs, one for each of the 
six pi electrons: $1\pi_{u,x}(1)\,1\pi_{u,x}(2)\,1\pi_{u,y}(3)\,1\pi_{u,y}(4)\,1\pi_{g,x}(5)\,1\pi_{g,y}(6)$. The HMO energy is 
$2E_{\pi,u,x} + 2E_{\pi,u,y} + E_{\pi,g,x} + E_{\pi,g,y}$, which reduces to $4E_{\pi,u} + 2E_{\pi,g}$. Note that, because O2 is 
linear, there is no unique molecular plane containing the internuclear axis. Therefore this molecule 
has two sets of π MOs, one pair pointing in the x direction, the other pair pointing along y. For a 
planar molecule, only one of these pairs would qualify as π MOs, as will be seen in the next section. 


8-4 Setting up the Hückel Determinant 


8-4.A Identifying the Basis Atomic Orbitals and Constructing 
a Determinant 


The allyl radical, C3H5, is a planar molecule¹ with three unsaturated carbon centers 
(see Fig. 8-1). The minimal basis set of AOs for this molecule consists of a 1s AO 
on each hydrogen and 1s, 2s, $2p_x$, $2p_y$, and $2p_z$ AOs on each carbon. Of all these 
AOs only the $2p_z$ AOs at the three carbons are antisymmetric for reflection through the 
molecular plane. 


Figure 8-1 » Sketch of the nuclear framework for the allyl radical. All the nuclei are coplanar. 
The z axis is taken to be perpendicular to the plane containing the nuclei. 


¹ The minimum energy conformation of the allyl system is planar. We will ignore the deviations from planarity 
resulting from vibrational bending of the system. 

Figure 8-2 » The three π-type AOs in the minimal basis set of the allyl radical. 


Following Hückel, we ignore all the σ-type AOs and take the three $2p_z$ AOs as our 
set of basis functions. Notice that this restricts us to the carbon atoms: the hydrogens 
are not treated explicitly in the simple HMO method. We label our three basis functions 
$\chi_1$, $\chi_2$, $\chi_3$ as indicated in Fig. 8-2. We will assume these AOs to be normalized. 

Suppose that we now perform a linear variation calculation using this basis set. We 
know this will lead to a 3 x 3 determinant having roots that are MO energies which can 
be used to obtain MO coefficients. The determinantal equation is 


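$$\begin{vmatrix}
H_{11}-ES_{11} & H_{12}-ES_{12} & H_{13}-ES_{13} \\
H_{21}-ES_{21} & H_{22}-ES_{22} & H_{23}-ES_{23} \\
H_{31}-ES_{31} & H_{32}-ES_{32} & H_{33}-ES_{33}
\end{vmatrix} = 0 \tag{8-12}$$

where 

$$H_{ij} = \int \chi_i^*\,\hat{H}_\pi\,\chi_j\,dv \tag{8-13}$$

$$S_{ij} = \int \chi_i^*\,\chi_j\,dv \tag{8-14}$$

Since $H_{ij}$ and $S_{ij}$ are integrals over the space coordinates of a single electron, the 
electron index is suppressed in Eqs. (8-13) and (8-14). 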


8-4.B The Quantity α 


We have already indicated that there is no way to write an explicit expression for $\hat{H}_\pi$ 
that is both consistent with our separability assumptions and physically correct. But, 
without an expression for $\hat{H}_\pi$, how can we evaluate the integrals $H_{ij}$? The HMO 
method sidesteps this problem by carrying certain of the $H_{ij}$ integrals along as symbols 
until they can be evaluated empirically by matching theory with experiment. 

Let us first consider the integrals $H_{11}$, $H_{22}$, and $H_{33}$. The interpretation consistent 
with these integrals is that $H_{11}$, for instance, is the average energy of an electron in AO 
$\chi_1$ experiencing a potential field due to the entire molecule. Symmetry requires that 
$H_{11} = H_{33}$. $H_{22}$ should be different since an electron in AO $\chi_2$ experiences a different 
environment than it does when in $\chi_1$ or $\chi_3$. It seems likely, however, that $H_{22}$ is not 
very different from $H_{11}$. In each case, we expect the dominant part of the potential to 
arise from interactions with the local carbon atom, with more distant atoms playing a 
secondary role. Hence, one of the approximations made in the HMO method is that all 
$H_{ii}$ are identical if $\chi_i$ is on a carbon atom. The symbol α is used for such integrals. 
Thus, for the example at hand, $H_{11} = H_{22} = H_{33} = \alpha$. The quantity α is often called 
the coulomb integral.² 


8-4.C The Quantity β 


Next, we consider the resonance integrals or bond integrals $H_{12}$, $H_{23}$, and $H_{13}$. (The 
requirement that $\hat{H}_\pi$ be hermitian plus the fact that the χ’s and $\hat{H}_\pi$ are real suffices 
to make these equal to $H_{21}$, $H_{32}$, and $H_{31}$, respectively.) The interpretation consistent 
with these integrals is that $H_{12}$, for instance, is the energy of the overlap charge between 
$\chi_1$ and $\chi_2$. Symmetry requires that $H_{12} = H_{23}$ in the allyl system. However, even when 
symmetry does not require it, the assumption is made that all $H_{ij}$ are equal to the same 
quantity (called β) when i and j refer to "neighbors" (i.e., atoms connected by a σ 
bond). It is further assumed that $H_{ij} = 0$ when i and j are not neighbors. Therefore, 
in the allyl case, 

$$H_{12} = H_{23} = \beta, \qquad H_{13} = 0.$$


8-4.D Overlap Integrals 


Since the χ's are normalized, $S_{ii} = 1$. The overlaps between neighbors are typically 
around 0.3. Nevertheless, in the HMO method, all $S_{ij}$ ($i \neq j$) are taken to be zero. 
Although this seems a fairly drastic approximation, it has been shown to have little 
effect on the qualitative nature of the solutions. 


8-4.E Further Manipulation of the Determinant 


Our determinantal equation for the allyl system is now much simplified. It is 


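$$\begin{vmatrix}
\alpha - E & \beta & 0 \\
\beta & \alpha - E & \beta \\
0 & \beta & \alpha - E
\end{vmatrix} = 0 \tag{8-15}$$

Dividing each row of the determinant by β corresponds to dividing the whole determinant 
by $\beta^3$. This will not affect the equality. Letting $(\alpha - E)/\beta = x$, we obtain the 
result 

$$\begin{vmatrix}
x & 1 & 0 \\
1 & x & 1 \\
0 & 1 & x
\end{vmatrix} = 0 \tag{8-16}$$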


² The term “coulomb integral” for α is unfortunate since the same name is used for repulsion integrals of the 
form $\int \chi_1(1)\chi_2(2)(1/r_{12})\chi_1(1)\chi_2(2)\,dv$. The quantity α also contains kinetic energy and nuclear-electronic 
attraction energy. 

Figure 8-3 » HMO determinants for some small systems. 


which is the form we will refer to as the HMO determinantal equation. Notice that x 
occurs on the principal diagonal, 1 appears in positions where the indices correspond 
to a bond, 0 appears in positions (e.g., 1,3) corresponding to no bond. This gives us a 
simple prescription for writing the HMO determinant for any unsaturated hydrocarbon 
system directly from a sketch of the molecular structure. The rules are (1) sketch 
the framework defined by the n unsaturated carbons; (2) number the atoms 1, …, n 
(the ordering of numbers is arbitrary); (3) fill in the n × n determinant with x’s on the 
diagonal, 1’s in positions where row and column indices correspond to bonds, 0’s elsewhere. 
See Fig. 8-3 for examples. As a check, it is useful to be sure that the determinant is 
symmetric for reflection through the diagonal of x's. This is necessary since, if atoms 
i and j are neighbors, 1’s must appear in positions i, j and j,i of the determinant. 

Since the Hückel determinant contains only information about the number of unsat- 
urated carbons and how they are connected together, it is sometimes referred to as a 
topological determinant. (Topology refers to properties that are due to the connected- 
ness of a figure, but are unaffected by twisting, bending, etc.) 
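
The three rules for writing the determinant translate almost directly into code. The following is a minimal Python sketch, not part of the original text: the names `huckel_matrix` and `det` are illustrative, atoms are numbered from 1 as in the rules, and the determinant is expanded by cofactors, which is only practical for small systems.

```python
import math


def huckel_matrix(n_atoms, bonds, x):
    """Entries of the n x n HMO determinant for a chosen numerical value of x.

    bonds is a list of (i, j) pairs of bonded atoms, numbered from 1.
    """
    m = [[0.0] * n_atoms for _ in range(n_atoms)]
    for i in range(n_atoms):
        m[i][i] = x                     # x on the principal diagonal
    for i, j in bonds:
        m[i - 1][j - 1] = 1.0           # 1 where the indices correspond to a bond
        m[j - 1][i - 1] = 1.0           # the determinant must be symmetric
    return m


def det(m):
    """Determinant by cofactor expansion along the first row (small n only)."""
    if len(m) == 1:
        return m[0][0]
    total = 0.0
    for col, entry in enumerate(m[0]):
        if entry == 0:
            continue
        minor = [row[:col] + row[col + 1:] for row in m[1:]]
        total += (-1) ** col * entry * det(minor)
    return total


allyl_bonds = [(1, 2), (2, 3)]
# The determinant vanishes (to rounding error) when x is a root.
print(det(huckel_matrix(3, allyl_bonds, math.sqrt(2))))
```

Because only the bond list enters, renumbering the atoms changes the layout of the matrix but not the values of x at which the determinant vanishes.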


8-5 Solving the HMO Determinantal Equation 
for Orbital Energies 


The HMO determinantal equation for the allyl system (8-16) can be expanded to give 

$$x^3 - 2x = 0 \tag{8-17}$$

or 

$$x(x^2 - 2) = 0 \tag{8-18}$$

Thus, the roots are $x = 0$, $x = \sqrt{2}$, and $x = -\sqrt{2}$. Recalling the definition of x, these 
roots correspond respectively to the energies $E = \alpha$, $E = \alpha - \sqrt{2}\beta$, $E = \alpha + \sqrt{2}\beta$. 

How should we interpret these results? Since α is supposed to be the energy of 
a pi electron in a carbon 2p AO in the molecule, we expect this quantity to be negative 
(corresponding to a bound electron). Since β refers to an electron in a bond 
region, it too should be negative. Therefore, the lowest-energy root should be 
$E_1 = \alpha + \sqrt{2}\beta$, followed by $E_2 = \alpha$, with $E_3 = \alpha - \sqrt{2}\beta$ being the highest-energy root. 
(It is convenient to number the orbital energies sequentially, starting with the lowest, 
as we have done here.) 

Figure 8-4 » π-Electron configurations and total energies for the ground states of the allyl cation 
(C3H5+), radical (C3H5), and anion (C3H5−). The levels shown lie at α − √2β, α, and α + √2β. 

We have just seen that bringing three $2p_\pi$ AOs together in a linear arrangement 
causes a splitting into three MO energy levels. This is similar to the splitting into two 
energy levels produced when two 1s AOs interact, discussed in connection with $H_2^+$. In 
general, n linearly independent separated AOs will lead to n linearly independent MOs. 

The ground-state π-electron configuration of the allyl system is built up by putting 
electrons in pairs into the MOs, starting with those of lowest energy. Thus far, we have 
been describing our system as the allyl radical. However, since we have as yet made 
no use of the number of π electrons in the system, our results so far apply equally well 
for the allyl cation, radical, or anion. 

Configurations and total π energies for these systems in their ground states are 
depicted in Fig. 8-4. The total π-electron energies are obtained by summing the one- 
electron energies, as indicated earlier. 
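
The filling procedure can be sketched in a few lines of Python. This is an illustrative helper, not part of the original text: `total_pi_energy` is a hypothetical name, and the result is returned as a pair (a, b) meaning $E_\pi = a\alpha + b\beta$, using $E = \alpha - x\beta$ for each root x and assuming β is negative so that the most negative root is filled first.

```python
import math


def total_pi_energy(x_roots, n_electrons):
    """Return (a, b) such that E_pi = a*alpha + b*beta for the ground state.

    Each root x corresponds to an orbital energy E = alpha - x*beta. With
    beta < 0, the most negative x is the lowest level, so levels are filled
    in order of increasing x, two electrons at a time.
    """
    a = 0.0
    b = 0.0
    remaining = n_electrons
    for x in sorted(x_roots):
        occ = min(2, remaining)
        a += occ
        b += -x * occ
        remaining -= occ
        if remaining == 0:
            break
    return a, b


allyl_roots = [0.0, math.sqrt(2), -math.sqrt(2)]
for name, n in [("cation", 2), ("radical", 3), ("anion", 4)]:
    print(name, total_pi_energy(allyl_roots, n))
```

For the allyl system the β coefficient comes out the same (2√2) for all three species, because the extra electrons enter the level at E = α.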


EXAMPLE 8-2 For a planar, unsaturated hydrocarbon having formula $C_xH_y$, where 
all the carbons are part of the unsaturated framework, how many pi MOs are there? 

SOLUTION » Each carbon atom brings one $2p_\pi$ AO into the basis set, so there are x basis AOs. 
These x independent AOs mix to form x independent MOs. 


8-6 Solving for the Molecular Orbitals 


We still have to find the coefficients that describe the MOs as linear combinations 
of AOs. Recall from Chapter 7 that this is done by substituting energy roots of the 
secular determinant back into the simultaneous equations. For the allyl system, the 
simultaneous equations corresponding to the secular determinant (8-16) are 


$$c_1 x + c_2 = 0 \tag{8-19}$$

$$c_1 + c_2 x + c_3 = 0 \tag{8-20}$$

$$c_2 + c_3 x = 0 \tag{8-21}$$

(Compare these equations with the secular determinant in Eq. (8-16) and note the 
obvious relation.) As we noted in Chapter 7, homogeneous equations like these can give 
us only ratios between $c_1$, $c_2$, and $c_3$, not their absolute values. So we anticipate using 
only two of these equations and obtaining absolute values by satisfying the normality 
condition. Because we are neglecting overlap between AOs, the latter step corresponds 
to requiring 

$$c_1^2 + c_2^2 + c_3^2 = 1 \tag{8-22}$$

The roots x are, in order of increasing energy, $-\sqrt{2}$, 0, $+\sqrt{2}$. Let us take $x = -\sqrt{2}$ 
first. Then 

$$-\sqrt{2}\,c_1 + c_2 = 0 \tag{8-23a}$$

$$c_1 - \sqrt{2}\,c_2 + c_3 = 0 \tag{8-23b}$$

$$c_2 - \sqrt{2}\,c_3 = 0 \tag{8-23c}$$

Comparing Eqs. (8-23a) and (8-23c) gives $c_1 = c_3$. Equation (8-23a) gives $c_2 = \sqrt{2}\,c_1$. 
Inserting these relations into the normality equation (8-22) gives 

$$c_1^2 + (\sqrt{2}\,c_1)^2 + c_1^2 = 1 \tag{8-24}$$

$$4c_1^2 = 1, \qquad c_1 = \pm\frac{1}{2} \tag{8-25}$$

It makes no difference which sign we choose for $c_1$ since any wavefunction is equivalent 
to its negative. (Both give the same $\psi^2$.) Choosing $c_1 = +\frac{1}{2}$ gives 

$$c_1 = \frac{1}{2}, \qquad c_2 = \frac{1}{\sqrt{2}}, \qquad c_3 = \frac{1}{2} \tag{8-26}$$

These coefficients define our lowest-energy MO, $\phi_1$: 

$$\phi_1 = \frac{1}{2}\chi_1 + \frac{1}{\sqrt{2}}\chi_2 + \frac{1}{2}\chi_3 \tag{8-27}$$

A similar approach may be taken for $x = 0$ and $x = +\sqrt{2}$. The results are 

$$(x = 0):\qquad \phi_2 = \frac{1}{\sqrt{2}}\chi_1 - \frac{1}{\sqrt{2}}\chi_3 \tag{8-28}$$

$$(x = +\sqrt{2}):\qquad \phi_3 = \frac{1}{2}\chi_1 - \frac{1}{\sqrt{2}}\chi_2 + \frac{1}{2}\chi_3 \tag{8-29}$$


The allyl system MOs are sketched in Fig. 8-5. 

The lowest-energy MO, $\phi_1$, has no nodes (other than the molecular-plane node 
common to all π MOs) and is said to be bonding in the C1-C2 and C2-C3 regions. It 
is reasonable that such a bonding MO should have an energy wherein the bond-related 
term β acts to lower the energy, as is true here. The second-lowest energy MO, $\phi_2$, 
has a nodal plane at the central carbon. Because there are no π AOs on neighboring 
carbons in this MO, there are no interactions at all, and β is absent from the energy 
expression. This MO is said to be nonbonding. The high-energy MO, $\phi_3$, has nodal 
planes intersecting both bonds. Because the π AOs show sign disagreement across both 
bonds, this MO is everywhere antibonding and β terms act to raise the orbital energy 
above α. 
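
As a quick numerical check on these MOs, one can substitute each root and its coefficients back into the simultaneous equations (8-19) to (8-21) and the normality condition (8-22). The Python sketch below is illustrative only; `secular_residuals` and `is_normalized_solution` are hypothetical helper names, and the tolerance is an arbitrary choice.

```python
import math


def secular_residuals(x, c):
    """Left-hand sides of Eqs. (8-19) to (8-21) for the allyl system."""
    c1, c2, c3 = c
    return (c1 * x + c2, c1 + c2 * x + c3, c2 + c3 * x)


def is_normalized_solution(x, c, tol=1e-12):
    """True if c satisfies the secular equations for root x and Eq. (8-22)."""
    residuals_ok = all(abs(r) < tol for r in secular_residuals(x, c))
    norm_ok = abs(sum(ci ** 2 for ci in c) - 1.0) < tol
    return residuals_ok and norm_ok


s = 1 / math.sqrt(2)
allyl_mos = {
    -math.sqrt(2): (0.5, s, 0.5),    # phi_1, bonding
    0.0: (s, 0.0, -s),               # phi_2, nonbonding
    math.sqrt(2): (0.5, -s, 0.5),    # phi_3, antibonding
}
for x, c in allyl_mos.items():
    print(x, is_normalized_solution(x, c))
```

Pairing a coefficient set with the wrong root makes the residuals nonzero, which is a convenient way to catch sign errors when working such problems by hand.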


Figure 8-5 » Sketches of the allyl system MOs. (a) emphasizes AO signs and magnitudes. 
(b) resembles more closely the actual contours of the MOs. 

EXAMPLE 8-3 According to HMO theory, do the π electrons favor a linear, or a 
bent allyl radical? 

SOLUTION » HMO theory favors neither. The difference between linear and bent allyl shows 
up as a difference in C1-C2-C3 angle and in C1 to C3 distance. The HMO method has no 
angular-dependent features and explicitly omits interactions between non-neighbor carbons, like 
C1 and C3. 


8-7 The Cyclopropenyl System: Handling Degeneracies 


The allyl system results when three π AOs interact in a linear arrangement wherein 
$H_{12} = H_{23} = \beta$, but $H_{13} = 0$. We can also treat the situation where the three π AOs 
approach each other on vertices of an ever-shrinking equilateral triangle. In this case, 
each AO interacts equally with the other two. This triangular system is the cyclopropenyl 
system C3H3 shown in Fig. 8-6. 

The HMO determinantal equation for this system is 


$$\begin{vmatrix}
x & 1 & 1 \\
1 & x & 1 \\
1 & 1 & x
\end{vmatrix} = 0, \qquad x^3 + 2 - 3x = 0 \tag{8-30}$$

Figure 8-6 » The cyclopropenyl system (all nuclei are coplanar). 

This equation can be factored as 

$$(x + 2)(x - 1)(x - 1) = 0 \tag{8-31}$$

Therefore, the roots are x = −2, +1, +1. 

Since the root x = 1 occurs twice, we can expect there to be two independent HMOs 
having the same energy, that is, a doubly degenerate level. The energy scheme and ground 
state electron configuration for the cyclopropenyl radical (three π electrons) (I) gives 
a total $E_\pi$ of 3α + 3β. We can surmise from these orbital energies that $\phi_1$ is a bonding 
MO, whereas $\phi_2$ and $\phi_3$ are predominantly antibonding. To see if this is reflected in the 
nodal properties of the MOs, let us solve for the coefficients. The equations consistent 
with the HMO determinant and with orbital normality are 

$$\begin{aligned}
c_1 x + c_2 + c_3 &= 0 \\
c_1 + c_2 x + c_3 &= 0 \\
c_1 + c_2 + c_3 x &= 0 \\
c_1^2 + c_2^2 + c_3^2 &= 1
\end{aligned} \tag{8-32}$$

Setting x = −2 and solving gives 

$$\phi_1 = \frac{1}{\sqrt{3}}\chi_1 + \frac{1}{\sqrt{3}}\chi_2 + \frac{1}{\sqrt{3}}\chi_3 \tag{8-33}$$

(I) Energy scheme: 
    $E_2 = E_3 = \alpha - \beta$   (x = +1) 
    $E_1 = \alpha + 2\beta$   (x = −2) 

For this MO, the coefficients are all of the same sign, and so the AOs show phase 
agreement across all bonds and all interactions are bonding. 

To find $\phi_2$ and $\phi_3$ is trickier. We begin by inserting x = +1 into our simultaneous 
equations. This gives 

$$c_1 + c_2 + c_3 = 0 \quad \text{(three times)} \tag{8-34}$$

$$c_1^2 + c_2^2 + c_3^2 = 1 \tag{8-35}$$

With three unknowns and two equations, an infinite number of solutions is possible. 
Let us pick a convenient one: $c_1 = -c_2$, $c_3 = 0$. The normalization requirement then 
gives $c_1 = 1/\sqrt{2}$, $c_2 = -1/\sqrt{2}$, $c_3 = 0$. Let us call this solution $\phi_2$: 

$$\phi_2 = \frac{1}{\sqrt{2}}\chi_1 - \frac{1}{\sqrt{2}}\chi_2 \tag{8-36}$$

We still need to find $\phi_3$. There remain an infinite number of possibilities, so let us pick 
one: $c_1 = 1/\sqrt{2}$, $c_2 = 0$, $c_3 = -1/\sqrt{2}$. We have used our experience with $\phi_2$ to choose 
c’s that guarantee a normalized $\phi_3$. Also, it is clear that $\phi_3$ is linearly independent of 
$\phi_2$ since they contain different AOs. But it is desirable to have $\phi_3$ orthogonal to $\phi_2$. 
Let us test $\phi_2$ and $\phi_3$ to see if they are orthogonal: 

$$S = \int \phi_2\phi_3\,dv = \frac{1}{2}\int(\chi_1 - \chi_2)(\chi_1 - \chi_3)\,dv
= \frac{1}{2}\Big[\underbrace{\int\chi_1^2\,dv}_{1} - \underbrace{\int\chi_1\chi_3\,dv}_{0} - \underbrace{\int\chi_2\chi_1\,dv}_{0} + \underbrace{\int\chi_2\chi_3\,dv}_{0}\Big] = \frac{1}{2} \tag{8-37}$$

Since $S \neq 0$, $\phi_2$ and $\phi_3$ are nonorthogonal. We can project out that part of $\phi_3$ that 
is orthogonal to $\phi_2$ by using the Schmidt orthogonalization procedure described in 
Section 6-10. We seek a new function $\phi_3'$ given by 

$$\phi_3' = \phi_3 - S\phi_2 \tag{8-38}$$

where 

$$S = \int \phi_2\phi_3\,dv = \frac{1}{2} \tag{8-39}$$

Therefore, 

$$\phi_3' = \phi_3 - \frac{1}{2}\phi_2 = \frac{1}{2\sqrt{2}}(\chi_1 + \chi_2 - 2\chi_3) \tag{8-40}$$

This function is orthogonal to $\phi_2$ but is not normalized. Renormalizing gives 

$$\phi_3'' = \frac{1}{\sqrt{6}}(\chi_1 + \chi_2 - 2\chi_3) \tag{8-41}$$

In summary, to produce HMO coefficients for degenerate MOs, pick any two indepen- 
dent solutions from the infinite choice available, and orthogonalize one of them to the 
other using the Schmidt (or any other) orthogonalization procedure. 
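
The recipe can be sketched in Python. This is a minimal illustration rather than a general routine: MOs are represented as lists of AO coefficients, the HMO assumption of zero overlap between different AOs is used so that the overlap integral reduces to a dot product, and the function names are illustrative.

```python
import math


def overlap(u, v):
    """Overlap integral between two MOs under the HMO zero-overlap assumption."""
    return sum(a * b for a, b in zip(u, v))


def normalize(v):
    norm = math.sqrt(overlap(v, v))
    return [a / norm for a in v]


def schmidt(phi_keep, phi_fix):
    """Orthogonalize phi_fix to the normalized MO phi_keep, then renormalize.

    Implements phi' = phi_fix - S * phi_keep, with S the overlap of the two.
    """
    s = overlap(phi_keep, phi_fix)
    return normalize([b - s * a for a, b in zip(phi_keep, phi_fix)])


r2 = math.sqrt(2)
phi2 = [1 / r2, -1 / r2, 0.0]        # Eq. (8-36)
phi3 = [1 / r2, 0.0, -1 / r2]        # trial function, overlap 1/2 with phi2
phi3_new = schmidt(phi2, phi3)
print(phi3_new)                      # proportional to (1, 1, -2)
print(overlap(phi2, phi3_new))       # zero to rounding error
```

Starting from a different trial function for the second MO gives a different, equally valid orthonormal pair spanning the same degenerate level.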

The MOs for the cyclopropenyl system as seen from above the molecular plane are 
sketched in Fig. 8-7. The MO $\phi_2$ can be seen to have both antibonding (C1-C2) and 
nonbonding (C1-C3, C2-C3) interactions. $\phi_3''$ has antibonding (C1-C3, C2-C3) and 
bonding (C1-C2) interactions. The interactions are of such size and number as to give 
an equal net energy value (α − β) in each case. Since nodal planes produce antibonding 
or nonbonding situations, it is not surprising that higher and higher-energy HMOs in a 
system display more and more nodal planes. Notice that the MOs $\phi_2$ and $\phi_3''$ have the 
same number of nodal planes (one, not counting the one in the molecular plane) but 
that these planes are perpendicular to each other. This is a common feature of some 
degenerate, orthogonal MOs in cyclic molecules. 

Figure 8-7 » The HMOs for the cyclopropenyl system: (a) $\phi_1 = (1/\sqrt{3})(\chi_1 + \chi_2 + \chi_3)$; 
(b) $\phi_2 = (1/\sqrt{2})(\chi_1 - \chi_2)$; (c) $\phi_3'' = (1/\sqrt{6})(\chi_1 + \chi_2 - 2\chi_3)$. The nodal planes intersect the molecular plane 
at the dashed lines. 

It is important to notice the symmetry characteristics of these MOs. $\phi_1$ is either 
symmetric or antisymmetric for every symmetry operation of the molecule. (It is 
antisymmetric for reflection through the molecular plane, symmetric for rotation about 
the threefold axis, etc.) This must be so for any nondegenerate MO. But the degenerate 
MOs $\phi_2$ and $\phi_3''$ are neither symmetric nor antisymmetric for certain operations. ($\phi_2$ is 
antisymmetric for reflection through the plane indicated by the dashed line in Fig. 8-7, 
but is neither symmetric nor antisymmetric for rotation about the threefold axis by 
120°.) In fact, one can easily show that, given a cycle with an odd number of centers, 
each with one AO of a common type, there is but one way to combine the AOs (to form a 
real MO) so that the result is symmetric or antisymmetric for all rotations and reflections 
of the cycle. Hence, an HMO calculation for a three-, five-, seven-, ... membered ring 
can give only one nondegenerate MO. However, for a cycle containing an even number 
of centers, the analogous argument shows that two nondegenerate MOs exist. 


8-8 Charge Distributions from HMOs 


Now that we have a method that provides us with orbitals and orbital energies, it should 
be possible to get information about the way the π-electron charge is distributed in the 
system by squaring the total wavefunction $\psi_\pi$. In the case of the neutral allyl radical, 
we have (taking $\psi_\pi$ to be a simple product of MOs) 

$$\psi_\pi = \phi_1(1)\,\phi_1(2)\,\phi_2(3) \tag{8-42}$$

Hence, the probability for simultaneously finding electron 1 in dv(1), electron 2 in 
dv(2) and electron 3 in dv(3) is 

$$\psi_\pi^2(1,2,3)\,dv(1)\,dv(2)\,dv(3) = \phi_1^2(1)\,\phi_1^2(2)\,\phi_2^2(3)\,dv(1)\,dv(2)\,dv(3) \tag{8-43}$$

For most physical properties of interest, we need to know the probability for finding an 
electron in a three-dimensional volume element dv. Since the probability for finding 
an electron in dv is the sum of the probabilities for finding each electron there, the 
one-electron density function ρ for the allyl radical is 

$$\rho = 2\phi_1^2 + \phi_2^2 \tag{8-44}$$

where we have suppressed the index for the electron. If we integrate ρ over all space, 
we obtain a value of three. This means we are certain of finding a total π charge 
corresponding to three π electrons in the system. 

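To find out how the π charge is distributed in the molecule, let us express ρ in terms 
of AOs. First, we write $\phi_1^2$ and $\phi_2^2$ separately: 

$$\begin{aligned}
\phi_1^2 &= \frac{1}{4}\chi_1^2 + \frac{1}{2}\chi_2^2 + \frac{1}{4}\chi_3^2 + \frac{1}{\sqrt{2}}\chi_1\chi_2 + \frac{1}{\sqrt{2}}\chi_2\chi_3 + \frac{1}{2}\chi_1\chi_3 \\
\phi_2^2 &= \frac{1}{2}\chi_1^2 + \frac{1}{2}\chi_3^2 - \chi_1\chi_3
\end{aligned} \tag{8-45}$$

If we were to integrate $\phi_1^2$ we would obtain 

$$\begin{aligned}
\int \phi_1^2\,dv &= \frac{1}{4}\underbrace{\int\chi_1^2\,dv}_{1} + \frac{1}{2}\underbrace{\int\chi_2^2\,dv}_{1} + \frac{1}{4}\underbrace{\int\chi_3^2\,dv}_{1} + \frac{1}{\sqrt{2}}\underbrace{\int\chi_1\chi_2\,dv}_{0} + \frac{1}{\sqrt{2}}\underbrace{\int\chi_2\chi_3\,dv}_{0} + \frac{1}{2}\underbrace{\int\chi_1\chi_3\,dv}_{0} \\
&= \frac{1}{4} + \frac{1}{2} + \frac{1}{4} = 1
\end{aligned} \tag{8-46}$$

Thus, one electron in $\phi_1$ shows up, upon integration, as being “distributed” 1/4 at 
carbon 1, 1/2 at carbon 2, and 1/4 at carbon 3. We say that the atomic π-electron densities 
due to an electron in $\phi_1$ are 1/4, 1/2, 1/4 at C1, C2, and C3, respectively. If we accumulate 
these figures for all the electrons, we arrive at a total π-electron density for each carbon. 
For the allyl radical, Table 8-1 shows that each atom has a π-electron density of unity. 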

Generalizing this approach gives for the total π-electron density $q_i$ on atom i 

$$q_i = \sum_{k}^{\text{all MOs}} n_k c_{ik}^2 \tag{8-47}$$

Here k is the MO index, $c_{ik}$ is the coefficient for an AO on atom i in MO k, and $n_k$, the 
“occupation number,” is the number of electrons (0, 1, or 2) in MO k. (In those rare 
cases where $c_{ik}$ is complex, $c_{ik}^2$ in Eq. (8-47) must be replaced by $c_{ik}^* c_{ik}$.) 

If we apply Eq. (8-47) to the cyclopropenyl radical, we encounter an ambiguity. If the 
unpaired electron is assumed to be in MO $\phi_2$ of Fig. 8-7, we obtain $q_1 = q_2 = 7/6$, $q_3 = 2/3$. 
On the other hand, if the unpaired electron is taken to be in $\phi_3''$, $q_1 = q_2 = 5/6$, $q_3 = 4/3$. The 
HMO method resolves this ambiguity by assuming that each of the degenerate MOs is 
occupied by half an electron. This has the effect of forcing the charge distribution to 
show the overall symmetry of the molecule. In this example, it follows that $q_1 = q_2 = 
q_3 = 1$. The general rule is that, for purposes of calculating electron distributions, the 
electron occupation is averaged in any set of partially occupied, degenerate MOs. 
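
Equation (8-47) and the averaging rule are easy to check numerically. The following Python sketch is illustrative: `electron_densities` is a hypothetical helper, each MO is given as a tuple of real AO coefficients, and fractional occupation numbers stand in for the averaged occupation of degenerate MOs.

```python
import math


def electron_densities(mos, occupations):
    """q_i = sum over MOs k of n_k * c_ik**2, for real coefficients (Eq. 8-47)."""
    n_atoms = len(mos[0])
    return [sum(n_k * c[i] ** 2 for c, n_k in zip(mos, occupations))
            for i in range(n_atoms)]


r2, r3, r6 = math.sqrt(2), math.sqrt(3), math.sqrt(6)
allyl = [(0.5, 1 / r2, 0.5), (1 / r2, 0.0, -1 / r2), (0.5, -1 / r2, 0.5)]
cyclopropenyl = [
    (1 / r3, 1 / r3, 1 / r3),        # phi_1
    (1 / r2, -1 / r2, 0.0),          # phi_2
    (1 / r6, 1 / r6, -2 / r6),       # phi_3''
]

print(electron_densities(allyl, [2, 1, 0]))              # allyl radical
print(electron_densities(cyclopropenyl, [2, 1, 0]))      # odd electron in phi_2
print(electron_densities(cyclopropenyl, [2, 0, 1]))      # odd electron in phi_3''
print(electron_densities(cyclopropenyl, [2, 0.5, 0.5]))  # averaged occupation
```

Only the averaged occupation gives the same density on all three ring carbons; the two integer assignments break the threefold symmetry in opposite senses.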


TABLE 8-1 » HMO π Electron Densities in the Allyl Radical 

                       Carbon atom 
Electron             1       2       3 
1 in φ1             1/4     1/2     1/4 
2 in φ1             1/4     1/2     1/4 
3 in φ2             1/2      0      1/2 
Sum                  1       1       1 

Figure 8-8 » When the equilateral structure is distorted by decreasing $R_{12}$ and increasing $R_{13}$, 
$R_{23}$, the energies associated with $\phi_1$, $\phi_2$, $\phi_3''$ shift as shown. 


In actuality, the equilateral triangular structure for the cyclopropenyl radical is unstable, 
and therefore the above-described averaging process is only a theoretical idealization. 
It is fairly easy to see that a distortion from equilateral to isosceles form will 
affect the MO energies $E_1$, $E_2$, and $E_3''$ differently. In particular, a distortion of the 
sort depicted in Fig. 8-8 would have little effect on $E_1$ but would raise $E_2$ (increased 
antibonding) and lower $E_3''$ (decreased antibonding and increased bonding). Thus, there 
is good reason for the cyclopropenyl radical to be more stable in an isosceles rather 
than equilateral triangular form. This is an example of the Jahn-Teller theorem, which 
states, in effect, that a system having an odd number of electrons in degenerate MOs 
will change its nuclear configuration in a way to remove the degeneracy.³ The preference 
of the cyclopropenyl radical for a shape less symmetrical than what we might 
have anticipated is frequently called Jahn-Teller distortion.⁴ 
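
The direction of these shifts can be illustrated with a small calculation. The sketch below goes slightly beyond the simple HMO method and rests on a labeled assumption: the bond integrals are allowed to differ, with $H_{12} = k_{12}\beta$ and $H_{13} = H_{23} = k_3\beta$, where the scale factors `k12` and `k3` are hypothetical parameters (not from the text) that mimic a shortened C1-C2 bond and lengthened C1-C3 and C2-C3 bonds. The closed-form levels follow from separating the MOs into those antisymmetric and symmetric under interchange of atoms 1 and 2.

```python
import math


def isosceles_levels(k12, k3):
    """Levels of a three-center ring as multiples lam of beta (E = alpha + lam*beta).

    Assumes H12 = k12*beta and H13 = H23 = k3*beta. Returns (lam1, lam2, lam3)
    for the phi_1-like, phi_2-like (chi1 - chi2), and phi_3''-like MOs.
    """
    root = math.sqrt(k12 ** 2 + 8 * k3 ** 2)
    lam1 = (k12 + root) / 2
    lam2 = -k12
    lam3 = (k12 - root) / 2
    return lam1, lam2, lam3


def radical_beta_coefficient(levels):
    """Coefficient of beta in E_pi for three electrons (2 lowest + 1 next)."""
    ordered = sorted(levels, reverse=True)   # beta < 0: largest lam is lowest
    return 2 * ordered[0] + ordered[1]


equilateral = isosceles_levels(1.0, 1.0)
distorted = isosceles_levels(1.1, 0.95)      # illustrative scale factors only
print(equilateral, radical_beta_coefficient(equilateral))
print(distorted, radical_beta_coefficient(distorted))
```

With these illustrative factors the φ2-like level moves up, the φ3″-like level moves down, and the β coefficient of the radical's total energy grows beyond 3, so the π energy is lowered, in line with the qualitative argument above. The size of the effect depends entirely on the assumed scale factors.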

Many times we are interested in comparing the π-electron distribution in the bonds 
instead of on the atoms. In the integrated expression (8-46) are cross terms that vanish 
under the HMO assumption of zero overlap. But the overlaps are not actually zero, 
especially between AOs on nearest neighbors. Hence, we might view the factors $1/\sqrt{2}$ 
as indicating how much overlap charge is being placed in the C1-C2 and C2-C3 bonds 
by an electron in $\phi_1$. The C1-C3 bond is usually ignored because these atoms are not 
nearest neighbors and therefore have much smaller AO overlap. Since $S_{12} = S_{23} = S_{ij}$ 
for neighbors i and j in any π system (assuming equal bond distances), we need not 
include $S_{ij}$ explicitly in our bond index. If we proceed in this manner, two electrons in 
$\phi_1$ would then give us a “bond order” of $2/\sqrt{2} = 1.414$. It is more convenient in practice 
to divide this number in half, because then the calculated π-bond order for ethylene 
turns out to be unity rather than two. Since ethylene has one π-bond, this can be seen 
to be a more sensible index. 

³ Linear systems are exceptions to this rule. Problems are also encountered if there is an odd number of electrons 
and spin-orbit coupling is substantial. The reader should realize that the above statement of the theorem is a little 
misleading inasmuch as it makes it sound like the molecule finds itself in a symmetric geometry that produces 
degenerate MOs and then “distorts” to a lower-energy geometry. It is actually we who have guessed a geometry that 
is too symmetric. When our calculations reveal that this results in degenerate orbital energies containing an odd 
number of electrons, we are alerted that we have erred in our assumption, and that the molecule is really in a less 
symmetric, lower energy geometry. 

⁴ See Salem [1, Chapter 8]. 

As a result of these considerations, the π-bond order (sometimes called mobile bond 
order) of the allyl radical is $1/\sqrt{2} = 0.707$ in each bond. (Electrons in $\phi_2$ make no 
contribution to bond order since $c_2$ vanishes. This is consistent with the nonbonding 
label for $\phi_2$.) 

Generalizing the argument gives, for $p_{ij}$, the π-bond order between nearest- 
neighbor atoms i and j: 

$$p_{ij} = \sum_{k}^{\text{all MOs}} n_k c_{ik} c_{jk} \tag{8-48}$$

where the symbols have the same meanings as in Eq. (8-47). In cases in which partially 
filled degenerate MOs are encountered, the averaging procedure described in connection 
with electron densities must be employed for bond orders as well. 
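
A short Python sketch of Eq. (8-48) follows. It is illustrative only: `bond_order` is a hypothetical helper, atoms are numbered from 1, coefficients are taken to be real, and fractional occupation numbers again represent averaged occupation of degenerate MOs.

```python
import math


def bond_order(mos, occupations, i, j):
    """p_ij = sum over MOs k of n_k * c_ik * c_jk (Eq. 8-48), atoms numbered from 1."""
    return sum(n_k * c[i - 1] * c[j - 1] for c, n_k in zip(mos, occupations))


r2, r3, r6 = math.sqrt(2), math.sqrt(3), math.sqrt(6)
ethylene = [(1 / r2, 1 / r2), (1 / r2, -1 / r2)]
allyl = [(0.5, 1 / r2, 0.5), (1 / r2, 0.0, -1 / r2), (0.5, -1 / r2, 0.5)]
cyclopropenyl = [
    (1 / r3, 1 / r3, 1 / r3),
    (1 / r2, -1 / r2, 0.0),
    (1 / r6, 1 / r6, -2 / r6),
]

print(bond_order(ethylene, [2, 0], 1, 2))                 # one full pi bond
print(bond_order(allyl, [2, 1, 0], 1, 2))                 # allyl radical, C1-C2
print(bond_order(cyclopropenyl, [2, 0.5, 0.5], 1, 3))     # averaged occupation
```

The ethylene value shows why the factor of one half was adopted: two electrons in the bonding MO give a π-bond order of one.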


EXAMPLE 8-4 Calculate $p_{13}$ for the cyclopropenyl radical, using data in Fig. 8-7. 

SOLUTION » There are 2 electrons in $\phi_1$ and the coefficients on atoms 1 and 3 are $1/\sqrt{3}$, so 
this MO contributes $2 \times (1/\sqrt{3})^2 = 2/3$. We allocate 1/2 electron to $\phi_2$. Since $c_3 = 0$ in this MO, 
the contribution to $p_{13}$ is zero. The remaining 1/2 electron goes to $\phi_3''$, yielding a contribution of 
$\frac{1}{2} \times \frac{1}{\sqrt{6}} \times \frac{-2}{\sqrt{6}} = -\frac{1}{6}$. Hence $p_{13} = \frac{2}{3} - \frac{1}{6} = \frac{1}{2}$. 


8-9 Some Simplifying Generalizations 


Thus far we have presented the bare bones of the HMO method using fairly small 
systems as examples. If we try to apply this method directly to larger molecules, it is 
very cumbersome. A ten-carbon-atom system leads to a 10 x 10 HMO determinant. 
Expanding and solving this for roots and coefficients is tedious. However, there are 
some short cuts available for certain cases. In the event that the system is too compli- 
cated to yield to these, one can use computer programs which are readily available. 

For straight chain and monocyclic planar, conjugated hydrocarbon systems, simple 
formulas exist for HMO energy roots and coefficients. These are derivable from the 
very simple forms of the HMO determinants for such systems.⁵ We state the results 
without proof. 

For a straight chain of n unsaturated carbons numbered sequentially, 

$$x = -2\cos[k\pi/(n+1)], \qquad k = 1, 2, \ldots, n \tag{8-49}$$

$$c_{lk} = [2/(n+1)]^{1/2}\sin[kl\pi/(n+1)] \tag{8-50}$$

where l is the atom index and k the MO index. 
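
These closed forms are simple to evaluate. The Python sketch below (function names are illustrative) computes the roots and coefficients from Eqs. (8-49) and (8-50) and, for n = 3, can be compared with the allyl results obtained earlier by solving the determinant.

```python
import math


def chain_roots(n):
    """Roots x of the HMO determinant for a straight chain of n carbons (Eq. 8-49)."""
    return [-2 * math.cos(k * math.pi / (n + 1)) for k in range(1, n + 1)]


def chain_coefficient(n, l, k):
    """Coefficient c_lk of the AO on atom l in MO k (Eq. 8-50)."""
    return math.sqrt(2 / (n + 1)) * math.sin(k * l * math.pi / (n + 1))


n = 3
print(chain_roots(n))                                   # close to -sqrt(2), 0, +sqrt(2)
for k in range(1, n + 1):
    print([chain_coefficient(n, l, k) for l in range(1, n + 1)])
```

Roots are returned in order of increasing x, which for negative β is also the order of increasing energy.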


⁵ See Coulson [2]. 

For a cyclic polyene of n carbons, 

$$x = -2\cos(2\pi k/n), \qquad k = 0, 1, \ldots, n-1 \tag{8-51}$$

$$c_{lk} = n^{-1/2}\exp[2\pi i k(l-1)/n], \qquad i = \sqrt{-1} \tag{8-52}$$


The coefficients derived from Eq. (8-52) for monocyclic polyenes will be complex 
when the MO is one of a degenerate pair. In such cases one may take linear combinations 
of these degenerate MOs to produce MOs with real coefficients, if one desires. 
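
The following Python sketch evaluates Eqs. (8-51) and (8-52) and forms one real combination of a degenerate pair. It is illustrative: the function names are hypothetical, and `real_combination` uses the sum of MOs k and n − k divided by √2, which is only one of the possible real choices and applies when k is neither 0 nor n/2.

```python
import cmath
import math


def ring_roots(n):
    """Roots x for a monocyclic polyene of n carbons (Eq. 8-51), k = 0..n-1."""
    return [-2 * math.cos(2 * math.pi * k / n) for k in range(n)]


def ring_coefficient(n, l, k):
    """Complex coefficient c_lk of the AO on atom l in MO k (Eq. 8-52)."""
    return cmath.exp(2j * math.pi * k * (l - 1) / n) / math.sqrt(n)


def real_combination(n, k):
    """(phi_k + phi_{n-k}) / sqrt(2): a real MO built from a degenerate pair."""
    coeffs = []
    for l in range(1, n + 1):
        c = (ring_coefficient(n, l, k) + ring_coefficient(n, l, n - k)) / math.sqrt(2)
        coeffs.append(c.real)        # imaginary parts cancel for this combination
    return coeffs


print(ring_roots(3))                 # close to -2, 1, 1
print(real_combination(3, 1))        # proportional to (2, -1, -1)
```

For the three-membered ring this real MO differs from the pair chosen in Section 8-7 but spans the same degenerate level.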

There is also a diagrammatic way to find the energy levels for linear and monocyclic 
systems.⁶ Let us consider monocycles first. One begins by drawing a circle of radius 
2|β|. Into this circle inscribe the cycle, point down, as shown in Fig. 8-9 for benzene. 
Project sideways the points where the polygon intersects the circle. The positions of 
these projections correspond to the HMO energy levels if the circle center is assumed 
to be at E = α (see Fig. 8-9). The number of intersections at a given energy is identical 
to the degeneracy. The numerical values for E are often obtainable from such a sketch 
by inspection or simple trigonometry. 

For straight chains, a modified version of the above method may be used: For an n- 
carbon chain, inscribe a cycle with 2n + 2 carbons into the circle as before. Projecting 
out all intersections except the highest and lowest, and ignoring degeneracies gives the 
proper roots. This is exemplified for the allyl system in Fig. 8-10. 
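
The construction amounts to simple trigonometry, as the Python sketch below illustrates. The function names are hypothetical. Heights are measured upward from the circle center in units of |β|, so a vertex at height y corresponds to E = α + y|β|; with β negative this y is numerically equal to the root x.

```python
import math


def frost_ring_levels(n):
    """Heights of the vertices of an n-gon inscribed point down in a circle
    of radius 2 (units of |beta|), measured upward from the center."""
    return sorted(-2 * math.cos(2 * math.pi * k / n) for k in range(n))


def frost_chain_levels(n):
    """Chain of n carbons: use a (2n + 2)-gon, drop the lowest and highest
    vertices, and keep one vertex from each mirror-image pair."""
    m = 2 * n + 2
    return sorted(-2 * math.cos(2 * math.pi * k / m) for k in range(1, n + 1))


print(frost_ring_levels(6))          # benzene: close to -2, -1, -1, 1, 1, 2
print(frost_chain_levels(3))         # allyl: close to -sqrt(2), 0, +sqrt(2)
```

Repeated heights in the ring list are the degeneracies; for the chain, taking vertices k = 1 to n on one side of the polygon is how the sketch "ignores degeneracies."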

Examination of the energy levels in Figs. 8-9 and 8-10 reveals that the orbital energies 
are symmetrically disposed about E = α. Why is this so? Consider the allyl system. 
The lowest-energy MO has two bonding interactions. The highest-energy MO differs 
only in that these interactions are now antibonding. [See Fig. 8-5 and note that the 
coefficients in φ1 and φ3 are identical except for sign in Eqs. (8-27) and (8-29).] The 
role of the β terms is thus reversed and so they act to raise the orbital energy for φ3 just 
as much as they lower it for φ1. A similar situation holds for benzene. As we will see 
shortly, the lowest energy corresponds to an MO without nodes between atoms, so this 
is a totally bonding MO. The highest-energy MO has nodal planes between all neighbor 
carbons, and so every interaction is antibonding. An analogous argument holds for the 
degenerate pairs of benzene MOs. These observations suggest that the energy of an 
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Figure 8-9 » HMO energy levels for benzene produced by projecting intersections of a hexagon 
with a circle of radius 2|β|. 


6See Frost and Musulin [3]. 
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Figure 8-10 » HMO energy levels for the allyl system (n = 3) produced by projecting the inter- 
sections of an octagon (2n + 2 = 2 × 3 + 2 = 8 vertices) with a circle of radius 2|β|. 




MO should be expressible as a function of the net bond order associated with it, and 
this is indeed the case. The energy of the ith MO is given by the expression 


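\[ E_i = \int \phi_i^* \hat{H}_\pi \phi_i \, dv = \int \sum_k c_{ki}^* \chi_k^* \, \hat{H}_\pi \sum_l c_{li} \chi_l \, dv \tag{8-53} \]

\[ \phantom{E_i} = \sum_{k,l} c_{ki}^* c_{li} \int \chi_k^* \hat{H}_\pi \chi_l \, dv \tag{8-54} \]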


When the atom indices k and l are identical, the integral is equal to α; when k and l are 
neighbors, it equals β. Otherwise it vanishes. Hence, we may write 

\[ E_i = \sum_k c_{ki}^* c_{ki}\,\alpha + \sum_{k,l}^{\text{neighbors}} c_{ki}^* c_{li}\,\beta \tag{8-55} \]

However, c*_ki c_ki is q_{k,i}, the electron density at atom k due to one electron in MO φi, 
and c*_ki c_li is p_{kl,i}, the bond order between atoms k and l due to an electron in φi. 
Therefore, 

\[ E_i = \sum_k q_{k,i}\,\alpha + 2 \sum_{k<l}^{\text{neighbors}} p_{kl,i}\,\beta \tag{8-56} \]


We have seen that the sum of electron densities must equal the total number of electrons 
present. For one electron in φi, this gives additional simplification: 

\[ E_i = \alpha + 2\beta \sum_{k<l}^{\text{bonds}} p_{kl,i} \tag{8-57} \]


The total π-electron energy is the sum of one-electron energies. For n π electrons 

\[ E_\pi = n\alpha + 2\beta \sum_{k<l}^{\text{bonds}} p_{kl} \tag{8-58} \]

where p_kl is the total π-bond order between neighbors k and l. Hence, the individual 
orbital energies directly reflect the amount of bonding or antibonding described by 
the MOs, and the total energy reflects the net bonding or antibonding due to all the π 
electrons together. 
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EXAMPLE 8-5 The total π energy of cycloheptatrienyl radical (C7H7) is 7α + 
8.5429β. What is the bond order for any bond in this molecule, assuming it to be 
heptagonal? 


SOLUTION » The total π-bond order must be 8.5429/2 or 4.2714. This results from seven 
identical bonds, so each bond order is 4.2714/7 = 0.6102. ◄ 
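
The numbers in Example 8-5 can be checked with a few lines of Python. The sketch below fills the ring levels of Eq. (8-51) from the bottom up and then applies Eq. (8-58); `ring_pi_energy` is a hypothetical helper name, and the ring is assumed to be a regular polygon with all bonds equivalent.

```python
import math

def ring_pi_energy(n_atoms, n_electrons):
    """Total pi energy of a regular ring as (a, b), meaning a*alpha + b*beta."""
    # Multiplier of beta for each MO: E_k = alpha + 2*cos(2*pi*k/n)*beta.
    levels = sorted((2.0 * math.cos(2.0 * math.pi * k / n_atoms)
                     for k in range(n_atoms)), reverse=True)
    beta_total = 0.0
    remaining = n_electrons
    for level in levels:                 # most bonding level first
        occ = min(2, remaining)          # at most two electrons per MO
        beta_total += occ * level
        remaining -= occ
    return n_electrons, beta_total

n_alpha, n_beta = ring_pi_energy(7, 7)   # cycloheptatrienyl radical
total_bond_order = n_beta / 2.0          # Eq. (8-58): E_pi = n*alpha + 2*beta*sum(p_kl)
per_bond = total_bond_order / 7          # seven equivalent bonds
```

The same helper applied to benzene (six atoms, six electrons) gives the multiplier of β used later in the chapter for that molecule.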


Does this pairing of energy levels observed for allyl and benzene always occur? It 
is easy to show that it cannot in rings with an odd number of carbon centers. Consider 
the cyclopropenyl system. The lowest-energy MO is nodeless, totally bonding and has 
an energy of α + 2β. [Note from Eq. (8-51) and also from the diagram method that 
every monocyclic system has a totally bonding MO at this energy.] To transform these 
three bonding interactions into antibonding interactions of equal magnitude requires 
that we cause a sign reversal across every bond. This is impossible, for, if c1 disagrees 
in sign with c2 and c3, then c2 and c3 must agree in sign and cannot yield an antibonding 
interaction. 

Not surprisingly, this has all been considered in a rigorous mathematical fashion. 
Systems containing a ring with an odd number of atoms are “nonalternant” systems. All 
other homonuclear unsaturated systems are "alternant" systems. An alternant system 
can always have asterisks placed on some of the centers so that no two neighbors are 
both asterisked or unasterisked. For nonalternants, this is not possible (see Fig. 8-11). 
It is convenient to subdivide alternant systems into even alternants or odd alternants 
according to whether the number of centers is even or odd. With this terminology 
defined, we can now state the pairing theorem and some of its immediate consequences. 

The theorem states that, for alternant systems, (1) energy levels are paired such that, 
for each level at E = α + kβ there is a level at E = α − kβ; (2) MOs that are paired 
in energy differ only in the signs of the coefficients for one of the sets (asterisked or 
unasterisked) of AOs. 

It is easy to see that an immediate result of this theorem is that an odd-alternant 
system, which must have an odd number of MOs, must have a nonbonding (E = α) 
MO that is not paired with another MO. It is also possible to show that the electron 
density is unity at every carbon for the neutral ground state of an alternant system. The 
proofs of the pairing theorem and some of its consequences are given in Appendix 5. 
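
Whether a given framework is alternant can be decided mechanically by attempting the asterisk labeling described above. The Python sketch below does this with a breadth-first walk over the bond list; the function name and the convention of numbering atoms from 1 are illustrative assumptions.

```python
from collections import deque

def star_atoms(n_atoms, bonds):
    """Try to asterisk atoms so that no bond joins two atoms of the same kind.

    Returns the set of asterisked atoms for an alternant system, or None when
    the labeling fails (the framework contains an odd-membered ring).
    Atoms are numbered 1..n_atoms; bonds is a list of (k, l) pairs.
    """
    neighbors = {a: [] for a in range(1, n_atoms + 1)}
    for k, l in bonds:
        neighbors[k].append(l)
        neighbors[l].append(k)
    label = {}
    for start in neighbors:                  # also handles disconnected fragments
        if start in label:
            continue
        label[start] = True                  # True means "asterisked"
        queue = deque([start])
        while queue:
            atom = queue.popleft()
            for nb in neighbors[atom]:
                if nb not in label:
                    label[nb] = not label[atom]
                    queue.append(nb)
                elif label[nb] == label[atom]:
                    return None              # two like neighbors: nonalternant
    return {a for a, starred in label.items() if starred}

butadiene = star_atoms(4, [(1, 2), (2, 3), (3, 4)])
cyclopropenyl = star_atoms(3, [(1, 2), (2, 3), (3, 1)])
benzene = star_atoms(6, [(i, i % 6 + 1) for i in range(1, 7)])
```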

Another useful short cut exists that enables one to sketch qualitatively the MOs 
for any linear polyene. The HMOs for the allyl and butadiene systems are given in 
Fig. 8-12. Notice that the envelopes of positive (or negative) phase in these MOs are 
similar in appearance to the particle in a one-dimensional “box” solutions described 
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Figure 8-11 » (a) Even and (b) odd alternants have no two neighbors identical in terms of an 
asterisk label. (c) Nonalternants have neighbors that are identical. 
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Figure 8-12 » MOs for the allyl and butadiene systems. The dashed lines emphasize the similarity 
between an envelope, or contour, of positive φ for these systems and the particle in a one-dimensional 
box solutions. 


in Chapter 2. This similarity makes it fairly easy to guess the first few MOs for 
pentadienyl, hexatriene, etc. Also, if one knows the lowest-energy half of the MOs for 
such molecules, one can generate the remaining MOs by appeal to the second part of 
the pairing theorem. (The edges of the one-dimensional box should extend one C-C 
bond length beyond the terminal atoms.) 
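
The box analogy can be made concrete numerically. The sketch below assumes that Eq. (8-50) has the standard closed form for a linear chain, c_lk = [2/(n + 1)]^(1/2) sin(klπ/(n + 1)), which vanishes at l = 0 and l = n + 1, that is, one bond length beyond each terminal atom. The helper names are hypothetical.

```python
import math

def chain_mo(n, k):
    """Coefficients c_lk = sqrt(2/(n+1)) * sin(k*l*pi/(n+1)) for l = 1..n."""
    norm = math.sqrt(2.0 / (n + 1))
    return [norm * math.sin(k * l * math.pi / (n + 1)) for l in range(1, n + 1)]

def paired_mo(coeffs):
    """Pairing-theorem partner: reverse the sign on every second atom (2, 4, ...)."""
    return [c if index % 2 == 0 else -c for index, c in enumerate(coeffs)]

def sign_changes(coeffs, tol=1e-12):
    """Number of sign changes along the chain, ignoring atoms that sit on a node."""
    signs = [c > 0 for c in coeffs if abs(c) > tol]
    return sum(1 for a, b in zip(signs, signs[1:]) if a != b)

butadiene_phi1 = chain_mo(4, 1)
pentadienyl = [chain_mo(5, k) for k in range(1, 6)]
nodes = [sign_changes(mo) for mo in pentadienyl]   # 0, 1, 2, ... as for a particle in a box
```

Generating the upper half of the MOs with `paired_mo` instead of the sine formula gives the same coefficients, which is the shortcut described in the text.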

As we consider larger systems, brute-force solution of determinantal equations 
becomes too labor-intensive, and so such cases are always either solved on a computer⁷ 
or else by appeal to HMO tabulations in print.⁸ However, the above generalizations 
continue to be useful in understanding the MO results. 


8-10 HMO Calculations on Some Simple Molecules 


Thus far, we have used the allyl and cyclopropenyl systems as examples. We will now 
describe the results of HMO calculations on some other simple but important systems. 


8-10.A Ethylene (Even Alternant) 
The Hückel determinantal equation is 

\[ \begin{vmatrix} x & 1 \\ 1 & x \end{vmatrix} = 0 \]


7Many types of quantum-chemical computer programs are available from: Quantum Chemistry Program 
Exchange, Chemistry Department, Indiana University, Bloomington, Indiana 47401. On the Internet at 
http://www.QCPE.Indiana.edu/ 

8See Coulson and Streitwieser [4], Streitwieser and Brauman [5], and Heilbronner and Straub [6]. See also 
Appendix 6 of this text. 
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and so x² − 1 = 0; x = +1, −1. The resulting orbital energies and coefficients are 

\[ E_1 = \alpha + \beta, \qquad \phi_1 = \frac{1}{\sqrt{2}}\chi_1 + \frac{1}{\sqrt{2}}\chi_2 \]

\[ E_2 = \alpha - \beta, \qquad \phi_2 = \frac{1}{\sqrt{2}}\chi_1 - \frac{1}{\sqrt{2}}\chi_2 \tag{8-59} \]

These, with the ground state electronic configuration indicated, are shown in (II). 
π-Electron densities and π-bond order are indicated in the diagram beneath the MO 
sketches. Ethylene is an even alternant, so it has paired energies, unit electron densities, 
and coefficients related by a sign change. 

(II) [Diagram: E2 = α − β (empty); E1 = α + β (doubly occupied); E_π = 2α + 2β; π-electron density 1 at each carbon; π-bond order 1.] 
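
8-10.B Butadiene (Even Alternant) 

With the carbons numbered 1 through 4 along the chain, the Hückel determinantal equation is 

\[ \begin{vmatrix} x & 1 & 0 & 0 \\ 1 & x & 1 & 0 \\ 0 & 1 & x & 1 \\ 0 & 0 & 1 & x \end{vmatrix} = 0 \]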


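This problem can be solved by expansion to a polynomial in x and factoring, but it 
is simpler to use Eq. (8-49) or the decagon in a circle of radius 2|β|. The coefficients 
are obtainable from Eq. (8-50). The results are 

\[ E_4 = \alpha - 1.618\beta, \quad E_3 = \alpha - 0.618\beta, \quad E_2 = \alpha + 0.618\beta, \quad E_1 = \alpha + 1.618\beta, \quad E_\pi = 4\alpha + 4.472\beta \]

\[ \phi_1 = 0.372\chi_1 + 0.602\chi_2 + 0.602\chi_3 + 0.372\chi_4 \]
\[ \phi_4 = 0.372\chi_1 - 0.602\chi_2 + 0.602\chi_3 - 0.372\chi_4 \]
\[ \phi_2 = 0.602\chi_1 + 0.372\chi_2 - 0.372\chi_3 - 0.602\chi_4 \tag{8-60} \]
\[ \phi_3 = 0.602\chi_1 - 0.372\chi_2 - 0.372\chi_3 + 0.602\chi_4 \]

[Diagram: π-bond orders 0.894 (C1-C2), 0.447 (C2-C3), 0.894 (C3-C4); π-electron density 1 at each carbon.] 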

The MOs are given in Fig. 8-12. φ1 is bonding in all bonds; φ2 is bonding in the 
outer bonds, antibonding in the central bond. As a result, butadiene has a lower π-bond 
order in the central bond than in the outer bonds. This is in pleasing accord with the 
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experimental observation that the central bond in butadiene is significantly longer than 
the outer bonds. 

The formal structural formula for butadiene (III), indicating two pure double bonds 
and one pure single bond, is clearly not an adequate description since we have just 
found the central bond to have some π-bond order (0.447) and the outer bonds to 
be less π bonding than ethylene (0.894 as opposed to 1.000). The MO parlance is that 
the π electrons in butadiene are delocalized over the entire carbon system rather than 
being restricted to the formal double bonds only. 

(III) H2C=CH-CH=CH2 
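
The butadiene densities and bond orders quoted above follow directly from the occupied-MO coefficients. The Python sketch below recomputes them, again assuming the sine form for the chain coefficients (taken here to be what Eq. (8-50) expresses) and using hypothetical helper names.

```python
import math

def chain_mo(n, k):
    """c_lk = sqrt(2/(n+1)) * sin(k*l*pi/(n+1)), l = 1..n (assumed form of Eq. 8-50)."""
    norm = math.sqrt(2.0 / (n + 1))
    return [norm * math.sin(k * l * math.pi / (n + 1)) for l in range(1, n + 1)]

def bond_orders(mos, occupations, bonds):
    """p_kl = sum over MOs of n_i * c_ki * c_li (atoms numbered from 1)."""
    return {(k, l): sum(n_i * mo[k - 1] * mo[l - 1]
                        for mo, n_i in zip(mos, occupations))
            for k, l in bonds}

def densities(mos, occupations):
    """q_k = sum over MOs of n_i * c_ki**2."""
    n_atoms = len(mos[0])
    return [sum(n_i * mo[a] ** 2 for mo, n_i in zip(mos, occupations))
            for a in range(n_atoms)]

mos = [chain_mo(4, k) for k in (1, 2, 3, 4)]
occ = [2, 2, 0, 0]                            # neutral ground state
p = bond_orders(mos, occ, [(1, 2), (2, 3), (3, 4)])
q = densities(mos, occ)
beta_multiplier = 2.0 * sum(p.values())       # Eq. (8-58): E_pi = 4*alpha + this*beta
```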

Because the HMO method allows for no interaction between terminal carbons 
(i.e., H14 = 0), there is no distinction between cis- and trans-butadiene in this 
calculation. However, if a weak interaction were postulated, it is not difficult to see that φ1 
would give a bonding end-to-end contribution, φ2 a substantially larger antibonding 
interaction, leading to a prediction (in agreement with experiment) that trans-butadiene 
is the more stable form. 


8-10.C Cyclobutadiene (Even Alternant) 


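If we assume that this system is perfectly square, we can inscribe the square in a 
circle, giving us the orbital energies immediately (IV). 

(IV) [Diagram: square inscribed, point down, in a circle of radius 2|β|; E4 = α − 2β; E2 = E3 = α (one electron each); E1 = α + 2β (doubly occupied); E_π = 4α + 4β.] 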


Since φ1 and φ4 are nondegenerate, they must be symmetric or antisymmetric for 
the various rotations and reflections of the molecule. Also, φ1 and φ4 must have the 
same coefficients, except for sign changes. 

It follows at once that 

\[ \phi_1 = \tfrac12\chi_1 + \tfrac12\chi_2 + \tfrac12\chi_3 + \tfrac12\chi_4, \qquad \phi_4 = \tfrac12\chi_1 - \tfrac12\chi_2 + \tfrac12\chi_3 - \tfrac12\chi_4 \]

φ2 and φ3 are degenerate, so there is some arbitrariness here. However, we expect each 
of these MOs to have a nodal plane, and these planes should be perpendicular to each 
other if φ2 and φ3 are to be orthogonal. Therefore, we choose the pair having nodal 
planes indicated by dashed lines (V). The four MOs for cyclobutadiene as seen from 
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above are shown in (VI). It is clear that φ2 and φ3 are nonbonding because the nodal 
planes prevent interactions between neighbors, but it looks like φ2 and φ3 violate the 
pairing theorem since they cannot be interchanged by changing signs of coefficients. 
This is only an apparent violation, because it is easy to find an equivalent pair of MOs 
that follow the rule. We need only choose nodal planes that are rotated 45° (VII) from 
those we selected previously. These MOs are linear combinations of φ2 and φ3 and 
are equivalent to them for purposes of calculating electron densities and bond orders. 
They are still nonbonding MOs (E still equals α), but now it is not because of a nodal 
plane preventing nearest-neighbor interactions, but because bonding and antibonding 
interactions occur in equal number and magnitude. 

Notice that this is an example of an even alternant system with nonbonding MOs. 
Thus, whereas an odd alternant system must have an unpaired nonbonding MO, even 
alternants may have nonbonding MOs in pairs. 
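
The equivalence of the two choices of degenerate MOs can be checked with Eq. (8-57). The sketch below is only an illustration: it assumes the pair of (V) has its nodal planes passing through opposite atoms and builds the 45°-rotated pair of (VII) as the normalized sum and difference; the variable names are hypothetical.

```python
import math

RING_BONDS = [(1, 2), (2, 3), (3, 4), (4, 1)]

def beta_multiplier(mo, bonds=RING_BONDS):
    """Multiplier of beta in E_i = alpha + 2*beta*sum(c_k*c_l) over bonds, Eq. (8-57)."""
    return 2.0 * sum(mo[k - 1] * mo[l - 1] for k, l in bonds)

def flip_alternate(mo):
    """Pairing-theorem operation: reverse the sign on atoms 2 and 4."""
    return [c if index % 2 == 0 else -c for index, c in enumerate(mo)]

s = 1.0 / math.sqrt(2.0)
phi2_v = [s, 0.0, -s, 0.0]        # assumed (V): nodal plane through atoms 2 and 4
phi3_v = [0.0, s, 0.0, -s]        # assumed (V): nodal plane through atoms 1 and 3
phi3_vii = [(a + b) * s for a, b in zip(phi2_v, phi3_v)]   # (1/2)(+ + - -)
phi2_vii = [(a - b) * s for a, b in zip(phi2_v, phi3_v)]   # (1/2)(+ - - +)

energies = [beta_multiplier(mo) for mo in (phi2_v, phi3_v, phi2_vii, phi3_vii)]
```

All four multipliers vanish, so every one of these MOs is nonbonding, but only the rotated pair is interchanged by `flip_alternate`, in line with the pairing theorem.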


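(V) [Diagram: square with atoms 1 to 4 and dashed nodal planes through opposite atoms.] 
φ2 = (1/√2)χ1 − (1/√2)χ3,  φ3 = (1/√2)χ2 − (1/√2)χ4 

(VI) [Diagram: the four MOs φ1, φ2, φ3, φ4 of cyclobutadiene as seen from above.] 

(VII) [Diagram: square with atoms 1 to 4 and nodal planes rotated 45°, bisecting opposite bonds.] 
φ2 = ½χ1 − ½χ2 − ½χ3 + ½χ4,  φ3 = ½χ1 + ½χ2 − ½χ3 − ½χ4 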

Since this system is alternant, it must have π-electron densities of unity in its ground 
neutral state. This would be necessary however, even if the molecule were not alternant, 
due to the fact that all carbons are equivalent by symmetry. Hence, they must all have 
the same electron density. Since it must sum to four electrons, the density of each atom 
must be unity. The same argument applies to the cyclopropenyl radical, a nonalternant 
that, nevertheless, has all electron densities equal to unity (if the unpaired electron is 
divided between degenerate MOs). 

Our assumed square symmetry also requires all four bonds to be identical. Since 
the total energy, 4α + 4β, is related to bond order through Eq. (8-58), it follows at once 
that the total bond order is 2, and so each bond has order 1/2. 


EXAMPLE 8-6 We have assumed that cyclobutadiene is square. Could it be 
otherwise? 
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SOLUTION » Square cyclobutadiene leads to degenerate, nonbonding π MOs, each of which 
we are assuming to contain one electron. (See IV.) We have further assumed that these two electrons 
have the same spin. This gives a triplet state (assumed to be the lower-energy choice) which forces 
us to put each electron into a different MO. If cyclobutadiene were distorted into a rectangle by 
shortening bonds 1-2 and 3-4, and lengthening 2-3 and 1-4, we would expect φ2 of VII to rise in 
energy and φ3 to drop. If we continue having one electron in each MO, these changes would tend to 
cancel but, if we put both electrons into φ3, the π energy would drop. However, this would require 
paired spins, that is, a singlet. We have here, then, a competition between two factors: the energy rise of 
forming a singlet and the energy lowering from being distorted to a rectangle. HMO theory doesn't 
tell us which factor wins, but it does alert us to a possible extension to the Jahn-Teller effect: An 
odd number of electrons (one or three) in a doubly-degenerate level must cause distortion. An even 
number (two) may cause distortion if the destabilization caused by going from triplet to singlet is 
more than counterbalanced by the stabilization resulting from distortion.⁹ ◄ 


8-10.D Benzene (Even Alternant) 


Benzene is another molecule whose high symmetry enables one to use shortcuts. The 
orbital energies have already been found from the hexagon-in-a-circle diagram (see 
Fig. 8-9). The lowest- and highest-energy MOs must show all the symmetry of the 
molecule, as they are nondegenerate. Therefore, 


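\[ \phi_1 = \frac{1}{\sqrt{6}}\left(\chi_1 + \chi_2 + \chi_3 + \chi_4 + \chi_5 + \chi_6\right) \]
\[ \phi_6 = \frac{1}{\sqrt{6}}\left(\chi_1 - \chi_2 + \chi_3 - \chi_4 + \chi_5 - \chi_6\right) \]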


where the carbon atoms are numbered sequentially around the ring. 

The degenerate MOs φ2 and φ3 should have one nodal plane each,¹⁰ and these should 
be perpendicular to each other. If we take one plane as shown in (VIII), we can 
immediately write down φ2. The node for φ3 is given in (IX). It is obvious that χ6, 
χ1, χ2 have coefficients of the same sign, and that c2 = c6 = −c5 = −c3, and also 
c1 = −c4. However, c1 need not equal c2 as these atoms are differently placed with 
respect to the nodal plane. To determine these coefficients, we will use the fact that 
the neutral ground-state π densities are all unity in this system. We consider first atom 
number 1. Its electron density due to two electrons in φ1 and two electrons in φ2 is 
1/6 + 1/6 + 0 + 0 = 1/3. 

(VIII) [Diagram: nodal plane containing atoms 1 and 4.]  φ2 = ½(χ2 + χ3 − χ5 − χ6) 

(IX) [Diagram: nodal plane for φ3, cutting the 2-3 and 5-6 bonds.] 


9Experiments and high-accuracy calculations indicate that the lowest-energy state for cyclobutadiene is a 
rectangular singlet. Surprisingly, calculations of high accuracy also indicate that, even in the square geometry, the 
¹B1g singlet state (corresponding to each degenerate π MO containing one electron, but with spins paired) lies 
below the triplet in energy, the opposite of what Hund's rule predicts. This is thought to result from energy 
lowering due to spin polarization. For a detailed review of studies on cyclobutadiene, see Minkin et al. [19]. 

10This statement applies to the real forms of φ2 and φ3. The complex forms [derivable from Eq. (8-52)] do not 
have a planar node. The situation is analogous to the 2p+1, 2p−1 versus 2px, 2py orbitals for the H atom. 
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Figure 8-13 » φ2 for benzene and φ1 for ethylene have the same HMO energy. The MOs are 
sketched as seen from above. 


Therefore, two electrons in φ3 must produce a contribution of 2/3. Hence the 
coefficient for this atom in φ3 must be 1/√3. A similar argument for atom 2 gives a coefficient 
of 1/√12 (or, one can use the normality condition for φ3). As a result, 

\[ \phi_3 = \frac{1}{\sqrt{3}}\left(\chi_1 + \tfrac12\chi_2 - \tfrac12\chi_3 - \chi_4 - \tfrac12\chi_5 + \tfrac12\chi_6\right) \]

Appeal to the pairing theorem (reversing the signs of the coefficients on alternate carbons) 
generates φ4 and φ5. 

By symmetry, all bond orders must be identical, and their sum must be 4, since 
E_π = 6α + 8β. Therefore, p12 = p23 = ··· = p61 = 4/6 = 0.667. 
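
As a check on the symmetry argument, the sketch below computes the benzene π densities and bond orders directly from the three occupied real MOs (φ1, and the degenerate pair with nodal planes through atoms 1 and 4 and through the 2-3 and 5-6 bonds). The helper names are illustrative only.

```python
import math

r6, r3 = math.sqrt(6.0), math.sqrt(3.0)
phi1 = [1 / r6] * 6
phi2 = [0.0, 0.5, 0.5, 0.0, -0.5, -0.5]
phi3 = [1 / r3, 0.5 / r3, -0.5 / r3, -1 / r3, -0.5 / r3, 0.5 / r3]
occupied = [phi1, phi2, phi3]            # two electrons in each MO

def density(atom):
    """pi-electron density on a carbon (atoms numbered 1..6)."""
    return sum(2 * mo[atom - 1] ** 2 for mo in occupied)

def bond_order(k, l):
    """pi-bond order between carbons k and l."""
    return sum(2 * mo[k - 1] * mo[l - 1] for mo in occupied)

ring_bonds = [(i, i % 6 + 1) for i in range(1, 7)]
q = [density(a) for a in range(1, 7)]
p = [bond_order(k, l) for k, l in ring_bonds]
beta_multiplier = 2.0 * sum(p)           # Eq. (8-58): E_pi = 6*alpha + this*beta
```

Although φ2 and φ3 individually treat the six bonds differently, their contributions add up to the same bond order for every bond.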

The hexagon in a circle applies to ethylene (n = 2, 2n + 2 = 6) as well as to benzene. 
As a result, the orbital energies for φ2, φ3 of benzene are identical to E1 for ethylene. 
Examination of these MOs (Fig. 8-13) makes the reason for their energy agreement 
clear. Molecular orbital φ2 of benzene is more revealing than is φ3 in this context. 
The nodal plane produces an MO corresponding to two noninteracting ethylene MOs. 
Hence, an electron in this MO is always in a situation that is indistinguishable (under 
HMO approximations) from that in φ1 of ethylene. 

In this section, we have tried to illustrate some of the properties of HMO solutions 
for simple systems and to indicate how symmetry and other relations are useful in 
producing and understanding HMO results. It is often convenient to have HMO results 
for various simple systems readily available in a condensed form. Therefore, a summary 
of results for a number of molecules is provided in Appendix 6. 


8-11 Summary: The Simple HMO Method for 
Hydrocarbons 


1. The assumption is made that the π-electron energy can be minimized independently 
of σ electrons. This is an approximation. 


2. The assumption is made that each π electron sees the same field (the repulsion due to 
the other π electrons is presumably included “in effect,” in a time averaged way) so 
that the π electrons are treated as independent particles. This approximation leads 
to a total wavefunction that is a simple product of one-electron MOs and a total π 
energy that is a sum of one-electron energies. Except for use of the Pauli principle 
to build up configurations, no explicit treatment is made of electron spin. 


3. The basis set is chosen to be a 2pπ AO from each carbon atom in the unsaturated 
system. Choosing a basis set of AOs means our MOs will be linear combinations 
of AOs, and so this is an LCAO-MO method. 
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4. The Hückel determinant summarizes the connectedness of the unsaturated system, 
and is independent of cis—trans isomerism or bond length variation. 


5. The energy of each MO is expressed in terms of atomic terms, α, and bond terms, β. 
The amount of α in each MO energy is always unity because the sum of π-electron 
densities for one electron in the MO is always unity. The amount of β present is 
related to the net bonding or antibonding character of the MO. 


6. Alternant systems display paired energy levels and corresponding MOs having 
coefficients related by simple sign reversals. For ground-state neutral alternants, the π 
electron densities are all unity. 


7. A caveat: One can perform HMO calculations on very large systems such as 
pentahelicene (X) thereby making the implicit assumption that this is a planar molecule. But 
repulsion between protons on the terminal rings is sufficient to cause this molecule 
to deviate from planarity. Similarly, cyclooctatetraene is tub-shaped, probably due 
to considerations of angle strain that would occur in the planar molecule. Hence, 
one must recognize that, in certain cases, an HMO calculation refers to a planar 
“ideal” not actually achieved by the molecule. 


(X) 


8-12 Relation Between Bond Order and Bond Length 


In this and following sections we will describe some of the relations between HMO 
theoretical quantities and experimental observations.¹¹ 

It is natural to look for a correlation between calculated π-bond orders and 
experimentally determined bond lengths. A high bond order should correspond to a large 
π charge in the bond region, which should yield a shorter, stronger bond. The 
bond-order-bond-length results for certain simple systems, given as a graph in Fig. 8-14, 
do indeed show the anticipated behavior. However, as more and more data are added 
(Fig. 8-15) it becomes clear that an exact linear relation between these quantities does 
not exist at this level of refinement. 

Nevertheless, the correlation between bond order and bond length is good enough 
to make it useful for rough predictions of bond length variations. An example of this 
is given in Fig. 8-16, where calculated and observed bond lengths for phenanthrene are 
plotted. Even though the predicted absolute values of bond lengths are imperfect, the 
theoretical values show a rough parallelism with observed changes in values as we go 
from bond to bond. The relation used here to calculate the theoretical bond lengths 
from HMO bond orders is due to Coulson [8] and has the form 


\[ R = s - \frac{s - d}{1 + k(1 - p)/p} \tag{8-61} \]


11For a more complete discussion of these phenomena as well as many others, see Streitwieser [7]. 
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Figure 8-14 » π-Bond order versus bond length for some simple unsaturated hydrocarbons. 


Figure 8-15 » Bond lengths versus π-bond orders for benzene, graphite, naphthalene, anthracene, 
phenanthrene, triphenylene, and pyrene. 


Here, s is the single bond length, d the double bond length, p the π-bond order, k an 
adjustable parameter, and R the predicted length. The double bond length d is taken 
to be 1.337 Å, the bond length of ethylene. The single bond length may be taken to be 
the length of the C-C bond in ethane, 1.54 Å, or it may be set by fitting to data points 
(such as those in Fig. 8-15) on the assumption that the single bond between two CH2 
groups (i.e., ethylene with its π bond “turned off”) is not necessarily the same length 
as the single bond between two CH3 groups. Both of these alternatives have been used 
in Fig. 8-16. Other mathematical forms have also been suggested. Because of the 
scatter in the data, there is little basis for preferring one formula over another. It seems 
generally true that all the proposed relationships work best for bonds in condensed ring 
systems, and most poorly for bonds in acyclic polyenes (e.g., butadiene) or between 
rings (e.g., biphenyl). 
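
Eq. (8-61) is easy to evaluate for any HMO bond order. The sketch below uses as defaults one of the two parameter sets quoted in the caption of Fig. 8-16 (s = 1.54 Å, k = 0.765) together with d = 1.337 Å; the function name is hypothetical, and the results should be read as rough estimates given the scatter discussed above.

```python
def coulson_bond_length(p, s=1.54, d=1.337, k=0.765):
    """R = s - (s - d) / (1 + k*(1 - p)/p), all lengths in angstroms, Eq. (8-61)."""
    if not 0.0 < p <= 1.0:
        raise ValueError("pi-bond order p is expected to lie in (0, 1]")
    return s - (s - d) / (1.0 + k * (1.0 - p) / p)

benzene = coulson_bond_length(2.0 / 3.0)          # HMO bond order of benzene
butadiene_outer = coulson_bond_length(0.894)      # outer bonds of butadiene
butadiene_central = coulson_bond_length(0.447)    # central bond of butadiene
# Alternative parameter set from the same caption:
benzene_alt = coulson_bond_length(2.0 / 3.0, s=1.515, k=1.05)
```

By construction the formula returns d for p = 1 and approaches s as p tends to zero, so it interpolates between the double and single bond lengths.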
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Figure 8-16 » Theoretical versus experimental bond lengths for phenanthrene in order of decreas- 
ing observed length: (•) experimental; (×) theoretical with s = 1.54 Å, k = 0.765; (○) theoretical 
with s = 1.515 Å, k = 1.05. 


8-13 π-Electron Densities and Electron Spin Resonance 
Hyperfine Splitting Constants¹² 


If a molecule with one or more unpaired electron spins is placed in an external magnetic 
field, the interaction energy of an unpaired electron's spin dipole with the external field 
will be different depending on whether the electron's spin is α or β. This produces two 
slightly different energy levels for an unpaired electron. Suppose microwave radiation 
is supplied in an effort to excite such electrons from the lower to the higher of these two 
levels. If the radiation's energy is larger than required, no absorption occurs. But, if we 
gradually increase the strength of the magnetic field, thereby increasing the energy-level 


12A number of reviews on this subject have been published. See, for example, Gerson and Hammons [9]. 
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gap, we eventually achieve a magnetic field where absorption of the microwave energy 
occurs. This would give us a spectrum (absorption vs. magnetic field) with just one 
line. 

Now suppose that our radical has a hydrogen atom attached to it. The proton nucleus 
of the hydrogen can have a spin of α or β, and this can have a small effect on the 
energies of the electron whose spin we are trying to flip. We find now that half of the 
radicals absorb at a slightly different applied magnetic field than the other half, giving us 
a two-line spectrum: The original single line now exhibits hyperfine splitting due to the 
presence of a hydrogen atom, and the distance between the two peaks on the magnetic 
axis is called the hyperfine splitting constant, usually given in millitesla (mT), or in 
gauss (G). 

If the unpaired electron is a π electron, HMO theory can be applied to the value of the 
ESR splitting constant. The interaction between an unpaired π electron and a proton 
(or other nucleus with nonzero nuclear spin) falls off rapidly with increasing separation. 
Therefore, the hyperfine structure is generally ascribed to interactions involving protons 
directly bonded to carbons in the π system (α protons) or else separated from the π 
system by two σ bonds (β protons). The equilibrium position of an α proton is in the 
nodal plane of the π system, so it is clear that any net spin density at the proton must 
be only indirectly due to the presence of an unpaired π electron. This indirect effect 
arises because the unpaired π electron interacts slightly differently with α- and β-spin 
σ electrons on carbon, and so the spatial distributions of these become slightly different 
in the σ MOs, ultimately producing net spin density at the proton; the σ electrons are 
said to be spin polarized by the π electron (see Fig. 8-17). 

The extent of spin polarization at a given hydrogen should depend on the 
percentage of time the unpaired π electron spends on the carbon to which that hydrogen is 
bonded. It is therefore reasonable to look for relationships between the distribution of 
π-spin density in a radical and the hyperfine coupling constants characterizing its ESR 
spectrum. The simplest assumption one can make in this regard, called the McConnell 
relation, is that the hyperfine splitting constant, a_Hμ, for a proton directly bonded to 
the μth carbon, is proportional to the net π-spin density ρ_μ on that carbon: 

\[ a_{H_\mu} = Q \rho_\mu \tag{8-62} \]

Thus, if the unpaired π electron density is twice as great on carbon 1 as on carbon 2, 
the ESR splitting constant for a proton attached to carbon 1 should be twice as great as 
that for a proton attached to carbon 2. 


Figure 8-17 » The unpaired π-spin density at carbon repels both σ electrons in the C-H bond 
region, but does not repel them equally. As a result, slight spin imbalance due to σ electrons occurs 
at the proton. 
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The HMO method predicts that the spin density at the μth carbon due to an unpaired 
electron in the mth MO is simply |c_μm|². This is only a first approximation to ρ_μ, 
but a graph of observed splitting constants plotted against ρ_μ calculated in this way 
shows a respectable correlation (Fig. 8-18). The proportionality factor Q, given by the 
slope of the line of best fit, varies somewhat depending on the type of system and the 
charge of the radical. Thus, in Fig. 8-18, where all the data are from aromatic fused ring 
hydrocarbon radical anions, the correlation is quite good. As data for nonalternants 
and radical cations are added (Fig. 8-19) the scatter increases. 
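
To show how the HMO spin densities feed into the McConnell relation, the sketch below computes |c|² for the singly occupied nonbonding MO of an odd linear chain and scales it by Q. Two assumptions are made here and are not taken from the text: the chain coefficients use the sine form assumed for Eq. (8-50), and `Q_PLACEHOLDER_GAUSS` is a purely illustrative magnitude, since the text gives Q only as the slope of a fitted line.

```python
import math

Q_PLACEHOLDER_GAUSS = 25.0   # ASSUMPTION: illustrative magnitude only, not a fitted value

def chain_mo(n, k):
    """c_lk = sqrt(2/(n+1)) * sin(k*l*pi/(n+1)), l = 1..n (assumed form of Eq. 8-50)."""
    norm = math.sqrt(2.0 / (n + 1))
    return [norm * math.sin(k * l * math.pi / (n + 1)) for l in range(1, n + 1)]

def hmo_spin_densities(n):
    """|c|^2 on each carbon for the singly occupied MO of a neutral odd-chain radical."""
    if n % 2 == 0:
        raise ValueError("expected an odd number of carbons")
    somo = chain_mo(n, (n + 1) // 2)     # the unpaired nonbonding MO
    return [c * c for c in somo]

def splitting_constants(spin_densities, q=Q_PLACEHOLDER_GAUSS):
    """McConnell relation, Eq. (8-62): a_H = Q * rho for each carbon."""
    return [q * rho for rho in spin_densities]

allyl_rho = hmo_spin_densities(3)
allyl_a = splitting_constants(allyl_rho)
pentadienyl_rho = hmo_spin_densities(5)
```

At this level of theory the central carbon of allyl carries no spin density at all, which is the situation that the discussion of negative spin density goes on to refine.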

Some ESR spectra are interpreted to be consistent with the presence of some negative 
spin density. That is, if the extra π electron is taken to have α spin, some of the carbons 
appear to have an excess of β-spin π density. This has the effect of giving us a total spin 
density that is greater than unity, even though we have only one electron with unpaired 
spin. (It sounds absurd, but read on.) Now the amount of splitting seen in ESR spectra 
depends only on the magnitudes of spin densities at the protons, not on whether these 
spin densities are α or β, and the presence of negative spin density does not produce 
a qualitative change in an ESR spectrum (such as a “negative splitting,” whatever that 
might be). Rather it leads to an increase in the total amount of spin density in the 
system, and this, in turn, leads to an increase in the sum of splitting constants (over 

Figure 8-18 » ESR splitting constants a_Hμ in gauss versus HMO unpaired spin densities (c² × 100). The 
systems are fused ring alternant hydrocarbon radical anions (naphthalene, anthracene, tetracene, pyrene). 
The underlined point is thought to result from negative spin density. (Data from Streitwieser [7].) 
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Figure 8-19 » ESR hyperfine splitting constants in gauss versus c² for the highest occupied MO 
of the radical. All data are from hydrocarbon radical anions or cations of alternant or nonalternant 
type. (•) Anion radical; (×) cation radical. Uncertain assignments: (○) anion; (⊗) cation. The two 
correlation lines are merely sight fitted to the anion and cation points separately and suggest that Q 
for cations should be larger than for anions. Points thought to result from negative spin density are 
underlined. [See Tables I, III, VIII, XII, XIII, XIV, XV, XVI, XVII, XVIII of Gerson and Hammons 
[9], and Table 6.2 of Streitwieser [7] for data plotted here.] 


what would be predicted in the absence of negative spin density). For example, the 
HMO prediction for allyl radical would give a net spin density of 1/2 at each terminal 
carbon and zero at the central carbon. [The unpaired electron is in the nonbonding MO 
(1/√2)χ1 − (1/√2)χ3.] However, we might imagine the situation wherein there is a net α 
spin greater than 1/2 at each terminal carbon and a compensating net β spin at the central 
atom. This retains a net spin value of 1α yet results in a larger splitting constant a_Hμ 
for every proton in the molecule. 
It is possible to think of a physical explanation for this kind of spin distribution. The 
π electrons in lower-energy, filled MOs are being spin polarized similarly to the σ 
electrons mentioned earlier. If an α-spin electron in effect repels electrons of β spin 
more strongly than those of α spin, a buildup of β spin on the central carbon of allyl 
radical could be expected. Because this secondary effect is expected to be fairly small, 
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a net negative spin density is likely only on carbons where the primary effect (c²_μm) is 
zero or quite small. Methods for calculating this spin polarization (which will not be 
described here) indicate that certain of the splitting constants plotted in Figs. 8-18 and 
8-19 do in fact result from negative spin densities. These data points are indicated in the 
figures, and it can be seen that they do indeed occur where c²_μm and a_Hμ are small. This 
secondary effect is masked at higher values of c²_μm and a_Hμ, but presumably accounts 
for some of the scatter in the data. (Some scatter also results from solvent dependence 
of a_Hμ. Several solvents were used in experiments yielding the data in Fig. 8-19.) 

Because of the pairing-theorem relation between coefficients of MOs, the ESR spec- 
tra of the radical cation and anion of an alternant system should appear very similar. 
Hückel molecular orbital theory would predict them to be identical except for a slight 
change of scale due to a change in the factor Q. In practice, this similarity has been 
observed to hold fairly well. 


8-14 Orbital Energies and Oxidation-Reduction Potentials 


Many conjugated hydrocarbons can be oxidized or reduced in solution using standard 
electrochemical techniques. Since oxidation involves removing an electron from the 
highest occupied (π) MO (HOMO), it is reasonable to expect molecules with 
lower-lying HOMOs to have larger oxidation potentials. Similarly, we might expect reduction 
to be easier for compounds wherein the lowest unoccupied MO (LUMO) is lower in 
energy [see (XI)]. Thus, compound A should have a lower oxidation potential than 
compound B since E_m(A) > E_m(B), where m labels the HOMO, but compound B should 
have the lower reduction potential. 
A plot of oxidation potential versus E_m in units of β for a series of aromatic 
hydrocarbons (Fig. 8-20) yields a correlation that is remarkably free of scatter. Figure 8-21 
indicates a similar correlation for reduction potential versus E_m+1. 


(XI) [Energy-level diagram showing the HOMO and LUMO levels of Compound A and Compound B.]


Since oxidation-reduction potentials correlate with HOMO-LUMO energies, and 
since these energies are paired in even alternant systems, we should expect a plot of 
oxidation vs. reduction potential for such compounds to be linear also. (Problem 8-14.) 

The data in Figs. 8-20 and 8-21 provide a connection between theoretical energy
differences in units of β, and experimental energies. From the slope in Fig. 8-20 we
obtain that β = −2.03 eV, or −46.8 kcal/mole. Similarly, the reduction potential data
of Fig. 8-21 give β = −2.44 eV = −56.3 kcal/mole.

Figure 8-20 » Oxidation potentials in acetonitrile solution versus energy of HOMO (in units of β).
($\epsilon_{1/2}$ from Lund [10].)

One must be cautious in interpreting the above values of β. The problem is that
the experimental numbers include effects of physical processes not included in the
theory. For example, when a neutral molecule in solvent becomes oxidized or reduced,
solvation energy changes occur. One might argue that the fit of the data to straight lines
in Figs. 8-20 and 8-21 implies this sort of contribution to be small, but the sensitivity
of redox potentials to solvent nature indicates that this is not the case. However, in
larger molecules, solvation energy change upon ionization tends to be smaller, and
larger molecules also tend to have $E_m$ and $E_{m+1}$ closer to the E = α level. In other
words, we expect both solvation energy change and redox potential to be proportional
to $E_m$ or $E_{m+1}$. Therefore, they can be combined in a single linear relation. There
is, in general, no guarantee that “extra” contributing effects will be of a nature to be
correctly assimilated into the theoretical formula, but, for this effect, it happens that
this is at least partially the case.

Figure 8-21 » First reduction potential in 2-methoxyethanol versus energy of LUMO (in units
of β). ($\epsilon_{1/2}$ from Bergman [11].)

Another extra effect to consider is the π-electron repulsion energy. Because the
energies of the HOMO and LUMO are both associated with the neutral molecule, their
energies fail to reflect the decrease or increase of π-electron repulsion energy resulting
from loss or gain of a π electron. Here again, however, we expect the magnitude of the
effect to be larger for smaller molecules, where the change in π densities is greatest,
and also where $E_m$ and $E_{m+1}$ deviate most from α. Thus, as before, this extra effect
will not necessarily upset the linear relation expected from simpler considerations.

In view of the crudity of the HMO method together with the fact that the empirical
value of β includes the effects of several extra processes, the β values cited above
for oxidation and reduction are considered to be in fairly good agreement, especially
considering that the electron repulsion change operates to make oxidation easier and
reduction harder.
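
As a quick check of the unit conversions quoted in this section, the following sketch converts the two empirical β values from eV to kcal/mole. The conversion factor (23.0605 kcal/mole per eV) is a standard value that is not given in the text, and the function and variable names are illustrative only.

```python
# Assumption: standard conversion factor, 1 eV per molecule = 23.0605 kcal/mole.
EV_TO_KCAL_PER_MOLE = 23.0605


def ev_to_kcal_per_mole(energy_ev):
    """Convert an energy in eV (per molecule) to kcal/mole."""
    return energy_ev * EV_TO_KCAL_PER_MOLE


beta_oxidation_ev = -2.03  # from the slope of Fig. 8-20
beta_reduction_ev = -2.44  # from the slope of Fig. 8-21

beta_oxidation_kcal = ev_to_kcal_per_mole(beta_oxidation_ev)
beta_reduction_kcal = ev_to_kcal_per_mole(beta_reduction_ev)

print(f"oxidation: beta = {beta_oxidation_kcal:.1f} kcal/mole")  # -46.8
print(f"reduction: beta = {beta_reduction_kcal:.1f} kcal/mole")  # -56.3
```

The two results reproduce the kcal/mole figures quoted with the slopes of Figs. 8-20 and 8-21.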
8-15 Orbital Energies and Ionization Energies


Suppose that monochromatic light is beamed into a gaseous sample of a compound.
If the light is of sufficient energy, electrons will be “knocked out” of the molecules. The
kinetic energy of such a photoelectron will be equal to the energy of the incident
photon (hν) minus the energy needed to remove the electron from the molecule, that is,
the ionization energy. Measurement of the kinetic energies of photoelectrons emitted
in this manner is known as “photoelectron spectroscopy” [12].

In measuring the kinetic energies of photoelectrons from, say, benzene, it is
found that a large number are near a particular energy value, another large number
of electrons are near a different value, and so on for several kinetic energy values.
Because the photoelectrons tend to clump near several kinetic energy values, it
follows that we are, in effect, measuring several ionization energies. It is reasonable
to associate these with removal of electrons from different MOs of the molecule.¹³
A correlation plot (Fig. 8-22) between HMO orbital energies and experimentally
measured ionization energies for a number of alternant and nonalternant hydrocarbons
has been produced by Brogli and Heilbronner [13]. Their best fit was achieved using
α = −6.553 ± 0.340 eV, β = −2.734 ± 0.333 eV, where the limits define a range for
the predicted ionization energy (IE) that will include the experimental value nine
times out of ten (a 90% confidence level). There is a fair degree of scatter in the
correlation plot. By making some additional refinements in the theory, Brogli and
Heilbronner succeeded in substantially improving the correlation. We will describe
these refinements in Section 8-18.

Figure 8-22 » Experimental ionization energies for alternant and nonalternant hydrocarbons versus
HMO orbital energies using α = −6.553 eV, β = −2.734 eV. (Plotted quantity: HMO values for IE, in eV.)

¹³Note that each ionization energy is associated with removal of an electron from a different MO of the neutral
molecule. This differs from the first, second, etc. ionization energies produced by successive ionization.

In this section, we have dealt only with ionization energies assigned to removal of
a π electron. Removal of electrons from σ orbitals is also observed. If the impinging
photons are of X-radiation frequencies, the inner shell electron ionization energies are
seen. Quantum-chemical treatments of these ionization energies are possible using
theoretical methods described later.
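
The correlation described in this section can be sketched numerically. The example below assumes that the predicted ionization energy of an MO with HMO energy E = α + xβ is simply −E (a Koopmans-type identification that the text implies but does not state as a formula), and it uses the best-fit α and β quoted above. The photon energy of 21.22 eV (the He I line) is an illustrative choice, not a value taken from the text.

```python
ALPHA_EV = -6.553  # best-fit alpha quoted from Brogli and Heilbronner
BETA_EV = -2.734   # best-fit beta quoted from Brogli and Heilbronner


def hmo_ionization_energy(x):
    """Predicted IE (eV) for an MO with HMO energy E = alpha + x*beta.

    Assumption: IE is taken to be the negative of the orbital energy.
    """
    return -(ALPHA_EV + x * BETA_EV)


def photoelectron_kinetic_energy(photon_energy_ev, ionization_energy_ev):
    """Kinetic energy of the ejected electron: photon energy minus IE."""
    return photon_energy_ev - ionization_energy_ev


# Occupied pi MOs of benzene have x = 1 (degenerate pair) and x = 2.
ie_homo = hmo_ionization_energy(1.0)
ie_lowest = hmo_ionization_energy(2.0)

photon_ev = 21.22  # illustrative photon energy (He I line), an assumption
ke_homo = photoelectron_kinetic_energy(photon_ev, ie_homo)

print(f"IE (x = 1): {ie_homo:.3f} eV")   # 9.287 eV
print(f"IE (x = 2): {ie_lowest:.3f} eV")  # 12.021 eV
print(f"KE of electrons from x = 1: {ke_homo:.3f} eV")
```

Each distinct occupied level gives its own predicted ionization energy, which is the sense in which the clumps of photoelectron kinetic energies map onto different MOs.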


8-16 π-Electron Energy and Aromaticity


When propene is hydrogenated to form propane, the increase in heat content ΔH
(called the heat of hydrogenation) is −30.1 kcal/mole. The heat of hydrogenation for
1-butene is −30.3 kcal/mole. In general, the heat of hydrogenation of an isolated double
bond is about −30 kcal/mole. A similar constancy holds for the contribution of a double
bond to the heat of formation of a molecule. Therefore, isolated double bonds fit easily
into the usual chemical device of estimating the energy of a molecule by adding together
contributions from the substituent parts. This additivity appears to be violated when
double bonds are conjugated. Thus, the heat of hydrogenation for trans-1,3-butadiene
is −57.1 kcal/mole compared with the value of −60.6 kcal/mole for a pair of butene
double bonds. Since the hydrogenation product in each case is butane, the energy
difference of 3.5 kcal/mole must be due to the greater stability of conjugated double bonds.

There is a theoretical parallel to this. The HMO energy for butadiene is 4α + 4.472β.
For a pair of isolated double bonds, we double the energy of ethylene to obtain 4α + 4β.
Therefore, the HMO method indicates that the conjugated double bonds are stabilized
by 0.472β.
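
The 0.472β figure can be reproduced with a few lines of NumPy. This is a minimal sketch, assuming the usual HMO matrix in units of β (zeros on the diagonal, ones between bonded carbons), so that each eigenvalue x corresponds to an energy α + xβ. The function names are illustrative.

```python
import numpy as np


def huckel_levels(n_atoms, bonds):
    """Return HMO levels x (E = alpha + x*beta), most bonding first."""
    h = np.zeros((n_atoms, n_atoms))
    for i, j in bonds:
        h[i, j] = h[j, i] = 1.0
    # beta is negative, so the largest x is the lowest energy.
    return np.sort(np.linalg.eigvalsh(h))[::-1]


def closed_shell_pi_energy(n_atoms, bonds, n_electrons):
    """Coefficient of beta in E_pi for a closed-shell filling."""
    levels = huckel_levels(n_atoms, bonds)
    return 2.0 * float(np.sum(levels[: n_electrons // 2]))


ethylene = closed_shell_pi_energy(2, [(0, 1)], 2)
butadiene = closed_shell_pi_energy(4, [(0, 1), (1, 2), (2, 3)], 4)
delocalization = butadiene - 2.0 * ethylene

print(f"butadiene:     4 alpha + {butadiene:.3f} beta")       # 4.472
print(f"two ethylenes: 4 alpha + {2.0 * ethylene:.3f} beta")  # 4.000
print(f"difference:    {delocalization:.3f} beta")            # 0.472
```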

Because the π electrons in butadiene are delocalized over all three C-C bonds, this
0.472β has often been referred to as delocalization energy. It was common practice for
many years to equate this theoretical delocalization energy for a molecule to its
experimentally measured “extra” stability (e.g., for butadiene, 0.472|β| = 3.5 kcal/mole).

In 1969, Dewar and co-workers [14, Chapter 5; 15; 16] demonstrated that conjugated
double bonds may be successfully included in an additivity scheme. Hess and Schaad
[17, 18] have considered this idea in the context of the HMO method. They distinguish
between several kinds of C-C single and double bond energy as indicated in Table 8-2.
Using these values of bond energy, the π energy of butadiene is calculated to be
2 × 2.0000β + 0.4660β + 4α = 4α + 4.4660β compared with the HMO result of
4α + 4.472β. The difference is 0.006β, less than 0.002β per π electron. Hess and Schaad
show that this level of agreement holds for acyclic polyenes in general, even when there
is much branching. Thus, it seems that conjugated double bonds in acyclic molecules
fit into an additivity scheme after all.
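
The bookkeeping behind this comparison can be sketched for unbranched polyenes. The example below uses three of the Table 8-2 values and assumes a chain H2C=CH-(CH=CH)-...-CH=CH2, so that there are two terminal double bonds, n − 2 internal double bonds, and n − 1 single bonds. The helper names are hypothetical, and only the butadiene numbers are quoted in the text.

```python
import numpy as np

# Effective bond energies from Table 8-2, in units of beta.
EFFECTIVE_BOND_ENERGY = {
    "H2C=CH": 2.0000,  # terminal double bond
    "HC=CH": 2.0699,   # internal double bond
    "HC-CH": 0.4660,   # single bond between double bonds
}


def additive_pi_energy(n_double_bonds):
    """Coefficient of beta from the additive scheme for a linear polyene."""
    if n_double_bonds < 2:
        raise ValueError("the scheme applies to conjugated chains (n >= 2)")
    return (
        2 * EFFECTIVE_BOND_ENERGY["H2C=CH"]
        + (n_double_bonds - 2) * EFFECTIVE_BOND_ENERGY["HC=CH"]
        + (n_double_bonds - 1) * EFFECTIVE_BOND_ENERGY["HC-CH"]
    )


def hmo_pi_energy(n_double_bonds):
    """Coefficient of beta from a direct HMO calculation on the chain."""
    n_atoms = 2 * n_double_bonds
    h = np.diag(np.ones(n_atoms - 1), 1)
    h = h + h.T
    levels = np.sort(np.linalg.eigvalsh(h))[::-1]
    return 2.0 * float(np.sum(levels[:n_double_bonds]))


for n in (2, 3, 4, 5):
    difference = hmo_pi_energy(n) - additive_pi_energy(n)
    per_electron = difference / (2 * n)
    print(f"n = {n}: HMO - additive = {difference:+.4f} beta "
          f"({per_electron:+.4f} beta per pi electron)")
```

For butadiene (n = 2) the additive value is 4.4660 and the HMO value is 4.472, which is the 0.006β difference cited above.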

There is an important difference between the energy additivity scheme for conjugated
systems described above and the familiar additivity scheme used for C-H, C-C, and
isolated C=C bonds. In the latter cases the energy “contributed” by the bond is generally
thought of as being the same as the energy of that bond in the molecule (at least in
an averaged way; some care must be exercised with definitions), and, furthermore, the
bond is thought to be fairly independent of the identity of the molecule. For instance,
TABLE 8-2 » π-Bond Types and Effective Binding Energies for Carbon-Carbon Double
and Single Bonds(a)

Bond type (substituents not shown are carbon)    Effective binding energy in units of β
H2C=CH                                           2.0000
HC=CH (cis or trans)                             2.0699
H2C=C                                            2.0000
HC=C                                             2.1083
C=C                                              2.1716
HC-CH (cis or trans)                             0.4660
HC-C (cis or trans)                              0.4362
C-C (cis or trans)                               0.4358

(a) The eight numbers associated with these bonds are not unique. They satisfy six simultaneous equations.
Hence, any two of them may be given arbitrary values and the remaining six found by solving the simultaneous
equations. Different arbitrary assignments lead to different sets of numbers, all equally valid and all giving
identical π-electron energies. These numbers can only be applied when a bond has an unambiguous formal
identity, which means we must deal with acyclic polyenes (even number of centers and electrons).


a C-H bond in butane is very similar to one in heptane. But this is not the case for
conjugated molecules. Inspection of Fig. 8-23 shows that a single C-C bond between
(formal) double bonds varies significantly in bond order from molecule to molecule.
Since the total π-electron energy depends on bond order [Eq. (8-58)], we can tell at once
that the actual theoretical energy contribution due to such a “single” bond varies from
2 × 0.4472β = 0.8944β in butadiene to 1.088β in decapentaene. However, examination
of Fig. 8-23 reveals that, in going from molecule to molecule, as the “single” bonds
increase in π-bond order, the “double” bonds decrease in order. Thus, adding an
additional C=C-C group to a chain adds a constant amount of bond energy (about
2.54β) to the total π energy, but this energy increment contains contributions from
bond order changes over the whole molecule. We have, then, an additive scheme for
a delocalized effect. For this reason, the bond energy contributions of Table 8-2 are
called effective bond energies.

Figure 8-23 » HMO bond orders for butadiene, hexatriene, octatetraene, and decapentaene.
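
The bond orders referred to here can be regenerated directly. The sketch below assumes the standard HMO definition of π-bond order (twice the sum, over doubly occupied MOs, of the product of the coefficients on the two bonded atoms) and follows the text in taking the energy contribution of a bond as 2pβ. The function name is illustrative.

```python
import numpy as np


def chain_bond_orders(n_atoms):
    """pi-bond orders p(i, i+1) along an unbranched polyene chain."""
    h = np.diag(np.ones(n_atoms - 1), 1)
    h = h + h.T
    _, coeffs = np.linalg.eigh(h)           # eigenvalues in ascending order
    occupied = coeffs[:, -(n_atoms // 2):]  # largest x = most bonding MOs
    density = 2.0 * occupied @ occupied.T
    return [float(density[i, i + 1]) for i in range(n_atoms - 1)]


butadiene_orders = chain_bond_orders(4)
decapentaene_orders = chain_bond_orders(10)

print("butadiene:   ", [round(p, 4) for p in butadiene_orders])
print("decapentaene:", [round(p, 4) for p in decapentaene_orders])

# Energy contribution (in units of beta) of a formally single bond is 2*p.
print(f"butadiene central bond:  {2 * butadiene_orders[1]:.4f} beta")
print(f"decapentaene bond 4-5:   {2 * decapentaene_orders[3]:.3f} beta")
```

The two printed contributions correspond to the 0.8944β and 1.088β values quoted in the text, and the alternation of large and small bond orders along each chain is the pattern summarized in Fig. 8-23.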

We are now in a position to consider the concept of aromaticity. The term “aromatic”
originally referred to organic molecules having pleasant odors. Later it referred to a
class of molecules having a high degree of unsaturation. Benzene was recognized as
the parent compound for many such molecules, and the term aromatic has come to
mean having chemical properties peculiar to benzene and some of its relatives. The
chemical stability of these molecules, their relatively low heats of combustion or of
hydrogenation, and their tendency to prefer substitution rather than addition (thereby
preserving their π systems intact) distinguish these molecules from ordinary polyenes
and have come to be called aromatic properties, or manifestations of aromaticity.¹⁴
These properties suggest that the π electrons in aromatic systems are unusually low in
energy, contributing to both the thermodynamic and the kinetic stability of the systems.
We can test whether this is the case by calculating an expected π energy for benzene
using the bond energies of Table 8-2 for an alternating single-double bond, or Kekulé,
structure. The result, 6α + 7.61β, is significantly less stable (by 0.39β) than the HMO
energy of 6α + 8β. We shall refer to this difference as the resonance energy (RE) of
the system.


$\mathrm{RE} = E_\pi(\mathrm{HMO}) - E_\pi(\text{from Table 8-2})$    (8-63)


By this definition, a positive RE (in units of β) corresponds to extra molecular
stabilization. If we divide the RE by the number of π electrons, we obtain the RE per electron
(REPE). Hence, the REPE for benzene is 0.065β. Following Dewar we shall refer to a
system having significantly positive REPE as aromatic, significantly negative REPE as
antiaromatic, and negligible REPE as nonaromatic, or polyolefinic. (Recall that β is a
negative quantity. When REPE is tabulated in eV, it is conventional to use the absolute
value of β in eV so as to retain positive REPE for aromatic molecules.)
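
Equation (8-63) can be evaluated for the two simplest monocycles. The sketch below builds the Kekulé reference energy from the HC=CH and HC-CH entries of Table 8-2 and compares it with the HMO energy of the neutral ring. Only the benzene result (0.065β) is quoted in the text; applying the same recipe to cyclobutadiene, and the helper names, are assumptions of this example. The conversion to eV uses |β| = 1.4199 eV as given with Fig. 8-24.

```python
import numpy as np

DOUBLE_HC_CH = 2.0699  # Table 8-2, HC=CH
SINGLE_HC_CH = 0.4660  # Table 8-2, HC-CH
ABS_BETA_EV = 1.4199   # |beta| used for tabulating REPE in eV


def ring_hmo_energy(n_centers):
    """Coefficient of beta in E_pi for a neutral even-membered ring."""
    h = np.zeros((n_centers, n_centers))
    for i in range(n_centers):
        j = (i + 1) % n_centers
        h[i, j] = h[j, i] = 1.0
    levels = np.sort(np.linalg.eigvalsh(h))[::-1]
    # n electrons in n/2 levels; a half-filled degenerate pair gives the
    # same total however its electrons are distributed.
    return 2.0 * float(np.sum(levels[: n_centers // 2]))


def kekule_reference_energy(n_centers):
    """Additive energy for alternating double and single bonds."""
    return (n_centers // 2) * (DOUBLE_HC_CH + SINGLE_HC_CH)


def repe(n_centers):
    """Resonance energy per pi electron, in units of beta (Eq. 8-63)."""
    resonance = ring_hmo_energy(n_centers) - kekule_reference_energy(n_centers)
    return resonance / n_centers


for name, n in (("benzene", 6), ("cyclobutadiene", 4)):
    value = repe(n)
    print(f"{name}: REPE = {value:+.3f} beta = {value * ABS_BETA_EV:+.3f} eV")
```

Benzene comes out positive (aromatic) and cyclobutadiene strongly negative (antiaromatic), in line with the classification introduced above.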

Cyclic polyenes differ from acyclic polyenes in that many of them show significant 
values for REPE. Some results of Schaad and Hess [18] are reproduced in Fig. 8-24. 


¹⁴The definition of the term “aromatic” is not generally agreed upon. Various criteria based on structural,
magnetic, or reactivity properties have been proposed. See Minkin et al. [19] for a general review, and Schleyer
et al. [20] for consideration of ring-current-induced NMR chemical shifts as an aromaticity probe.




It is evident that a strong correlation exists between REPE and the presence or absence 
of aromatic properties. 

The change from very aromatic to very antiaromatic nature as we go from a
six-membered to a four-membered system is striking. If we examine the HMO energy
levels, it is not difficult to see why these monocycles are so different. All six π electrons
in benzene occupy bonding MOs, whereas cyclobutadiene has two bonding and two
nonbonding electrons (XII). The situation is optimal for benzene not only because
all of its π electrons are bonding, but also because all of the bonding levels are fully
occupied. Because all monocycles have a nondegenerate lowest level followed by
higher-energy pairs of degenerate levels, it is not hard to describe the conditions that
should produce maximum stability. There should be 4n + 2 electrons (2 for the lowest
level and 4 for each of the n higher bonding levels), and there should be more than
4n centers (either 4n + 1, 4n + 2, or 4n + 3) to force the n doubly degenerate, occupied
levels to all be bonding. The stability of cyclopentadienyl anion (6 electrons, 5 centers)
and cycloheptatrienyl cation (6 electrons, 7 centers) was correctly predicted from these
simple considerations (XIII). Extensive research has gone on in efforts to find examples


Figure 8-24 » Resonance energy per electron for a number of molecules. (From Schaad and
Hess [18].) [Molecular structures not recovered; surviving entries are listed in their original order.]

REPE (eV) (|β| = 1.4199 eV)    Chemical properties
(value not recovered)          Very stable; undergoes substitution only under ...
0.078
                               Simple derivatives isolated; undergo electrophilic substitution
0.061
−0.003                         Isolated, but reactive
−0.006                         Isolated, but reactive
−0.026                         Me derivative observed spectroscopically at −196°C
−0.038                         Never prepared
−0.047                         Prepared in very dilute solution; extremely reactive
−0.381                         Prepared at 8°K, vanished at 35°K

(XII) [Energy-level diagrams showing the π-electron occupation of the HMO levels of benzene and cyclobutadiene.]


where n ≠ 1, but results are often complicated by angle strain and deviation from
planarity in the molecule.¹⁵


EXAMPLE 8-7 What would the above approach predict for the relative stabilities
of cyclic ions C₅H₅⁺ and C₅H₅⁻?

SOLUTION » C₅H₅⁻, the cyclopentadienyl anion shown in XIII, satisfies the 4n + 2 rule and
should be unusually stable. C₅H₅⁺ has only four π electrons, hence should be less stable. (It has
two fewer bonding electrons than C₅H₅⁻.)
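
The level-filling argument of Example 8-7 can be checked numerically. The sketch below diagonalizes the HMO matrix of an N-membered ring, fills the levels two electrons at a time, and reports whether the highest occupied level is completely filled and separated from the next one. The names and the closed-shell test are illustrative choices, not part of the text.

```python
import numpy as np


def ring_levels(n_centers):
    """HMO levels x of a monocycle (E = alpha + x*beta), most bonding first."""
    h = np.zeros((n_centers, n_centers))
    for i in range(n_centers):
        j = (i + 1) % n_centers
        h[i, j] = h[j, i] = 1.0
    return np.sort(np.linalg.eigvalsh(h))[::-1]


def fill_monocycle(n_centers, n_electrons, tol=1e-8):
    """Return (pi energy in beta, bonding electrons, closed-shell flag)."""
    levels = ring_levels(n_centers)
    occupations = np.zeros(n_centers)
    electrons_left = n_electrons
    for k in range(n_centers):
        occupations[k] = min(2, electrons_left)
        electrons_left -= occupations[k]
    pi_energy = float(occupations @ levels)
    n_bonding = float(np.sum(occupations[levels > tol]))
    homo = int(np.max(np.nonzero(occupations)[0]))
    full = occupations[homo] == 2
    # Closed shell: HOMO doubly occupied and not degenerate with the next level.
    separated = homo == n_centers - 1 or levels[homo] - levels[homo + 1] > tol
    return pi_energy, n_bonding, bool(full and separated)


anion = fill_monocycle(5, 6)   # cyclopentadienyl anion, 6 pi electrons
cation = fill_monocycle(5, 4)  # cyclopentadienyl cation, 4 pi electrons

print("C5H5-:", anion)
print("C5H5+:", cation)
```

The anion has six bonding electrons in a closed shell, whereas the cation has four and leaves a degenerate pair of levels half filled, which is the distinction drawn in the solution above.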


In a molecule containing both cyclic and acyclic parts, does the RE arise only from 
the cyclic part? For certain cases, the answer is yes. Dewar has recognized that side or 
connecting chains contribute nothing to the RE of a system when the chain in question 
is the same in all formal structures for the molecule. For example, stilbene can be 
written in four equivalent formal ways, as shown below. The linking chain is identical 
in all cases, and it should not contribute to the RE. Calculations confirm that the RE 
for stilbene is just double that for benzene. 


(XIII) [Structures: cyclopentadienyl anion; cycloheptatrienyl cation.]


The intimate relation between aromaticity and the possibility for more than one 
equivalent formal structure for a molecule has long been recognized. These “mobile” 
bonds tend to favor equal bond lengths in contrast to the strong alternation characteristic 
of acyclic polyenes. In benzene, the extra stabilization may be viewed as resulting from 
the fact that all six bonds are identical and have a higher bond order (0.667) than the 
average of 0.634 for double and single acyclic bonds. In antiaromatic cyclobutadiene, 
the four identical bonds have a bond order of 0.5, which is significantly below the 


¹⁵Research on aromaticity has been extended to Möbius conjugated hydrocarbons, where aromatic
behavior should be associated with 4n electrons (see Herges [21]), and also to clusters of boron or silicon atoms
(see Ritter [22]), as well as metal atoms (see Chen et al. [23]).

(XIV) [Four equivalent formal structures of stilbene.]


average for acyclics. The very different bond orders in these two molecules are fully
consistent with the energy argument based on the 4n + 2 rule described earlier.¹⁶
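
The two ring bond orders quoted above can be reproduced as follows. This sketch assumes the usual HMO bond-order definition and, for cyclobutadiene, spreads the two highest-energy electrons evenly over the degenerate nonbonding pair; that averaging is an assumption of the example, chosen so that all four bonds come out identical.

```python
import numpy as np


def ring_bond_order(n_centers, n_electrons, tol=1e-8):
    """pi-bond order between neighbors in a monocycle.

    Electrons fill levels from the most bonding upward; a partly filled
    degenerate set shares its electrons equally (an assumption).
    """
    h = np.zeros((n_centers, n_centers))
    for i in range(n_centers):
        j = (i + 1) % n_centers
        h[i, j] = h[j, i] = 1.0
    energies, coeffs = np.linalg.eigh(h)
    order = np.argsort(energies)[::-1]      # most bonding (largest x) first
    energies, coeffs = energies[order], coeffs[:, order]

    occupations = np.zeros(n_centers)
    remaining = float(n_electrons)
    start = 0
    while start < n_centers and remaining > 0:
        stop = start
        while stop < n_centers and abs(energies[stop] - energies[start]) < tol:
            stop += 1
        placed = min(2.0 * (stop - start), remaining)
        occupations[start:stop] = placed / (stop - start)
        remaining -= placed
        start = stop

    density = (coeffs * occupations) @ coeffs.T
    return float(density[0, 1])


benzene = ring_bond_order(6, 6)
cyclobutadiene = ring_bond_order(4, 4)
print(f"benzene:        {benzene:.3f}")         # 0.667
print(f"cyclobutadiene: {cyclobutadiene:.3f}")  # 0.500
```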

To summarize, the π bonds in acyclic polyenes exhibit delocalization in the sense
that bond orders are not transferable from one molecule to another, additivity in effective
bond energies, and immobility in formal bonds. Cyclic molecules do not exhibit additivity
of effective energies or immobility of formal bonds. Their energy deviations from
energy calculated assuming additivity and immobility are good indicators of kinetic
and thermodynamic stability. Aromatic molecules possess extra stability because their
π electrons¹⁷ are more bonding than those in acyclic polyenes. Antiaromatics are
unstable because their π electrons are less bonding.

From a consideration of experimental heats of atomization, Schaad and Hess have
evaluated β to be −1.4199 eV. The physical processes involved in dissociating a
gas-phase molecule into constituent atoms are quite different from those involved in adding
or removing a π electron from a molecule in a solvent. Therefore, it is not surprising
that the β value obtained from heats of atomization differs substantially from values
obtained from redox experiments. Indeed, it is this variability of β as we compare HMO
theory with different types of experiment that compensates for many of the oversights
and simplifications of the approach. It is remarkable that, with but one such parameter,
HMO theory does as well as it does.


8-17 Extension to Heteroatomic Molecules 


The range of application of the HMO method could be greatly extended if atoms other
than carbon could be treated. Consider pyridine as an example (XV). A π electron at
a carbon atom contributes an energy α to $E_\pi$. The contribution due to a π electron at
nitrogen is presumably something different. Let us take it to be α′ = α + hβ, where
h is a parameter that will be fixed by fitting theoretical results to experiment. If the
π electron is attracted more strongly to nitrogen than to carbon, h will be a positive


¹⁶It has been noted that bond length equalization associated with bond mobility results in π energy lowering
when the C-C-C angles are near 120°. When the angle is very different from this, π energies are higher than
expected. This has led to suggestions that “strain energy” may be an important factor in aromaticity. Because
of the present lack of a quantum mechanical quantity equivalent to strain energy, and because the HMO method
may include effects of σ electrons in an implicit but poorly understood way, it is very hard to know whether such
suggestions are at variance with other statements or are simply equivalent to them but stated from a different
viewpoint.

¹⁷It is not necessarily true that all the “extra” stability of aromatic molecules is attributable to π-electron effects;
σ-electron energies also depend on bond lengths and bond angles. Hence, we may be seeing, once again, a situation
where the π-electron treatment includes other effects implicitly. Schaad and Hess [18] indicate that σ energies
and π energies are indeed simply related over the bond-length range of interest.
number. In a similar spirit, we will take the energy of a π electron in a C-N bond to be
β′ = kβ and evaluate k empirically. Not surprisingly, the values of h and k appropriate
for various heteroatoms depend somewhat on which molecules and properties are used
in the evaluation procedure. A set of values compiled and critically discussed by
Streitwieser [7] is given in Table 8-3. Other sets have been published.¹⁸

The dots over each symbol indicate the number of π electrons contributed by the
atom. In pyridine, the formal bond diagram indicates a six π-electron system, implying
that the nitrogen atom contributes one π electron. We also can argue that, of the five

TABLE 8-3 » Parameters for Heteroatoms in the Hückel Method(a)

Heteroatom                       h                Heteroatomic bond    k
Ṅ                                0.5              C⋯Ṅ                  1.0
N̈                                1.5              C-N̈                  0.8
N⁺                               2.0
Ȯ                                1.0              C=Ȯ                  1.0
Ö                                2.0              C-Ö                  0.8
O⁺                               2.5              N-Ö                  0.7
F                                3.0              C-F                  0.7
Cl                               2.0              C-Cl                 0.4
Br                               1.5              C-Br                 0.3
S′ (b)                           0.0              C-S′                 0.8
S″ (b)                           0.0              C-S″                 0.8
                                                  S′-S″                1.0
Methyl (inductive, Cα-Me)        h(Cα) = −0.5     none
Methyl (heteroatom, Cα-Me)       h(Me) = 0.2      Cα-Me                0.7
Methyl (conjugative, Cα-C≡H3)    h(Cα) = −0.1     Cα-C                 0.8
                                 h(C) = −0.1      C≡H3                 3.0
                                 h(H3) = −0.5

(a) Consistent with the philosophy of this approach is a distinction between single, double, and intermediate C-C
bonds. Streitwieser recommends k = 0.9 for single (C-C), k = 1.0 for intermediate (C⋯C), and k = 1.1 for
double (C=C) bonds.

(b) Sulfur is treated as a pair of AOs with a total of two π electrons, i.e., a sulfur in an aromatic ring is formally
treated as two adjacent atoms S′ and S″ with the indicated parameters.


¹⁸See McGlynn et al. [24, p. 87].
valence electrons of nitrogen, two are involved in σ covalent bonds with neighboring
carbons, two more are in a σ lone pair, leaving one for the π system. Therefore, the atom
parameter to use for this molecule is h = 0.5. The pyridine ring, like benzene, admits two
equivalent structural formulas, and so the C-N bonds should be intermediate between
double and single, symbolized C⋯N in Table 8-3. Since k = 1.0 in this case, β′ = β, and
pyridine will have an HMO determinant differing from the benzene determinant only
in the diagonal position corresponding to the nitrogen atom, that is, the 1, 1 position according
to our (arbitrary) numbering scheme. For this position, instead of x, we will have


$x' = (\alpha' - E)/\beta = (\alpha + 0.5\beta - E)/\beta = (\alpha - E)/\beta + 0.5\beta/\beta = x + 0.5$    (8-64)
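
A numerical version of the pyridine calculation may help make Eq. (8-64) concrete. The sketch below works with the matrix whose diagonal elements are h and whose off-diagonal elements are k (both in units of β), so that an eigenvalue λ corresponds to E = α + λβ and to a root x = −λ of the HMO determinant. The parameters h = 0.5 and k = 1.0 are those chosen in the text; the function name and return layout are illustrative.

```python
import numpy as np


def heteroatom_huckel(h_params, bonds, n_electrons):
    """Solve an HMO problem with heteroatom parameters.

    h_params: list of h values (alpha_i = alpha + h_i * beta)
    bonds:    list of (i, j, k) with beta_ij = k * beta
    Returns (levels, pi densities, coefficient of beta in E_pi).
    """
    matrix = np.diag(np.asarray(h_params, dtype=float))
    for i, j, k in bonds:
        matrix[i, j] = matrix[j, i] = k
    levels, coeffs = np.linalg.eigh(matrix)  # ascending order
    n_occ = n_electrons // 2
    occupied = coeffs[:, -n_occ:]            # largest values are most bonding
    densities = 2.0 * np.sum(occupied**2, axis=1)
    pi_energy = 2.0 * float(np.sum(levels[-n_occ:]))
    return levels[::-1], densities, pi_energy


# Pyridine: atom 0 is nitrogen (h = 0.5); every ring bond has k = 1.0.
ring_bonds = [(i, (i + 1) % 6, 1.0) for i in range(6)]
levels, densities, pi_energy = heteroatom_huckel(
    [0.5, 0.0, 0.0, 0.0, 0.0, 0.0], ring_bonds, 6
)

print("levels (units of beta):", np.round(levels, 3))
print("pi densities:", np.round(densities, 3))
print(f"E_pi = 6 alpha + {pi_energy:.3f} beta")
```

With a positive h the nitrogen attracts more than one π electron, and the total π energy lies below the benzene value of 6α + 8β.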


The pyrrole molecule has a nitrogen atom of the type N̈ (XVI). Since three valence
electrons of nitrogen are in covalent σ bonds, two remain for inclusion in the π system.

(XVI) [Structural formula of pyrrole.]


Therefore, pyrrole has a total of six π electrons. The unique structural formula indicates
that the C-N bond is formally single, and k = 0.8, h = 1.5 are the appropriate parameters
here. Also, the carbon-carbon bonds are now formally single or double. If we choose to
distinguish among these bonds using the parameters in note a of Table 8-3, the resulting
HMO determinant is

$$
\begin{vmatrix}
x + 1.5 & 0.8 & 0 & 0 & 0.8 \\
0.8 & x & 1.1 & 0 & 0 \\
0 & 1.1 & x & 0.9 & 0 \\
0 & 0 & 0.9 & x & 1.1 \\
0.8 & 0 & 0 & 1.1 & x
\end{vmatrix} = 0
$$

The methyl group can also be incorporated into the HMO method. Several
approaches have been suggested. One is simply to modify the coulomb integral α for
the carbon to which the methyl group is attached. A methyl group is thought to release
sigma electrons to the rest of the molecule as compared to a substituent hydrogen. This
suggests that an atom having a methyl group attached to it will be a bit electron rich
and hence will be less attractive to π electrons. Use of a negative h parameter for this
carbon is appropriate. This method is called the inductive model. Use of the inductive
model does not add any new centers or any more π electrons to the conjugated system
to which the methyl group is attached: the carbon to which the methyl is bonded is
merely treated as a less attractive atom. A second approach is to treat the methyl group
itself as a heteroatom. As we shall see shortly, the methyl group has two electrons that
can participate (to a slight extent) in the π system, and so use of this heteroatom model
adds one more center and two more π electrons for each methyl group included in this
way. A third approach is the conjugative model. Because the methyl group has local
threefold symmetry, one of the σ MOs for the methyl group resembles the π MOs
of cyclopropenyl in symmetry characteristics. The three types of symmetry solutions
are given in Fig. 8-25 for a methyl group on a benzene ring (compare with Fig. 8-7).
Notice that the MO at the right of the figure has its phases arranged so that it has
nonzero overlap with a π AO on the benzene ring. This means that the two electrons in
this “methyl group MO” can participate in the π system of the molecule. To emulate
this picture in our HMO determinant, we must add two π electrons and two more
centers to our system (one for C and one for H3) and find h and k values for the two
new centers and bonds. The inductive effect of the methyl group on the neighboring
ring carbon is often included in this model. In most cases, it is probably safe to say
that the conjugative model is superior to the heteroatom model, which is, in turn, better
than the inductive model, but no extensive critical comparison of these three models
has been made. Parameters for all three approaches are included in Table 8-3. (Cα is
the ring carbon.)

Figure 8-25 » Three MO symmetry solutions for a methyl group attached to a benzene ring.


8-18 Self-Consistent Variations of α and β


Efforts have been made to improve the HMO method by taking account of molecular
π charge distribution. Suppose that we carry out an HMO calculation on a nonalternant
molecule and find an electron density of 1.2 at one carbon and 0.8 at another. It is
reasonable to argue that a π electron at the latter carbon is more strongly bound because
it experiences less repulsion from other π electrons there. We can try to account for
this by making α at that atom more negative. Thus, we could take


$\alpha_i' = \alpha_i + \omega(1 - q_i)\beta$    (8-65)


where $q_i$ is the π-electron density at atom i and ω is a parameter (assumed positive)
to be fixed empirically. If $q_i < 1$, then $\alpha_i'$ is more negative than $\alpha_i$. If $q_i > 1$, $\alpha_i'$ is
less negative. Having now modified α (using a trial value for ω), we must set up
our new HMO determinant and solve it again. This yields new MOs, new values
of $q_i$, and therefore new values of $\alpha_i'$. We repeat this process over and over until
electron densities remain essentially unchanged for two successive iterations. At this
point, the electron densities leading to the HMO determinant are the same as those
produced by the determinant, and the solution is said to be self-consistent with respect
to electron densities. This procedure, often referred to as the “ω technique,” discourages
extreme deviations of electronic densities from the “norm” of unity at each carbon and
thereby helps to compensate for the lack of explicit inclusion of π-electron repulsion
in the HMO method. Streitwieser's calculations have led him to favor a value for
ω of 1.4.
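
The iterative cycle just described can be sketched in a few lines. The example uses fulvene (a five-membered ring with one exocyclic carbon) as a convenient nonalternant test molecule, which is a choice made here rather than in the text. It also mixes old and new densities at each step (the `mixing` parameter) to keep the iteration from oscillating; that damping is an assumption of this sketch and is not part of the procedure as stated above.

```python
import numpy as np


def pi_densities(h_matrix, n_electrons):
    """pi-electron density at each atom for a closed-shell filling."""
    _, coeffs = np.linalg.eigh(h_matrix)        # eigenvalues ascending
    occupied = coeffs[:, -(n_electrons // 2):]  # largest x = lowest energy
    return 2.0 * np.sum(occupied**2, axis=1)


def omega_technique(n_atoms, bonds, n_electrons, omega=1.4,
                    mixing=0.3, tol=1e-6, max_iter=500):
    """Iterate alpha_i' = alpha + omega*(1 - q_i)*beta to self-consistency.

    Returns (simple HMO densities, self-consistent densities).
    """
    connectivity = np.zeros((n_atoms, n_atoms))
    for i, j in bonds:
        connectivity[i, j] = connectivity[j, i] = 1.0
    q_hmo = pi_densities(connectivity, n_electrons)
    q = q_hmo.copy()
    for _ in range(max_iter):
        # Diagonal elements in units of beta: h_i = omega * (1 - q_i).
        h_matrix = connectivity + np.diag(omega * (1.0 - q))
        q_out = pi_densities(h_matrix, n_electrons)
        if np.max(np.abs(q_out - q)) < tol:
            break
        q = (1.0 - mixing) * q + mixing * q_out  # damped update (assumption)
    return q_hmo, q


# Fulvene: ring atoms 0-4, exocyclic carbon 5 attached to atom 0.
fulvene_bonds = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 0), (0, 5)]
q_hmo, q_scf = omega_technique(6, fulvene_bonds, 6)

print("simple HMO densities:     ", np.round(q_hmo, 3))
print("omega-technique densities:", np.round(q_scf, 3))
```

Comparing the two printed rows shows the effect described in the text: the self-consistent densities lie closer to unity than the simple HMO values.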

A similar idea has been applied to variations of the bond integral β. Suppose that
we carry out an HMO calculation and find a π-bond order of 0.5 in one bond and
0.9 in another. We expect that the latter bond is in fact shorter than the former. We
could roughly predict how much shorter it is by using the bond-order-bond-length
relation described earlier. It is reasonable to modify β in these bonds on the basis
of predicted length differences, set up a new HMO determinant, solve again, find
new bond orders, and iterate until self-consistency is achieved with respect to bond
orders.

These modifications to the simple HMO method improve predictions of some
properties but not others. For example, Brogli and Heilbronner [13] have found that orbital
energy correlation with ionization energy, determined by photoelectron spectroscopy,
is significantly improved through inclusion of the effects of bond length variations in
the neutral molecule and the cation. This improved correlation, shown in Fig. 8-26,
showed no additional improvement upon subsequent variation of α as a function of
electron density. This is reasonable, since β variation affects primarily bond order, hence
MO energy, and that is the property measured by photoelectron spectroscopy. Variation
of α shifts charge from atom to atom, but has smaller energy effects. On the other
hand, π-electron contributions to dipole moments, calculated from electronic excess
or deficiency at each center, are very sensitive to variation of α, and quite insensitive
to variation of β. Again, this is reasonable because dipole moments are sensitive to
electronic distribution rather than MO energy.

Figure 8-26 » Experimental ionization energy for alternant and nonalternant hydrocarbons versus
orbital energy using a modified HMO technique which includes provisions for bond length variation
in molecule and cation. (Compare with Fig. 8-22.) (Axes: HMO values for IE (eV) versus
experimental IE (eV).)


8-19 HMO Reaction Indices


In this section, we discuss some applications of the HMO method to reactivities of
conjugated molecules. The reactions of conjugated molecules that have received most
of this theoretical treatment are:

1. electrophilic aromatic substitution (XVII)

2. nucleophilic aromatic substitution (XVIII)

3. radical addition (XIX)

[Reaction schemes (XVII) to (XIX) not recovered; the surviving fragments show H⁺ as a product in (XVII)
and LiH as a product in (XVIII).]

For any of these reactions, we imagine there to be a path of least energy connecting
reactants with products. For the two distinct reaction positions, 1 and 2 on naphthalene,
the activation energies ε may differ, as indicated in Fig. 8-27.

The problem is somehow to relate the differences in ε (inferred from relative rate
data) to a number based on quantum chemical calculations. To do this in a sensible way
requires that we have some idea of the detailed way in which the reaction proceeds;
we have to know what the reaction coordinate is. In some cases, this is fairly well
known. For electrophilic aromatic substitution reactions, evidence suggests that a
positive electrophile (e.g., Cl⁺) approaches the substrate (say, naphthalene). As it
draws closer, it causes a significant polarization of the π-electron charge distribution,
drawing it toward the site of attack. Ultimately, it forms a partial bond with the carbon.
At this stage, the carbon has already begun to loosen its bond to hydrogen, but it is at
least partially bonded to four atoms (XX). This means that the ability of the carbon to
participate in the aromatic system is temporarily hampered. At this point, the system is
at or near the transition state. Thereafter, as H⁺ leaves, the potential energy decreases
and Cl moves into the molecular plane. A similar detailed mechanism is thought to
apply for nucleophilic reactions except that the attacking group is negative.

Figure 8-27 » Generalized energy versus reaction coordinate for reaction at two positions in
naphthalene. (Vertical axis: potential energy; the reaction coordinate runs from reactants through
the transition state to products.)


[Structure (XX): the attacked ring carbon, partially bonded to H and Cl as well as to its two ring neighbors.]


For attack by a neutral radical, electrostatic attraction and charge polarization should 
not be significant factors. The radical bonds to the site of attack to produce a more-or-less 
tetrahedrally bonded carbon. This again leads to an interruption of the π system, 
but now it is not temporary as it was in the substitution reactions. 

Based on these simple pictures, a number of MO quantities, often referred to as 
reaction indices, have been proposed as indicators of preferred sites for reaction. It is 
useful to divide these into two categories: those purporting to relate to early stages of 
the reaction, and those specifically related to the intermediate stage. 

Perhaps the most obvious reaction index to use for the earliest stages of electrophilic 
or nucleophilic reactions is the π-electron density. If Cl⁺ is attracted to π charge, 
it should be attracted most to those sites where π density is greatest. (Such an ion 
should be attracted to sites having excess σ charge density also, but our basic HMO 
assumptions ignore any variations in σ density.) For an alternant hydrocarbon like 
naphthalene, all π densities are unity, so this index is of no use. For nonalternant 
molecules, however, it can be quite helpful. Azulene has varying HMO π densities 
(XXI). (More sophisticated calculations described in future chapters are in qualitative 
agreement with these π-electron density variations.) Experimentally, it is found that 
electrophilic substitution by Cl⁺ occurs almost entirely at position 1 (or 3). Nucleophilic 
substitution by CH₃⁻ (from CH₃Li) occurs at the position of least π-electron density, 
namely 4 (or 8). 
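
The azulene π densities discussed above can be checked with a short numerical sketch. It assumes the simple HMO model (all α equal, all β equal, overlap neglected) and uses NumPy to diagonalize the connectivity matrix. The labels 9 and 10 for the two ring-fusion carbons and the helper name `huckel_mos` are illustrative choices, not notation from the text.

```python
import numpy as np

# Azulene sigma-bond connectivity. Atoms 1-8 follow the numbering used in the text;
# labels 9 and 10 for the two ring-fusion carbons are an assumption made here
# purely for bookkeeping.
AZULENE_BONDS = [(1, 2), (2, 3), (3, 9), (9, 4), (4, 5), (5, 6),
                 (6, 7), (7, 8), (8, 10), (10, 1), (9, 10)]


def huckel_mos(n_atoms, bonds):
    """Return Hückel roots x (E = alpha + x*beta) and MO coefficients.

    MOs are the columns of the returned matrix, ordered from most bonding
    (largest x) to most antibonding.
    """
    adjacency = np.zeros((n_atoms, n_atoms))
    for r, s in bonds:
        adjacency[r - 1, s - 1] = adjacency[s - 1, r - 1] = 1.0
    x, coeffs = np.linalg.eigh(adjacency)   # eigh returns roots in ascending order
    order = np.argsort(-x)                  # re-sort so bonding MOs come first
    return x[order], coeffs[:, order]


x, coeffs = huckel_mos(10, AZULENE_BONDS)
n_occupied = 5                              # ten pi electrons, two per MO
q = 2.0 * (coeffs[:, :n_occupied] ** 2).sum(axis=1)

for atom in (1, 2, 4, 5, 6):
    print(f"q_{atom} = {q[atom - 1]:.3f}")
```

The largest density should appear at atoms 1 and 3 and the smallest at atoms 4 and 8, in line with the observed sites of electrophilic and nucleophilic substitution.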


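[Structure (XXI): azulene with its HMO π-electron densities; the values shown include 1.173 (atom 1), 0.855 (atom 4), 0.986 (atom 5), and 0.870 (atom 6).]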


The charge density index refers to the nature of the molecule before allowance is made 
for perturbing effects due to the approaching reactant. Such a method is often called 
a "first-order" method, a terminology that is discussed more fully in Chapter 12. For 
alternant molecules, it is necessary to proceed to a higher-order method, one that reflects 
the ease with which molecular charge is drawn toward some atom, or pushed away from 
it, as approach by a charged chemical reactant makes that atom more or less attractive for 
electrons. An index that measures this is called atom self-polarizability, symbolized 
π_{r,r}. The formulas for this and related polarizabilities are derived in Chapter 12. For 
now, we simply note that the formula is 


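\[
\pi_{r,r} \equiv \frac{\partial q_r}{\partial \alpha_r}
  = 4 \sum_{j}^{\text{occ}} \sum_{k}^{\text{unocc}} \frac{c_{rj}^{2}\, c_{rk}^{2}}{E_j - E_k}
  \qquad (8\text{-}66)
\]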


A larger absolute value of π_{r,r} means that a larger change in π density q_r occurs 
as a result of making atom r more or less attractive for electrons. (π_{r,r} is negative 
since E_j − E_k is negative. This makes physical sense because it means that if δα_r 
is negative, making atom r more attractive, δq_r is positive, indicating that charge 
accumulates there.) Since the most polarizable site should most easily accommodate 
either a positive- or a negative-approaching reactant, this index should apply 
for both electrophilic and nucleophilic reactions. For naphthalene, the values are 
π_{1,1} = −0.433/|β|, π_{2,2} = −0.405/|β|. This agrees with the experimentally observed 
fact that the 1 position of naphthalene is more reactive for both types of reaction. 
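
As a rough illustration of Eq. (8-66), the sketch below evaluates the atom self-polarizability for positions 1 and 2 of naphthalene. It assumes simple HMO theory, with energies written as E = α + xβ, so that E_j − E_k = −(x_j − x_k)|β|. The labels 9 and 10 for the ring-fusion carbons and the function names are illustrative assumptions.

```python
import numpy as np

# Naphthalene connectivity; atoms 1-8 carry hydrogens, 9 and 10 are the
# ring-fusion carbons (these two labels are an assumption for bookkeeping).
NAPHTHALENE_BONDS = [(1, 2), (2, 3), (3, 4), (4, 10), (10, 5), (5, 6),
                     (6, 7), (7, 8), (8, 9), (9, 1), (9, 10)]


def huckel_mos(n_atoms, bonds):
    """Hückel roots x (E = alpha + x*beta) and MO coefficients (columns),
    ordered from most bonding to most antibonding."""
    adjacency = np.zeros((n_atoms, n_atoms))
    for r, s in bonds:
        adjacency[r - 1, s - 1] = adjacency[s - 1, r - 1] = 1.0
    x, coeffs = np.linalg.eigh(adjacency)
    order = np.argsort(-x)
    return x[order], coeffs[:, order]


x, c = huckel_mos(10, NAPHTHALENE_BONDS)
n_occupied = 5


def atom_self_polarizability(r):
    """Eq. (8-66) for atom r, returned in units of 1/|beta|."""
    total = 0.0
    for j in range(n_occupied):                 # occupied MOs
        for k in range(n_occupied, len(x)):     # unoccupied MOs
            total += c[r - 1, j] ** 2 * c[r - 1, k] ** 2 / (x[j] - x[k])
    # E_j - E_k = (x_j - x_k) * beta, and beta is negative, hence the sign.
    return -4.0 * total


pi_11 = atom_self_polarizability(1)
pi_22 = atom_self_polarizability(2)
print(f"pi_11 = {pi_11:.3f}/|beta|, pi_22 = {pi_22:.3f}/|beta|")
```

Both values come out negative, and the larger magnitude at position 1 reproduces the ordering quoted above.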
Examination of Eq. (8-66) indicates that the MOs near the energy gap between 
filled and empty MOs will tend to contribute most heavily to π_{r,r} because, for these, 
E_j − E_k is smallest. For this reason, the highest occupied and lowest unfilled MOs 
(HOMO and LUMO) are often the determining factor in relative values of π_{r,r}. Fukui¹⁹ 
named these the frontier orbitals and suggested that electrophilic substitution would 
occur preferentially at the site where the HOMO had the largest squared coefficient. In 
nucleophilic substitution, the approaching reagent seeks to donate electronic charge to 
the substrate, so here the largest squared coefficient for the LUMO should determine the 
preferred site. For even alternants like naphthalene, the pairing theorem forces these two 
MOs to have their absolute maxima at the same atom. The HOMO-LUMO coefficients 
for naphthalene are 0.425 and 0.263 for atoms 1 and 2, respectively, in accord with 
our expectations. For the nonalternant molecule azulene, discussed above, the largest 
HOMO coefficient occurs at atoms 1 and 3, which have already been mentioned to 
be the preferred sites for electrophilic attack. The largest LUMO coefficient occurs at 
atom 6, with atoms 4 and 8 having the second-largest value (see Appendix 6). Atoms 
4 and 8 are the preferred sites for nucleophilic attack. Here, then, is a case where 
the charge density and frontier MO indices are not in agreement. The results suggest 
that, when significant π-density variations occur, this factor should be favored over 
higher-order indices. However, the whole approach is so crude that no ironclad rule 
can be formulated. 

19 See Fujimoto and Fukui [25]. 
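
The frontier-orbital coefficients quoted for naphthalene can be reproduced with a few lines of NumPy. This is a minimal sketch under the usual HMO assumptions. The atom labels 9 and 10 for the ring-fusion carbons are an illustrative choice, and the overall sign of each MO is arbitrary, so only absolute values are compared.

```python
import numpy as np

# Naphthalene connectivity; 9 and 10 label the ring-fusion carbons
# (an assumption made here for bookkeeping).
NAPHTHALENE_BONDS = [(1, 2), (2, 3), (3, 4), (4, 10), (10, 5), (5, 6),
                     (6, 7), (7, 8), (8, 9), (9, 1), (9, 10)]

adjacency = np.zeros((10, 10))
for r, s in NAPHTHALENE_BONDS:
    adjacency[r - 1, s - 1] = adjacency[s - 1, r - 1] = 1.0

x, c = np.linalg.eigh(adjacency)      # roots x, with E = alpha + x*beta
order = np.argsort(-x)                # most bonding MO first
x, c = x[order], c[:, order]

homo = np.abs(c[:, 4])                # fifth MO: highest occupied (ten electrons)
lumo = np.abs(c[:, 5])                # sixth MO: lowest unoccupied

print(f"HOMO: |c_1| = {homo[0]:.3f}, |c_2| = {homo[1]:.3f}")
print(f"LUMO: |c_1| = {lumo[0]:.3f}, |c_2| = {lumo[1]:.3f}")
```

The HOMO and LUMO magnitudes coincide atom by atom, as the pairing theorem requires for an even alternant hydrocarbon.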

For radical attack, some other index should be used, for we do not expect electrostatic 
or polarization effects to be important in such reactions. An index called the 
free valence²⁰ has been proposed for free radical reactions. One assumes that the free 
radical begins bonding to a carbon atom in early stages of the reaction and that the ease 
with which this occurs depends on how much residual bonding capacity the carbon has 
after accounting for its regular π bonds. Thus, free valence is taken to be the difference 
between the maximum π bonding a carbon atom is capable of and the amount of π 
bonding it actually exhibits in the unreacted substrate molecule. The extent of π bonding 
is taken as the sum of all the orders of π bonds involving the atom in question. 
A common choice of reference for a maximally bonded carbon is the central atom in 
trimethylenemethane in its planar conformation (which is not the most stable) (XXII). 
Each bond in this neutral system has a π-bond order of 1/√3, and so the total π-bond 
order associated with the central carbon is √3.²¹ Using this as reference, the free 
valence for some atom r in any unsaturated hydrocarbon is defined as 

\[
F_r = \sqrt{3} - \sum_{s}^{\text{neighbors of } r} p_{rs} \qquad (8\text{-}67)
\]

[Structure (XXII): planar trimethylenemethane.]


A common way of representing the situation schematically is indicated in (XXIII) for 
butadiene. Bond orders are indicated on the bonds and free valences by arrows. It is 
clear that butadiene has a good deal more “residual bonding capacity” on its terminal 
atoms, and this is consistent with the fact that free radical attack on butadiene occurs 
predominantly on the end atoms. Other examples of correlation between free valence 
and rate of free radical addition have been reported.²² 

[Structure (XXIII): butadiene, with π-bond orders 0.894 (terminal bonds) and 0.447 (central bond) and free valences 0.838 (terminal atoms) and 0.391 (inner atoms).]

20 See Coulson [26]. 

21 Sometimes the three sigma bonds are included in this calculation, giving 3 + √3. This has no effect on the 
question of relative values of F_r. 

22 See Streitwieser [7] and Salem [1]. 

Figure 8-28. Rate data for methyl radical addition are plotted against the maximum free valence 
found in each molecule. The original “methyl affinity” (Levy and Szwarc [27]) has been multiplied 
by 6/m, where m is the number of sites having maximum free valence. (The asterisks in the figure 
identify these sites.) (Axes: log (methyl affinity) versus F_r (max).) 

A plot of rate data for methyl radical addition to conjugated molecules versus the largest 
free valence of the molecule is shown in Fig. 8-28. (We assume that the kinetics is dominated 
by the atom(s) having the maximum free valence.) 
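
The butadiene bond orders and free valences of (XXIII) follow directly from Eq. (8-67). The sketch below is a minimal NumPy illustration under the simple HMO assumptions; the variable names are illustrative, and small differences in the third decimal place relative to the quoted values can arise from rounding of √3 and of the bond orders.

```python
import numpy as np

BUTADIENE_BONDS = [(1, 2), (2, 3), (3, 4)]
n_atoms, n_occupied = 4, 2            # four pi electrons fill two MOs

adjacency = np.zeros((n_atoms, n_atoms))
for r, s in BUTADIENE_BONDS:
    adjacency[r - 1, s - 1] = adjacency[s - 1, r - 1] = 1.0

x, c = np.linalg.eigh(adjacency)                 # E = alpha + x*beta
occupied = c[:, np.argsort(-x)[:n_occupied]]     # the two most bonding MOs

# p_rs = 2 * sum over occupied MOs of c_rj * c_sj
bond_order = 2.0 * occupied @ occupied.T

# Eq. (8-67): subtract the pi-bond orders to bonded neighbors from sqrt(3).
free_valence = np.sqrt(3.0) - (bond_order * adjacency).sum(axis=1)

print("p_12 =", round(bond_order[0, 1], 3), " p_23 =", round(bond_order[1, 2], 3))
print("F_r  =", np.round(free_valence, 3))
```

Multiplying `bond_order` elementwise by `adjacency` is simply a compact way of restricting the sum to bonded neighbors.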

The indices described above are most appropriate for indicating the relative ease of 
reaction in the early stages. By the time the reactants have reached the transition state, 
the substrate is quite far from its starting condition, so charge densities, polarizabilities, 
and free valences calculated from the wavefunction of the unperturbed molecule may no 
longer be very appropriate. If the energy curves being compared through our indices 
behave in the simple manner described in Fig. 8-27, so that the higher-energy curve in 
early stages is also the higher-energy curve in the region of the transition state, such 
indices can be useful. Also, such simple behavior is very likely to occur when we 
compare a single type of reaction down a series of molecules of similar type, as in 
Fig. 8-28 (see also Problem 8-25). Experience, however, has indicated that indices 
more closely linked to the nature of the transition state are more generally reliable. We 
now describe one such reactivity index. 

We mentioned earlier that addition and substitution reactions are expected to interrupt 
the π system at the site of attack. For substitution reactions, this interruption is only 
temporary and is presumably most severe in the transition state. For addition reactions, 
it is permanent. The localization energy is defined as the π energy lost in this process 
of interrupting the π system. 

As an example, let us return to the naphthalene molecule. The situations resulting 
from interruption of the π system by neutral radical attack at positions 1 and 2 are 
illustrated in (XXIV). The remaining unsaturated fragment is, in each case, a neutral 
radical. [If attack were by a negative ion (nucleophilic), the fragment would be 
topologically the same but would be negatively charged. Likewise, attack by a positive 
electrophilic reagent leads to a positively charged fragment.] No matter where attack 
occurs, our π energy must go from 10α + ⋯β to 9α + ⋯β. This decrease by α is 
thus not expected to differ from case to case and hence is ignored in our localization 
energy calculation. The decrease in π energy is thus 2.299 |β| for attack at position 1 
and 2.480 |β| for position 2. These localization energies indicate that attack at position 
1 should be favored since the energy cost is smaller there, and this is in accord with 
observation. (Localization energies are symbolized L_r^•, L_r^+, L_r^− depending, 
respectively, on whether attack is by a free radical, an electrophilic cation, or a nucleophilic 
anion.) 
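
The radical localization energies for naphthalene can be estimated numerically by deleting the attacked atom from the connectivity matrix and comparing π energies. The sketch below assumes simple HMO theory and fills levels two electrons at a time from the most bonding MO upward. The α term is dropped, as in the text, and the function names and the labels 9 and 10 for the ring-fusion carbons are illustrative.

```python
import numpy as np

# Naphthalene connectivity; 9 and 10 label the ring-fusion carbons
# (an assumption made here for bookkeeping).
NAPHTHALENE_BONDS = [(1, 2), (2, 3), (3, 4), (4, 10), (10, 5), (5, 6),
                     (6, 7), (7, 8), (8, 9), (9, 1), (9, 10)]

adjacency = np.zeros((10, 10))
for r, s in NAPHTHALENE_BONDS:
    adjacency[r - 1, s - 1] = adjacency[s - 1, r - 1] = 1.0


def pi_bonding_energy(matrix, n_electrons):
    """Coefficient of beta in the HMO pi energy for the given electron count."""
    x = np.sort(np.linalg.eigvalsh(matrix))[::-1]   # most bonding root first
    per_electron = np.repeat(x, 2)                  # each level holds two electrons
    return per_electron[:n_electrons].sum()


def radical_localization_energy(matrix, r):
    """Pi energy lost (in units of |beta|) when atom r leaves the pi system."""
    n = matrix.shape[0]
    keep = [i for i in range(n) if i != r - 1]
    fragment = matrix[np.ix_(keep, keep)]           # nine-center radical
    return pi_bonding_energy(matrix, n) - pi_bonding_energy(fragment, n - 1)


L1 = radical_localization_energy(adjacency, 1)
L2 = radical_localization_energy(adjacency, 2)
print(f"L_1 = {L1:.3f} |beta|, L_2 = {L2:.3f} |beta|")
```

Changing the electron count of the fragment to n − 2 or n would give the corresponding electrophilic or nucleophilic localization energies under the same assumptions.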


[Structure (XXIV): naphthalene (E_π = 10α + 13.683β) and the nine-center radicals that remain when one carbon is saturated. Attack at position 1 leaves a fragment with E_π = 9α + 11.384β; attack at position 2 leaves a fragment with E_π = 9α + 11.203β.]


It is interesting to compare the various indices we have discussed for a single 
molecule to see how well they agree. Data for azulene are collected in Table 8-4. 
Experimentally, azulene is known to preferentially undergo electrophilic substitution 
at positions 1 (and 3), nucleophilic substitution at positions 4 (and 8), and radical 
addition in positions 1 (and 3). Consider first electrophilic reaction. Examining the table 
indicates that position 1 is heavily favored by q_r, HOMO distribution, and L_r^+. The 
only other index relevant for this process, π_{r,r}, favors position 4. For nucleophilic 
reaction, q_r, π_{r,r}, and L_r^− all favor position 4. The LUMO index favors position 6, but not 
decisively over position 4. For radical addition, LUMO favors position 6, whereas F_r 
and L_r^• favor position 4. The latter two indices, however, favor 4 over 1 by only a 
slight margin. Thus, for a nonalternant molecule like azulene, these numbers are not 
completely trustworthy and must be interpreted with caution. One difference between 
a molecule like anthracene and one like azulene is that all the C-C-C angles in the 
former molecule are similar (~120°) whereas in azulene they differ. One might anticipate 
that the smaller angles in the five-membered subunit, being already closer to the 
tetrahedral angle characteristic of saturated carbons, would allow easier substitution or 
addition than would be the case in the seven-membered subunit. This factor is ignored 
in our calculations of L_r^• and might easily tip the balance to favor position 1 over 
position 4 for radical attack since L_1^• and L_4^• are so close in value. In short, these HMO 
reactivity index approaches are once again techniques that ignore many aspects of the 
physical processes being followed. It seems likely that many of these will cancel out of 
comparisons among similar molecules, but dissimilarities between or within molecules 
(most often encountered in nonalternant systems) will cause such cancellations to be 
less complete. For more discussion of these and other HMO reaction indices, the reader 
is referred to more specialized discussions.²³ Application of some of these indices to 
carcinogenicities of polycyclic aromatic hydrocarbons has been reviewed.²⁴ 

TABLE 8-4. Reactivity Indices for Azulene 

r (atom no.)   q_r     |π_{r,r}|   HOMO c_r²   LUMO c_r²   F_r     L_r^+ (|β|)   L_r^• (|β|)   L_r^− (|β|)
1              1.173   0.425       0.2946      0.0040      0.480   1.924         2.262         2.600
2              1.047   0.419       0.0000      0.0997      0.420   2.362         2.362         2.362
4              0.855   0.438       0.0256      0.2208      0.482   2.551         2.240         1.929
5              0.986   0.429       0.1126      0.0104      0.429   2.341         2.341         2.341
6              0.870   0.424       0.0000      0.2610      0.454   2.730         2.359         1.988


8-20 Conclusions


In this chapter, we have seen how certain basic features of molecular structure manifest 
themselves in molecular properties. The connectedness, or σ bond network, defines 
the bond positions where π electrons can congregate to lower the energy of the system. 
The extent of congregation is a useful measure of bond length and also is directly 
contributory to the total energy of the system. 

23 See Streitwieser [7], Salem [1], Dewar [14], and Klopman [28]. 
24 See Lowe and Silverman [29]. For a dialogue for nonspecialists by the same authors, see [30]. 

The symmetry restrictions for MOs have been emphasized. The fairly successful 
correlation of HMO results with certain experimental measurements suggests that the 
method effectively accounts for the controlling factors in some molecular properties. 

However, we have omitted discussion of other properties that correlate only poorly 
with HMO theory. Notable in this regard are spectral energies. The natural idea of 
relating a spectral transition energy to a difference between orbital energies has not 
been very successful in HMO theory, except when one restricts attention to a particular 
band in a series of related molecules. This is at least partly due to improper handling of 
electron exchange symmetry in the HMO method leading to an inability to distinguish 
between different states of a given configuration. 

The HMO method can be an extremely instructive way to approach a problem since 
it can describe the manner in which certain important factors are operating. Also, 
for some situations, its predictive power is rather good. However, the limitation to 
conjugated systems, the reliance on an increasing number of parameters as extensions 
are made, the inability to conform to some kinds of experimental measurement, and 
the conceptual slipperiness of the quantities used in the method have all contributed to 
a decline of interest in further development of HMO theory. More powerful computers 
have made it possible for more complicated but better defined methods to be used. 


8-20.A Problems 


8-1. Show that, if Ĥ = Ĥ₁ + Ĥ₂ + Ĥ₃, and Ĥ_i φ_j(i) = E_j φ_j(i), then ψ_prod = 
φ₁(1)φ₂(2)φ₃(3) and ψ_det = |φ₁(1)φ₂(2)φ₃(3)| are both eigenfunctions of Ĥ 
and have the same eigenvalue. Show also that ψ = (1/√2)[φ₁(1)φ₂(2)φ₃(3) + 
φ₁(1)φ₂(2)φ₄(3)] is an eigenfunction of Ĥ if and only if E₃ = E₄. 


8-2. Set up the HMO determinant for each of the following molecules: 


[Structures (a) and (b) not reproduced.]

(c) H₃C—CH₂—CH₂—CH=CH₂    (d) H₂C=CH—CH₂—CH₂—CH=CH₂

(e) H—C≡C—H 


8-3. Set up and solve the Hückel determinantal equation for 2-allylmethyl (also called 
trimethylenemethane) (XXV). Display the orbital energy levels and indicate the 
electron configuration for the neutral ground state. Calculate E_π. Find the coefficients 
for all MOs. (Be sure that degenerate MOs are orthogonal.) Calculate the 
charge densities and bond orders. 


8-4. Suppose that two MOs of a molecule are given by the formulas 


$i = (1/V3) x1 + (1/43) xo + (1/3) xs, 
bo = (1/3) ya + (1/3) xa + (0/A/3) xs, 


where the x's are AOs which are assumed to be orthonormal. By inspection, what 
is the overlap between these MOs? 


8-5. Given the following two degenerate MOs for cyclobutadiene (assumed square 
planar): 


[Sketches of the two degenerate MOs, φ₁ and φ₂, not reproduced.]


Use the Schmidt procedure to obtain a normalized MO that has the same energy 
as φ₂ but is orthogonal to φ₁. 


8-6. For cyclooctatetraene (in its idealized, but incorrect, planar, octagonal form), see 
if you can answer the following without reference to tabulations: 


a) What is the HMO energy of the highest energy MO? 
b) Sketch this MO (from above the molecule) showing signs and magnitudes 
(actual numbers) of the MO coefficients. 


8-7. a) What is the Hückel orbital energy for the following MO? (Assume that all 
centers are carbons.) [Hint: Use Eq. (8-56).] 


[Sketch of the MO with its coefficients not reproduced.]
b) Calculate the contributions to bond orders due to one electron in this MO. 
c) Calculate the Hückel energy of the following MO. (Figure out the value of a 
if you need to use it.) 


8-8. Without performing an HMO calculation, sketch the MOs for the pentadienyl 
radical. Use the particle-in-a-box solutions and the pairing theorem as a guide. 


8-9. Which, if any, of the following systems would you expect to exhibit Jahn-Teller 
distortion? Indicate your thinking. (You should be able to answer without 
reference to HMO data tables.) (a) Benzyl radical, C₇H₇. (b) Cyclopentadienyl 
radical, C₅H₅. (c) Cyclobutadienyl radical cation, C₄H₄⁺. (d) Benzene 
cation, C₆H₆⁺. 


8-10. The bond lengths given in Table P8-10 have been reported for ovalene (XXVI). 
Using a library source or a computer program, obtain HMO bond orders for 
ovalene and calculate theoretical bond lengths using a relation from Section 8-12. 
Make a comparison plot for ovalene of the type shown in Fig. 8-16. (See caption 
of Fig. 8-16 for values of k and s.) 


TABLE P8-10. 

Bond     Length (Å)     Bond     Length (Å)
1-2      1.445          1-10     1.401
2-3      1.354          6-7      1.419
3-4      1.432          4-25     1.411
4-5      1.429          24-25    1.366
5-6      1.429          5-22     1.424
6-1      1.425          7-20     1.435

[Structure (XXVI): ovalene, with atom numbering; not reproduced.]


8-11. Horrocks et al. [31] report experimental bond lengths for quinoline complexed 
to nickel. They display a comparison plot that uses theoretical data from an 
MO method more refined than the simple HMO method. Using the appropriate 
heteronuclear parameters, perform on the computer an HMO calculation for 
quinoline. Calculate theoretical C-C bond distances and compare them with the 
experimental and theoretical data of Horrocks et al. 


8-12. When the molecule CH₂=CH—CH=O absorbs light of a certain frequency, a 
lone-pair electron on oxygen (called “n” for nonbonding) is promoted to the 
lowest empty π MO of the molecule (called π*; hence, an n → π* transition). 
Assuming that the π MOs of this molecule are identical to those in butadiene, 
which C-C bond would you expect to become longer and which shorter as a result 
of this transition? Calculate the expected bond length changes using butadiene 
data, using either set of k, s values in the caption of Fig. 8-16. (Observed: 
Δ(CH₂=CH) = 0.06 Å, Δ(CH—CH) = −0.04 Å.) 




8-13. ESR coupling constants are shown in Table P8-13 for six hydrocarbon anion radicals. 
Use HMO tabulations in the literature (or a computer) to obtain π-electron 
MO coefficients for these systems. Construct a plot of coupling constant a_H 
versus c_ri², where i is the MO containing the unpaired electron (a_H values are 
in gauss). The numbered positions in Table P8-13 refer to hydrogen atoms. 


TABLE P8-13 » 


8-14. Polarographic half-wave potentials for oxidation and reduction of aromatic 
hydrocarbons are given in Table P8-14. 


a) Make separate plots of these data against energy (in units of β) of the highest 
occupied and lowest empty MO respectively. (Use tabulations or a computer 
program.) 

b) Now plot reduction versus oxidation half-wave potential for this series. 
Explain adherence to or deviation from linearity. 


TABLE P8-14. (Structure drawings not reproduced.) 

                             Reduction half-wave      Oxidation half-wave
                             potential in             potential in
Compound                     2-methoxyethanol (V)     acetonitrile (V)
Tetracene                    1.135                    0.54
1,2-Benzpyrene               1.36                     0.76
Anthracene                   1.46                     0.84
Pyrene                       1.61                     0.86
1,2-Benzanthracene           1.53                     0.92
1,2,5,6-Dibenzanthracene     1.545                    1.00
Phenanthrene                 1.935                    1.23
Fluoranthene                 1.345                    1.18
Naphthalene                  1.98                     1.31
Biphenyl                     2.075                    1.48


8-15. Use tabulated or computer-generated HMO data for neutral azulene (XXVII) to 
answer the following questions. (Tabulated data may be found in Appendix 6.) 

[Structure (XXVII): azulene.]

a) What values would you expect for oxidation and reduction half-wave potentials 
for this molecule under conditions described in Problem 8-14? 

b) If an electron were removed from the highest occupied MO to produce an 
ion, which bonds would you expect to lengthen, which to shorten? 


8-16. Use the effective bond energies of Table 8-2 to calculate the expected π energy 
for (XXVIII). Compare this with the HMO energy of 18α + 21.906β. 

[Structure (XXVIII) not reproduced.]


8-17. Obtain the HMO data for naphthalene (XXIX) and perylene (XXX): 

[Structures (XXIX) and (XXX) not reproduced.]

a) For each molecule, compare E_π to the energy predicted by use of Table 8-2. 
Categorize each molecule as aromatic, nonaromatic or antiaromatic. 

b) Compare the RE for these two molecules. Does the central ring in perylene 
appear to be contributing? 

c) Draw formal bond structures for perylene. What can you conclude about the 
two bonds connecting naphthalene units in perylene? 

d) Use HMO bond orders to calculate a predicted length for these two bonds. 
How do they compare with the observed 1.471 Å value? Is this observed 
length consistent with your conclusion of part (c)? 


8-18. The third, fourth, and fifth molecules in Fig. 8-24 have dipole moments. Assuming 
that the individual rings attract or repel charge in accordance with our expectations 
from the 4n + 2 rule, predict the direction of the π-electronic contribution 
to the dipoles. (Dipoles are defined by chemists as being directed from positive 
toward negative ends of electric dipoles. Physicists use the opposite convention.) 
How would you expect the π-electron densities to vary in these molecules? 
Compare your expectations with tabulated densities. 


8-19. How many π electrons are there in each of the following neutral molecules? 
Assume that each one is planar. 

[Structures (a)-(d) not reproduced.]


8-20. Use the parameters in Table 8-3 to construct the HMO determinant for molecule 
(XXXI). Use the conjugative model for the methyl group. 

[Structure (XXXI) not reproduced.]


8-21. Substitution of a nitrogen for a carbon in benzene changes the HMO energy 
levels, but not drastically. Hence the stability of the six-π-electron molecule 
pyridine can still be rationalized by the 4n + 2 rule. Which member of each of 
the following pairs of molecules would you expect to be stable on the basis of 
such arguments? 

[Structures not reproduced.]


8-22. It is observed that many even alternant hydrocarbons tend to undergo nucleophilic 
substitution, electrophilic substitution, and radical addition at the same site(s). 
Rationalize this behavior in terms of the following indices (where appropriate): 
q_r, HOMO, LUMO, L_r. 


8-23. Can π_{r,r} be a successful index for nucleophilic and electrophilic substitution if 
these are observed to occur at different sites? 


8-24. For the methylene cyclopropene system C₄H₄ (see Appendix 6 for HMO data 
and atomic numbering scheme): 

a) Calculate the free valences for the neutral molecule. 

b) Decide, using three appropriate reactivity indices, which site is most susceptible 
to electrophilic attack. 

c) Decide which protons would lead to hyperfine splitting of the ESR spectrum 
of the radical anion, according to the simple Hückel approach. 

d) Decide whether the second-lowest MO is net bonding or net antibonding. 
Why? 


8-25. Published data for free radical (CCl₃) addition to hydrocarbons are shown in 
Table P8-25; see Kooyman and Farenhorst [32]. Assuming planarity in each 
case, use standard HMO tabulations or computer programs to obtain F_r values for 
each molecule. Choose the largest value for each molecule and plot this against 
the log of the modified rate constant. In each case, modify the rate constant by 
dividing k by the number of sites on the molecule having the maximum F_r value. 


TABLE P8-25. 


Molecule Structure Rate constant 

Benzene CJ «107? (use this limit) 
Biphenyl Co» 2.7 x 10? 

Triphenylene oat <4 x 10~? (use this limit) 
Phenanthrene Sar 1.6 x 107? 

Naphthalene CX 4 x 107? 


Chrysene Affe 6.7 x 107? 
Pyrene ae 1.3 
Stilbene ara", 10 
1,2,5,6-Dibenzanthracene cor 3.7 

ao 
Styrene 12.5 
Anthracene eee 22 
Benzanthracene seer 30 
3,4-Benzopyrene b 70 
Naphthacene CLO 102 

(If a molecule has F_r values differing by 0.002 or less, treat them as equal.) If 
any data points deviate greatly from the general trend, try to give an explanation. 


8-26. Which ring of the fourth molecule in Fig. 8-24 would you predict an electrophilic 
reagent would be more likely to attack, and why? 


8-27.* Which of the following neutral unsaturated planar hydrocarbons should experience 
greater changes in bond order when a π electron is added to form the 
anion, and why? 


(a) (b) 


8-28.* Consider an ESR experiment on the odd-alternant radical shown. Identify the 
site(s) that have attached hydrogen atoms that should give 


a) the largest coupling constant. 
b) the second-largest coupling constant. 
c) no coupling constant. 


8-29.* Which of the following two seven-π-electron systems should be easier to ionize, 
and why? 


(a) (b) 


Multiple Choice Questions 


(Try to answer the following questions from sketches that you generate without refer- 
ence to the text.) 


1. According to simple HMO theory, which one of the following statements about 
butadiene is true? 


a) It has a nonbonding MO. 

b) Exciting an electron from the HOMO to the LUMO will not change the π bond 
orders. 

c) Exciting an electron from the HOMO to the LUMO will not change the π charge 
densities. 

d) Ionizing an electron from the HOMO results in all π charge densities becoming 
equal to 0.75. 

e) It is most likely to undergo electrophilic substitution at one of the two inner 
carbons. 


*This problem assumes prior study of Appendix 5. 


2. Which of the following is a prediction that would result from a simple HMO treatment 
of the butadienyl cation, C₄H₆⁺? 


a) The ESR coupling constant is larger for hydrogens attached to the two inner 
carbons. 

b) The central C-C bond has a higher π-bond order than it has in the neutral 
molecule. 

c) The positive charge resides mostly on the two central carbons. 

d) There are only three π energy levels. 

e) The MO coefficients on the central pair of carbons are larger in all of the π MOs 
than they are in the neutral molecule. 


Chapter 9 


Matrix Formulation of the 
Linear Variation Method 


9-1 Introduction


In Chapter 7 we developed a method for performing linear variational calculations. The 
method requires solving a determinantal equation for its roots, and then solving a set of 
simultaneous homogeneous equations for coefficients. This procedure is not the most 
efficient for programmed solution by computer. In this chapter we describe the matrix 
formulation for the linear variation procedure. Not only is this the basis for many 
quantum-chemical computer programs, but it also provides a convenient framework 
for formulating the various quantum-chemical methods we shall encounter in future 
chapters. 

Vectors and matrices may be defined in a formal, algebraic way, but they also 
may be given geometric interpretations. The formal definitions and rules suffice for 
quantum-chemical purposes. However, the terminology of matrix algebra is closely 
connected with the geometric ideas that influenced early development. Furthermore, 
most chemists are more comfortable if they have a physical or geometric model to 
carry along with mathematical discussion. Therefore, we append some discussion of 
geometrical interpretation to the algebraic treatment. 


9-2 Matrices and Vectors
9-2.A Definitions 


A matrix is an ordered array of elements satisfying certain algebraic rules. We write our 
matrices with parentheses on the left and right of the array.² Unless otherwise stated, 
we restrict the elements to be numbers (which need not be real). In general, however, 
as long as the rules of matrix algebra can be observed, there is no restriction on what 
the elements may be. In expression (9-1), we have written a matrix in three ways: 


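\[
\begin{pmatrix}
1.2 & 3.8 & -4.0 \\
5.0 & 1.0 & 0.0 \\
9.1 & 0.0 & -3+4i \\
6.0 & -1.0 & -1.0
\end{pmatrix}
=
\begin{pmatrix}
c_{11} & c_{12} & c_{13} \\
c_{21} & c_{22} & c_{23} \\
c_{31} & c_{32} & c_{33} \\
c_{41} & c_{42} & c_{43}
\end{pmatrix}
= \mathsf{C} \qquad (9\text{-}1)
\]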

1 More thorough discussions of matrix algebra at a level suitable for the nonspecialist are given by Aitken [1] 
and Birkhoff and MacLane [2]. 
2 Some authors use brackets. 

On the left, the numerical elements are written explicitly. In the center they are symbolized 
by a subscripted letter. On the right the entire matrix is indicated by a single 
symbol. We will use sans serif, upper-case symbols to represent matrices. 

Individual matrix elements will be symbolized either by a subscripted lower-case 
symbol (e.g., c₁₂) or by a subscripted symbol for the matrix in parentheses (e.g., (C)₁₂). 

It is useful to recognize rows and columns in a matrix. The sample matrix given above 
has four rows and three columns, so it is said to have dimensions 4 x 3. When subscripts 
are used to denote position in a matrix, the convention is that the first subscript indicates 
the row, the second indicates the column. (The order “row-column” is important to 
remember. The mnemonic “RC,” or “Roman Catholic" is helpful.) Rows are numbered 
from top to bottom, columns from left to right. There is no limit on the dimensions for 
matrices, but we usually will be concerned in a practical way with finite-dimensional 
matrices in this book. 
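
The row-column convention carries over directly to array libraries. As a small sketch, the sample matrix can be entered in NumPy (a choice of tool made here, not one used in the text), with the numerical entries taken from expression (9-1) as reconstructed from its transpose in expression (9-2). Note that NumPy counts rows and columns from zero, so the element c₁₂ is stored at index [0, 1].

```python
import numpy as np

# The 4 x 3 sample matrix of expression (9-1); one element is complex.
C = np.array([[1.2,  3.8, -4.0],
              [5.0,  1.0,  0.0],
              [9.1,  0.0, -3 + 4j],
              [6.0, -1.0, -1.0]])

n_rows, n_cols = C.shape        # dimensions are quoted as rows x columns
c_12 = C[0, 1]                  # first subscript = row, second = column
c_33 = C[2, 2]

column_vector = C[:, [0]]       # first column kept as a 4 x 1 matrix
row_vector = C[[0], :]          # first row kept as a 1 x 3 matrix

print(n_rows, n_cols)
print(c_12, c_33)
print(column_vector.shape, row_vector.shape)
```

Keeping the column and the row as two-dimensional arrays preserves the distinction between column vectors and row vectors that the following paragraphs rely on.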

If a matrix has only one column or row, it is called a column vector or row vector, 
respectively. We will use sans serif, lower-case symbols to denote column vectors. 
Additional symbols, described shortly, will be used to denote row vectors. These two 
kinds of vector behave differently under the rules of matrix algebra, so it is important 
to avoid confusing them. 

If a matrix has only one row and one column, its behavior under the rules of matrix
algebra becomes identical to the familiar behavior of ordinary scalars (i.e., numbers),
and so a 1 × 1 matrix is simply a number.

The similarity in appearance between a matrix and a determinant may be deceptive. 
A determinant is denoted by bounding with vertical straight lines, and is equal to 
a number that can be found by reducing the determinant according to a prescribed 
procedure (see Appendix 2). For this to be possible, the determinant must be square 
(i.e., have the same number of rows as columns). A matrix is not equal to a number 
and need not be square. (However, one can take the determinant of a square matrix A. 
This number is symbolized |A| and is not the same as A without the vertical bars.) 

Two matrices are equal if all elements in corresponding positions are equal. Thus,
A = B means $a_{ij} = b_{ij}$ for all i and j.


9-2.B Complex Conjugate, Transpose, and Hermitian Adjoint of a Matrix


We define the complex conjugate of a matrix A to be the matrix A*, formed by replacing
every element of A by its complex conjugate. If A = A*, A is a real matrix. (Every
element is real.)

We define the transpose of a matrix A to be the matrix $\tilde{\mathsf{A}}$, formed by interchanging
row 1 and column 1, row 2 and column 2, etc. The transpose of the 4 × 3 matrix in
expression (9-1) is the 3 × 4 matrix given in

$$
\tilde{\mathsf{C}} =
\begin{pmatrix}
1.2 & 5.0 & 9.1 & 6.0 \\
3.8 & 1.0 & 0.0 & -1.0 \\
-4.0 & 0.0 & -3+4i & -1.0
\end{pmatrix}
\tag{9-2}
$$


If we denote some column vector as b, we can symbolize the corresponding row vector
as $\tilde{\mathsf{b}}$. Thus, the tilde symbol is one device we can use to indicate a row vector.
Transposing a square matrix corresponds to "reflecting" it through its principal
diagonal (which runs from upper left to lower right) as indicated in

$$
\mathsf{A} =
\begin{pmatrix}
1 & 2 & 3 \\
4 & 5 & 6 \\
7 & 8 & 9
\end{pmatrix},
\qquad
\tilde{\mathsf{A}} =
\begin{pmatrix}
1 & 4 & 7 \\
2 & 5 & 8 \\
3 & 6 & 9
\end{pmatrix}
\tag{9-3}
$$

If $\mathsf{A} = \tilde{\mathsf{A}}$, A is a symmetric matrix.

We define the hermitian adjoint of A to be the matrix $\mathsf{A}^\dagger$, formed by taking the
transpose of the complex conjugate of A (or the complex conjugate of the transpose).
The order of these operations is immaterial. Hence, $\mathsf{A}^\dagger = (\tilde{\mathsf{A}})^* = \widetilde{(\mathsf{A}^*)}$. If $\mathsf{A} = \mathsf{A}^\dagger$,
A is a hermitian matrix.
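The three operations just defined can be sketched in a few lines of Python. This is only an illustration: representing a matrix as a nested list of rows, the function names, and the small hermitian example are assumptions made here, not notation from the text.

```python
# Matrices are plain nested lists (a list of rows). This representation is an
# assumption of the sketch, not something prescribed by the text.

def conjugate(a):
    """A*: replace every element by its complex conjugate."""
    return [[complex(x).conjugate() for x in row] for row in a]

def transpose(a):
    """A-tilde: interchange rows and columns."""
    return [list(col) for col in zip(*a)]

def adjoint(a):
    """A-dagger: the transpose of the complex conjugate."""
    return transpose(conjugate(a))

# The 4 x 3 matrix of expression (9-1).
C = [[1.2, 3.8, -4.0],
     [5.0, 1.0, 0.0],
     [9.1, 0.0, -3 + 4j],
     [6.0, -1.0, -1.0]]

C_t = transpose(C)   # the 3 x 4 matrix of expression (9-2)

# Conjugating then transposing gives the same result as the reverse order.
order_is_immaterial = adjoint(C) == conjugate(transpose(C))

# A hypothetical 2 x 2 hermitian matrix: it equals its own adjoint.
H = [[2.0, 1 - 1j],
     [1 + 1j, 3.0]]
is_hermitian = adjoint(H) == H
```

Note that transposition leaves the complex element -3 + 4i unchanged; only the conjugate and adjoint operations alter it.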


9-2.C Addition and Multiplication of Matrices and Vectors 


Multiplication of a matrix by a scalar is equivalent to multiplying every element in the 
matrix by the scalar. Addition of two matrices is accomplished by adding elements in 
corresponding positions in the matrices. Thus, for example, 


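$$
\begin{pmatrix} 1 & 2 \\ 3 & 4 \end{pmatrix}
+ 2 \begin{pmatrix} 5 & 6 \\ 7 & 8 \end{pmatrix}
= \begin{pmatrix} 1+10 & 2+12 \\ 3+14 & 4+16 \end{pmatrix}
= \begin{pmatrix} 11 & 14 \\ 17 & 20 \end{pmatrix}
$$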


and it is evident that the operation of matrix addition is possible only within sets of 
matrices of identical dimensions. 

Matrix multiplication is a bit more involved. We start by considering multiplication 
of vectors. Two types of vector multiplication are possible. If we multiply a row vector 
on the left times a column vector on the right, we take the product of the leading element 
of each plus the product of the second element of each, plus . . ., etc., thereby obtaining 
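a scalar as a result. Hence, this is called scalar multiplication of vectors. For example,

$$
\begin{pmatrix} 1 & 2 & 3 \end{pmatrix}
\begin{pmatrix} 4 \\ 5 \\ 6 \end{pmatrix}
= 1 \times 4 + 2 \times 5 + 3 \times 6 = 32
$$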


This kind of multiplication requires an equal number of elements in the two vectors. 
It is important to retain in mind the basic operation described here: summation of 
products taken by sweeping across a row on the left and down a column on the right. 
Since there exists but one row on the left and one column on the right, we obtain one 
number as the result. 

The other possibility is to multiply a column vector on the left times a row vector on 
the right. Employing the same basic operation as above, we sweep across row 1 on the 
left and down column 1 on the right, obtaining a product that we will store in position 
(1, 1) of a matrix to keep track of its origin. The product of row 1 times column 2 gives 
us element (1, 2) and so on. In this way, we generate a whole matrix of numbers. For
example,

$$
\begin{pmatrix} 1 \\ 2 \end{pmatrix}
\begin{pmatrix} 3 & 4 & 5 \end{pmatrix}
= \begin{pmatrix} 1 \times 3 & 1 \times 4 & 1 \times 5 \\ 2 \times 3 & 2 \times 4 & 2 \times 5 \end{pmatrix}
= \begin{pmatrix} 3 & 4 & 5 \\ 6 & 8 & 10 \end{pmatrix}
$$

This is an example of matrix multiplication of vectors. Just as before, the number of
columns on the left (one) equals the number of rows on the right. Now, however, the
number of elements in the vectors may differ, and the dimensions of the matrix reflect
the dimensions of the original vectors.

The two types of vector multiplication may be symbolized as follows, using our 
notation for scalars, row vectors, column vectors, and matrices: 


$$\tilde{\mathsf{a}}\mathsf{b} = c \tag{9-4}$$
$$\mathsf{a}\tilde{\mathsf{b}} = \mathsf{C} \tag{9-5}$$
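As a small illustration of the two kinds of vector multiplication, the following Python sketch forms a scalar product and a matrix product of vectors. The function names and the sample vectors are assumptions chosen for this example.

```python
def scalar_product(row, col):
    """Row vector times column vector: sum of products of paired elements."""
    if len(row) != len(col):
        raise ValueError("the two vectors must have the same number of elements")
    return sum(r * c for r, c in zip(row, col))

def matrix_product_of_vectors(col, row):
    """Column vector times row vector: element (i, j) is col[i] * row[j]."""
    return [[c * r for r in row] for c in col]

s = scalar_product([1, 2, 3], [4, 5, 6])           # 1*4 + 2*5 + 3*6
M = matrix_product_of_vectors([1, 2], [3, 4, 5])   # a 2 x 3 matrix
```

The scalar product requires vectors of equal length, whereas the matrix product accepts vectors of different lengths and returns a matrix whose dimensions reflect them.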


Multiplying two matrices together is most simply viewed as scalar multiplying all
the rows in the left matrix by all the columns of the right matrix. Thus, in AB = C, the
element $c_{ij}$ is the (scalar) product of row i in A times column j in B. This process is
possible only when the number of columns in A equals the number of rows in B. Thus,
AB may exist as a matrix C, while BA may not exist due to a disagreement in number of
rows and columns. If A and B are both square matrices and have equal dimension, then
AB and BA both exist, but they still need not be equal. That is, matrix multiplication
is not commutative.

In a triple product of matrices, ABC, one can multiply AB first (call the result D) 
and then multiply DC to get the final result (call it F). Or, if one takes BC = E, then 
one always finds AE = F. Thus, the result is invariant to the choice between (AB)C or 
A(BC), and so matrix multiplication is associative. 
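The row-times-column rule, and the two properties just stated, can be checked with a short Python sketch. The function `matmul` and the three sample matrices are hypothetical choices made for illustration.

```python
def matmul(a, b):
    """Element (i, j) of the product is row i of a times column j of b."""
    if len(a[0]) != len(b):
        raise ValueError("columns of the left matrix must equal rows of the right")
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]

A = [[1, 2], [3, 4]]
B = [[0, 1], [1, 0]]
C = [[2, 0], [1, 3]]

AB = matmul(A, B)
BA = matmul(B, A)
commutes = AB == BA                                            # False here
associates = matmul(matmul(A, B), C) == matmul(A, matmul(B, C))  # True
```

For these matrices AB is [[2, 1], [4, 3]] while BA is [[3, 4], [1, 2]], so the products differ even though both exist.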


9-2.D Diagonal Matrices, Unit Matrices, and Inverse Matrices 


A diagonal matrix is a square matrix having zeros everywhere except on the principal
diagonal. Diagonal matrices of equal dimension commute with each other, but a
diagonal matrix does not, in general, commute with a nondiagonal matrix.

A unit matrix 1 is a special diagonal matrix. Every diagonal element has a value 
of unity. A unit matrix times any matrix (of appropriate dimension) gives that same 
matrix as product. That is, 1A = A1 =A. It follows immediately that the unit matrix 
commutes with any square matrix of the same dimension. 

We define the left inverse of a matrix A to be $\mathsf{A}^{-1}$, satisfying the matrix equation
$\mathsf{A}^{-1}\mathsf{A} = \mathsf{1}$. The right inverse is defined to satisfy $\mathsf{A}\mathsf{A}^{-1} = \mathsf{1}$.

In most of our quantum-chemical applications of matrix algebra, we will be con- 
cerned only with vectors and square matrices. For square matrices, the left and right 
inverses are identical, and so we refer simply to the inverse of the matrix. 


9-2.E Complex Conjugate, Inverse, and Transpose of a Product of Matrices


If AB = C, then C* = (AB)* = A*B*. In words, the complex conjugate of a product
of matrices is equal to the product of the complex conjugate matrices. This is
demonstrable from the observation that $(\mathsf{C})_{ij} = (\mathsf{A})_{i1}(\mathsf{B})_{1j} + (\mathsf{A})_{i2}(\mathsf{B})_{2j} + \cdots$ and
$(\mathsf{C}^*)_{ij} = (\mathsf{A}^*)_{i1}(\mathsf{B}^*)_{1j} + (\mathsf{A}^*)_{i2}(\mathsf{B}^*)_{2j} + \cdots$, and so the complex conjugate is produced by taking
the complex conjugate of every element in A and B but not changing their order of
combination.

If AB = C, then $\mathsf{C}^{-1} = (\mathsf{AB})^{-1} = \mathsf{B}^{-1}\mathsf{A}^{-1}$. In words, the inverse of a product
of matrices is equal to the product of inverses, but with the order reversed. We can
easily show that this satisfies the rules of matrix algebra:
$\mathsf{C}^{-1}\mathsf{C} = (\mathsf{AB})^{-1}\mathsf{AB} = \mathsf{B}^{-1}\mathsf{A}^{-1}\mathsf{AB} = \mathsf{B}^{-1}\mathsf{1}\mathsf{B} = \mathsf{B}^{-1}\mathsf{B} = \mathsf{1}$.
If we failed to reverse the order, we would instead
have $\mathsf{A}^{-1}\mathsf{B}^{-1}\mathsf{AB}$ and, because the matrices do not commute, we would be prevented
from carrying through the reduction to 1.

If AB = C, then $\tilde{\mathsf{C}} = \widetilde{(\mathsf{AB})} = \tilde{\mathsf{B}}\tilde{\mathsf{A}}$. The transpose of a product is the product of
transposes, again in reverse order. Since $\tilde{\mathsf{C}}$ has $c_{ij}$ and $c_{ji}$ interchanged, it follows that,
where we had row i of A times column j of B, we must now have row j of A times
column i of B. But this is the same as column j of $\tilde{\mathsf{A}}$ times row i of $\tilde{\mathsf{B}}$. To obtain row
on left and column on right for proper multiplication, we must have $\tilde{\mathsf{B}}\tilde{\mathsf{A}}$.
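The order reversal for the inverse and the transpose of a product can be confirmed numerically. The sketch below uses exact rational arithmetic so that the comparisons are not blurred by rounding; the helper names and the two sample matrices are assumptions made for this illustration.

```python
from fractions import Fraction

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]

def transpose(a):
    return [list(col) for col in zip(*a)]

def inverse_2x2(a):
    """Inverse of a 2 x 2 matrix from its determinant (exact fractions)."""
    (p, q), (r, s) = a
    det = p * s - q * r
    if det == 0:
        raise ValueError("singular matrix has no inverse")
    return [[Fraction(s) / det, Fraction(-q) / det],
            [Fraction(-r) / det, Fraction(p) / det]]

A = [[1, 2], [3, 4]]
B = [[2, 0], [1, 3]]
AB = matmul(A, B)

inv_of_product = inverse_2x2(AB)
reversed_order = matmul(inverse_2x2(B), inverse_2x2(A))   # B^-1 A^-1
same_order = matmul(inverse_2x2(A), inverse_2x2(B))       # A^-1 B^-1

transpose_rule_holds = transpose(AB) == matmul(transpose(B), transpose(A))
```

Here `inv_of_product` agrees with `reversed_order` but not with `same_order`, in line with the argument above.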


9-2.F A Geometric Model 


Consider a vector in two-dimensional space emanating from the origin of a Cartesian 
system as indicated in Fig. 9-1a. We can summarize the information contained in this 
vector (magnitude and direction) by writing down the x and y components of the vector 
terminus, (3 2) in this case. It must be understood that the first number corresponds to 
the x component and not the y, and so the vector (3 2) carries its information through 
number position as well as number value. 

If we multiply both numbers in the vector by 2, the result, (6 4), corresponds to a 
vector collinear to the original but twice as long. Therefore, multiplying a vector by a 
number results in a change of scale but no change in direction. Hence, the term "scalar" 
is often used in place of “number” in vector terminology. 

Suppose that we rotated the Cartesian axes counterclockwise through an angle θ,
maintaining them orthogonal to each other and not varying the distance scales. We
imagine our original vector to remain unrotated during this coordinate transformation.
(Equivalently, we can imagine rotating the vector clockwise by θ, keeping the axes
fixed.) We wish to know how to express our vector in the new coordinate system. The
situation is depicted in Fig. 9-1b. Inspection reveals that the new coordinates (x′ y′) are
related to the old (x y) as follows:

$$x' = x\cos\theta + y\sin\theta, \qquad y' = -x\sin\theta + y\cos\theta \tag{9-6}$$

If we make use of matrix algebra, we can express Eqs. (9-6) as a matrix equation:

$$
\begin{pmatrix} x' \\ y' \end{pmatrix}
= \begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix}
\begin{pmatrix} x \\ y \end{pmatrix}
\tag{9-7}
$$


Multiplying the two-dimensional vector (call it v) by the 2 × 2 matrix (call it R)
generates a new vector v′ that gives the coordinates of our vector in the new coordinate
system:

$$\mathsf{v}' = \mathsf{R}\mathsf{v} \tag{9-8}$$

Figure 9-1 (a) The vector (3 2). (b) The same vector and its relationship to two Cartesian axis
systems.


We have, then, a parallel between the vectors v’ and v and matrix R on the one hand, 
and the two-dimensional “geometrical” vector and rotating coordinate system on the 
other. R represents the rotation and is often referred to as a rotation matrix. 

If we were to perform the rotation in the opposite direction, the rotation matrix would
be the same except for the sin θ terms, which would reverse sign:

$$
\text{rotation of coordinates clockwise by } \theta
= \begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix}
\tag{9-9}
$$

Note that this is just $\tilde{\mathsf{R}}$, the transpose of R.
If we were to rotate counterclockwise by θ and then clockwise by θ, we should end
up with our original coordinates for the vector. Thus, we should expect

$$\tilde{\mathsf{R}}\mathsf{R}\mathsf{v} = \mathsf{v} \tag{9-10}$$
or
$$\tilde{\mathsf{R}}\mathsf{R} = \mathsf{1} \tag{9-11}$$


(Note that the order of operations is consistent with reading from right to left, just as
with differential operators. Thus, $\tilde{\mathsf{R}}\mathsf{R}$ means that first R is performed, then $\tilde{\mathsf{R}}$.) Relation
(9-11) is easily verified by explicit multiplication. Thus we see that, if we think of a
matrix as representing some coordinate transformation in geometrical space, the inverse
of the matrix represents the reverse transformation. In this particular example, the
transpose of the transformation matrix turns out to be the inverse transformation matrix.
When this is so, the matrix is said to be orthogonal. (Orthogonal transformations do not
change the angles between coordinate axes; orthogonal axes remain orthogonal, hence
the name "orthogonal.") The analogous transformation for matrices having complex
or imaginary coefficients is called a unitary transformation. A unitary matrix has its
hermitian adjoint as inverse: $\mathsf{A}^\dagger\mathsf{A} = \mathsf{1}$.

While it is easy to visualize a coordinate transformation in two or three dimensions,
it is more difficult in higher-dimensional situations. Nevertheless, the mathematics and
terminology carry forward to any desired dimension and are very useful. One may, if
one wishes, talk of vectors and coordinate transformations in hyperspace, or one can
eschew such mental constructs and simply follow the mathematical rules without a
mental model.

One must be a bit cautious about inverses of matrices. In the rotation described 
above, we have a unique way of relating each x, y point in one coordinate system to a 
point at x’, y’ in the other. The transformation does not entail any loss of information 
and can therefore be “undone.” 

Such transformations (and their matrices) are called nonsingular. A nonsingular
matrix is recognizable through the fact that its determinant must be nonzero. If we had a
transformation which, for example, caused all or some points in one coordinate system
to coalesce into a single point in the transformed system, we would lose our ability to
back-transform in a unique way. Such a singular transformation has no inverse, and
the determinant of a singular matrix equals zero.
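The geometric statements above can be tried out numerically. In the following Python sketch the rotation angle of 30 degrees, the helper names, and the projection matrix `P` used as a singular example are all assumptions chosen for illustration.

```python
import math

def rotation(theta):
    """Matrix of Eq. (9-7): coordinates after rotating the axes counterclockwise."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, s], [-s, c]]

def transpose(a):
    return [list(col) for col in zip(*a)]

def apply(m, v):
    """Matrix times column vector."""
    return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in m]

def det_2x2(m):
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

theta = math.radians(30.0)
R = rotation(theta)
v = [3.0, 2.0]                      # the vector of Fig. 9-1a

v_new = apply(R, v)                 # coordinates in the rotated axis system
v_back = apply(transpose(R), v_new) # the transpose undoes the rotation
det_R = det_2x2(R)                  # nonzero, so R is nonsingular

# A singular transformation (hypothetical example): every point is projected
# onto the x axis, so different vectors coalesce and cannot be recovered.
P = [[1.0, 0.0], [0.0, 0.0]]
det_P = det_2x2(P)
coalesce = apply(P, [3.0, 2.0]) == apply(P, [3.0, -7.0])
```

The rotation preserves the length of the vector and has determinant 1, while the projection has determinant 0 and maps distinct vectors to the same point.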


9-2.G Similarity Transformations 


A matrix product of the form $\mathsf{A}^{-1}\mathsf{H}\mathsf{A}$ is called a similarity transformation on H. If
A is orthogonal, then $\tilde{\mathsf{A}}\mathsf{H}\mathsf{A}$ is a special kind of similarity transformation, called an
orthogonal transformation. If A is unitary, then $\mathsf{A}^\dagger\mathsf{H}\mathsf{A}$ is a unitary transformation on H.
There is a physical interpretation for a similarity transformation, which will be discussed
in a later chapter. For the present, we are concerned only with the mathematical
definition of such a transformation. The important feature is that the eigenvalues, or
"latent roots," of H are preserved in such a transformation (see Problem 9-5).
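A numerical spot check of this statement is easy for a 2 × 2 matrix, whose two eigenvalues are fixed by its trace and determinant. The sketch below is a check on one example, not a proof; the matrices `H` and `T` and the helper names are hypothetical choices.

```python
from fractions import Fraction

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]

def inverse_2x2(a):
    (p, q), (r, s) = a
    det = p * s - q * r
    if det == 0:
        raise ValueError("singular matrix has no inverse")
    return [[Fraction(s) / det, Fraction(-q) / det],
            [Fraction(-r) / det, Fraction(p) / det]]

def trace(m):
    return m[0][0] + m[1][1]

def det_2x2(m):
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

H = [[2, 1], [1, 2]]    # eigenvalues 1 and 3 (sum 4, product 3)
T = [[1, 2], [3, 4]]    # any nonsingular matrix will do

H_sim = matmul(inverse_2x2(T), matmul(H, T))   # T^-1 H T
```

The transformed matrix looks quite different from H (it is not even symmetric, since this T is not orthogonal), yet its trace and determinant, and hence its eigenvalues, are unchanged.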

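In this section we have quickly presented the salient rules of matrix algebra and
hinted at their connection with geometric operations. The results are summarized in
Table 9-1 for ease of reference.

TABLE 9-1 Some Matrix Rules and Definitions for a Square Matrix A of Dimension n

$\mathsf{A} = \mathsf{B}$    Matrix equality; means $a_{ij} = b_{ij}$, $i, j = 1, n$
$\mathsf{A} + \mathsf{B} = \mathsf{C}$    Matrix addition; $c_{ij} = a_{ij} + b_{ij}$, $i, j = 1, n$
$c\mathsf{A} = \mathsf{B}$    Multiplication of A by scalar; $b_{ij} = c \cdot a_{ij}$, $i, j = 1, n$
$\mathsf{AB} = \mathsf{C}$    Matrix multiplication; $c_{ij} = \sum_{k=1}^{n} a_{ik}b_{kj}$, $i, j = 1, n$
$|\mathsf{A}|$    The determinant of the matrix A (see Appendix 2)
$\mathsf{A}^{-1}$    The inverse of A; $\mathsf{A}^{-1}\mathsf{A} = \mathsf{A}\mathsf{A}^{-1} = \mathsf{1}$.
        If $\mathsf{A}^{-1}$ exists, A is nonsingular and $|\mathsf{A}| \neq 0$.
$\mathsf{A}^*$    The complex conjugate of A; $(\mathsf{A}^*)_{ij} = a_{ij}^*$, $i, j = 1, n$.
        If $\mathsf{A}^* = \mathsf{A}$, A is real.
$\tilde{\mathsf{A}}$    The transpose of A; $(\tilde{\mathsf{A}})_{ij} = a_{ji}$ (rows and columns interchanged).
        If $\tilde{\mathsf{A}} = \mathsf{A}$, symmetric; if $\tilde{\mathsf{A}} = -\mathsf{A}$, antisymmetric;
        if $\tilde{\mathsf{A}} = \mathsf{A}^{-1}$, orthogonal.
$\mathsf{A}^\dagger$    The hermitian adjoint of A; $(\mathsf{A}^\dagger)_{ij} = a_{ji}^*$ ($\mathsf{A}^\dagger = \tilde{\mathsf{A}}^*$).
        If $\mathsf{A}^\dagger = \mathsf{A}$, hermitian. If $\mathsf{A}^\dagger = \mathsf{A}^{-1}$, unitary.
$(\mathsf{ABC})^* = \mathsf{A}^*\mathsf{B}^*\mathsf{C}^*$    Complex conjugate of product
$\widetilde{(\mathsf{ABC})} = \tilde{\mathsf{C}}\tilde{\mathsf{B}}\tilde{\mathsf{A}}$    Transpose of product
$(\mathsf{ABC})^\dagger = \mathsf{C}^\dagger\mathsf{B}^\dagger\mathsf{A}^\dagger$    Hermitian adjoint of product
$(\mathsf{ABC})^{-1} = \mathsf{C}^{-1}\mathsf{B}^{-1}\mathsf{A}^{-1}$    Inverse of product
$|\mathsf{ABC}| = |\mathsf{A}| \cdot |\mathsf{B}| \cdot |\mathsf{C}|$    Determinant of product (any order)
$\mathsf{T}^{-1}\mathsf{A}\mathsf{T}$    A similarity transformation.
        If $\mathsf{T}^{-1} = \mathsf{T}^\dagger$, this is a unitary transformation.
        If $\mathsf{T}^{-1} = \tilde{\mathsf{T}}$, this is an orthogonal transformation.

We have seen that the independent-electron approximation leads to a series of MOs
for a molecular system. If the MOs are expressed as a linear combination of n basis
functions (which are often approximations to AOs, although this is not necessary), the
variation method leads to a set of simultaneous equations: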


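$$
\begin{aligned}
(H_{11} - ES_{11})c_1 + (H_{12} - ES_{12})c_2 + \cdots + (H_{1n} - ES_{1n})c_n &= 0 \\
&\;\;\vdots \\
(H_{n1} - ES_{n1})c_1 + (H_{n2} - ES_{n2})c_2 + \cdots + (H_{nn} - ES_{nn})c_n &= 0
\end{aligned}
\tag{9-12}
$$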


All terms have been defined in Chapter 7. Given a value for E that satisfies the associated
determinantal equations, we can solve this set of simultaneous equations for ratios
between the $c_i$'s. Requiring MO normality establishes convenient numerical values for
the $c_i$'s.

A matrix equation equivalent to Eq. (9-12) is³

$$
\begin{pmatrix}
H_{11} - ES_{11} & H_{12} - ES_{12} & \cdots & H_{1n} - ES_{1n} \\
\vdots & \vdots & & \vdots \\
H_{n1} - ES_{n1} & H_{n2} - ES_{n2} & \cdots & H_{nn} - ES_{nn}
\end{pmatrix}
\begin{pmatrix} c_1 \\ c_2 \\ \vdots \\ c_n \end{pmatrix}
=
\begin{pmatrix} 0 \\ 0 \\ \vdots \\ 0 \end{pmatrix}
\tag{9-13}
$$


The matrix in Eq. (9-13) is clearly the difference between two matrices. This enables 
us to rewrite the equation in the form 


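$$
\begin{pmatrix}
H_{11} & H_{12} & \cdots & H_{1n} \\
\vdots & \vdots & & \vdots \\
H_{n1} & H_{n2} & \cdots & H_{nn}
\end{pmatrix}
\begin{pmatrix} c_1 \\ c_2 \\ \vdots \\ c_n \end{pmatrix}
= E
\begin{pmatrix}
S_{11} & S_{12} & \cdots & S_{1n} \\
\vdots & \vdots & & \vdots \\
S_{n1} & S_{n2} & \cdots & S_{nn}
\end{pmatrix}
\begin{pmatrix} c_1 \\ c_2 \\ \vdots \\ c_n \end{pmatrix}
\tag{9-14}
$$

or

$$\mathsf{H}\mathsf{c}_i = E_i\mathsf{S}\mathsf{c}_i, \qquad i = 1, 2, \ldots, n \tag{9-15}$$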


where we have introduced the subscript i to account for the fact that there are many
possible values for E and that each one has its own characteristic set of coefficients. Note
that the "eigenvector" $\mathsf{c}_i$ is a column vector and that each element in $\mathsf{c}_i$ is (effectively)
multiplied by the scalar $E_i$ according to Eq. (9-15).

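In general, there are as many MOs as there are basis functions, and so Eq. (9-15)
represents n separate matrix equations. We can continue to use matrix notation to
reduce these to a single matrix equation. We do this by stacking the n c vectors together,
side by side, to produce an n × n matrix C. The numbers E must also be combined
into an appropriate matrix form. We must be careful to do this in such a way that the
scalar $E_1$ still multiplies only $\mathsf{c}_1$ (now column 1 of C), $E_2$ multiplies only $\mathsf{c}_2$, and so
forth. This is accomplished in the following equation:

$$
\begin{pmatrix}
H_{11} & \cdots & H_{1n} \\
\vdots & & \vdots \\
H_{n1} & \cdots & H_{nn}
\end{pmatrix}
\begin{pmatrix}
c_{11} & \cdots & c_{1n} \\
\vdots & & \vdots \\
c_{n1} & \cdots & c_{nn}
\end{pmatrix}
=
\begin{pmatrix}
S_{11} & \cdots & S_{1n} \\
\vdots & & \vdots \\
S_{n1} & \cdots & S_{nn}
\end{pmatrix}
\begin{pmatrix}
c_{11} & \cdots & c_{1n} \\
\vdots & & \vdots \\
c_{n1} & \cdots & c_{nn}
\end{pmatrix}
\begin{pmatrix}
E_1 & 0 & \cdots & 0 \\
0 & E_2 & \cdots & 0 \\
\vdots & & \ddots & \vdots \\
0 & 0 & \cdots & E_n
\end{pmatrix}
\tag{9-16}
$$

or

$$\mathsf{HC} = \mathsf{SCE} \tag{9-17}$$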


The matrix E is a diagonal matrix of orbital energies (often referred to as the matrix
of eigenvalues). C is the matrix of coefficients (or matrix of eigenvectors), and each
column in C refers to a different MO. The first column refers to the MO having energy
$E_1$. In multiplying E by C from the left, each coefficient in column 1 becomes
multiplied by $E_1$. This would not occur if we multiplied E by C from the right. Therefore,
HC = SCE is correct, whereas HC = ESC is incorrect.

³ Quantum-chemical convention is to use upper case letters for individual elements of the matrices H, S, and E.
This differs from the usual convention.
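The effect of the order of multiplication is easy to see with small numbers. In this Python sketch the 2 × 2 matrices are hypothetical; `C` stands in for a coefficient matrix and `E` for a diagonal matrix of energies.

```python
def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]

C = [[1, 2],
     [3, 4]]          # columns play the role of two eigenvectors
E = [[10, 0],
     [0, 100]]        # diagonal matrix holding E_1 = 10 and E_2 = 100

CE = matmul(C, E)     # every element of column 1 times 10, column 2 times 100
EC = matmul(E, C)     # every element of ROW 1 times 10, row 2 times 100
```

With C on the left, each energy scales one whole column (one MO), which is what Eq. (9-15) requires; with E on the left, each energy scales a row instead.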


9-4 Solving the Matrix Equation


Since we know the basis functions and the effective hamiltonian (in principle, at least), 
we are in a position to evaluate the elements in H and S. How do we then find C and E? 

Let us first treat the simplified situation where our basis set of functions is
orthonormal, either by assumption or design. Then all the off-diagonal elements of S (which
correspond to overlap between different basis functions) are zero, and all the
diagonal elements are unity because of normality. In short, S = 1. Therefore, Eq. (9-17)
becomes

$$\mathsf{HC} = \mathsf{CE} \tag{9-18}$$


and our problem is, given H, find C and E. 

Now, we want a set of coefficients that correspond to normalized MOs. We have 
seen earlier that, for an orthonormal basis set, this requires each MO to have coefficients 
satisfying the equation (assuming real coefficients) 


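$$c_{1i}^2 + c_{2i}^2 + \cdots + c_{ni}^2 = 1 \tag{9-19}$$

We can write this as a vector equation

$$
\tilde{\mathsf{c}}_i\mathsf{c}_i =
\begin{pmatrix} c_{1i} & c_{2i} & \cdots & c_{ni} \end{pmatrix}
\begin{pmatrix} c_{1i} \\ c_{2i} \\ \vdots \\ c_{ni} \end{pmatrix}
= 1
\tag{9-20}
$$

Furthermore, we know that any two different MOs must be orthogonal to each other.
That is, $\tilde{\mathsf{c}}_i\mathsf{c}_j = 0$, $i \neq j$. All this may be summarized in the matrix equation

$$\tilde{\mathsf{C}}\mathsf{C} = \mathsf{1} \tag{9-21}$$

Hence, the coefficient matrix is orthogonal. In the more general case in which
coefficients may be complex, C is unitary; i.e., $\mathsf{C}^\dagger\mathsf{C} = \mathsf{1}$. Our problem then is, given H, find
a unitary matrix C such that HC = CE with E diagonal.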

We can multiply both sides of a matrix equation by the same matrix and preserve the
equality. However, because matrices do not necessarily commute, we must be careful
to carry out the multiplication from the left on both sides, or from the right on both
sides. Thus, multiplying Eq. (9-18) from the left by $\mathsf{C}^\dagger$, we obtain

$$\mathsf{C}^\dagger\mathsf{HC} = \mathsf{C}^\dagger\mathsf{CE} = \mathsf{1E} = \mathsf{E} \tag{9-22}$$


where we have used the fact that C is unitary. Now our problem may be stated as,
given H, find a unitary matrix C such that $\mathsf{C}^\dagger\mathsf{HC}$ is diagonal.⁴ Several techniques
exist for finding such a matrix C. These are generally much more suitable for machine
computation than are determinantal manipulations.

⁴ Not every matrix (not even every square matrix) can be diagonalized by a unitary transformation, but every
hermitian matrix can be so diagonalized.

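We can illustrate that the allyl radical energies and coefficients already found by the
HMO method do in fact satisfy the relations $\mathsf{C}^\dagger\mathsf{C} = \mathsf{1}$ and $\mathsf{C}^\dagger\mathsf{HC} = \mathsf{E}$. The matrix C
can be constructed from the HMO coefficients and is

$$
\mathsf{C} =
\begin{pmatrix}
\frac{1}{2} & \frac{1}{\sqrt{2}} & \frac{1}{2} \\
\frac{1}{\sqrt{2}} & 0 & -\frac{1}{\sqrt{2}} \\
\frac{1}{2} & -\frac{1}{\sqrt{2}} & \frac{1}{2}
\end{pmatrix}
\tag{9-23}
$$

Therefore

$$
\mathsf{C}^\dagger\mathsf{C} =
\begin{pmatrix}
\frac{1}{2} & \frac{1}{\sqrt{2}} & \frac{1}{2} \\
\frac{1}{\sqrt{2}} & 0 & -\frac{1}{\sqrt{2}} \\
\frac{1}{2} & -\frac{1}{\sqrt{2}} & \frac{1}{2}
\end{pmatrix}
\begin{pmatrix}
\frac{1}{2} & \frac{1}{\sqrt{2}} & \frac{1}{2} \\
\frac{1}{\sqrt{2}} & 0 & -\frac{1}{\sqrt{2}} \\
\frac{1}{2} & -\frac{1}{\sqrt{2}} & \frac{1}{2}
\end{pmatrix}
=
\begin{pmatrix}
1 & 0 & 0 \\
0 & 1 & 0 \\
0 & 0 & 1
\end{pmatrix}
\tag{9-24}
$$

The matrix H for the allyl radical is, in HMO theory,

$$
\mathsf{H} =
\begin{pmatrix}
\alpha & \beta & 0 \\
\beta & \alpha & \beta \\
0 & \beta & \alpha
\end{pmatrix}
\tag{9-25}
$$

The reader should verify that

$$
\mathsf{C}^\dagger\mathsf{HC} =
\begin{pmatrix}
\alpha + \sqrt{2}\beta & 0 & 0 \\
0 & \alpha & 0 \\
0 & 0 & \alpha - \sqrt{2}\beta
\end{pmatrix}
= \mathsf{E}
\tag{9-26}
$$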


The diagonal elements can be seen to correspond to the HMO energies. Note that the 
energy in the (1 1) position of E corresponds to the MO with coefficients appearing in 
column 1 of C, illustrating the positional correlation of eigenvalues and eigenvectors 
referred to earlier. 
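The verification suggested above can also be carried out numerically. The following Python sketch assumes the illustrative values α = 0 and β = -1 (energies in units of |β|); these numbers and the helper names are assumptions made here, not values from the text.

```python
import math

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]

def transpose(a):
    return [list(col) for col in zip(*a)]

alpha, beta = 0.0, -1.0          # hypothetical numerical values
r = 1.0 / math.sqrt(2.0)

# Allyl HMO coefficient matrix, Eq. (9-23); each column is one MO.
C = [[0.5,   r,  0.5],
     [r,   0.0,   -r],
     [0.5,  -r,  0.5]]

# Allyl HMO hamiltonian matrix, Eq. (9-25).
H = [[alpha, beta,  0.0],
     [beta,  alpha, beta],
     [0.0,   beta,  alpha]]

C_t = transpose(C)                     # coefficients are real, so adjoint = transpose
CtC = matmul(C_t, C)                   # should be the unit matrix, Eq. (9-24)
E = matmul(C_t, matmul(H, C))          # should be diagonal, Eq. (9-26)
energies = [E[i][i] for i in range(3)]
```

With these values the diagonal of `E` comes out as approximately -1.414, 0, and +1.414, that is α + √2β, α, and α - √2β, in the same order as the columns of `C`.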

If the basis functions are not orthogonal, S ≠ 1 and the procedure is slightly more
complicated. Basically, one first transforms to an orthogonal basis to obtain an equation
of the form $\mathsf{H}'\mathsf{C}' = \mathsf{C}'\mathsf{E}$. One diagonalizes H′ as indicated above to find E and C′,
where C′ is the matrix of coefficients in the orthogonalized basis. Then one back
transforms C′ into the original basis set to obtain C. There are many choices available
for the orthogonalizing transformation. The Schmidt transformation, based on the
Schmidt orthogonalization procedure described in Chapter 6, is popular because it is
very rapidly performed by a computer. Here we will simply indicate the matrix algebra
involved. Let the matrix that transforms a nonorthonormal basis to an orthonormal one
be symbolized A. This matrix satisfies the relation

$$\mathsf{A}^\dagger\mathsf{SA} = \mathsf{1} \tag{9-27}$$

Furthermore, $|\mathsf{A}| \neq 0$ and so $\mathsf{A}^{-1}$ exists. We can insert the unit matrix (in the form
$\mathsf{A}\mathsf{A}^{-1}$) wherever we please in the matrix equation HC = SCE without affecting the
equality. Thus,

$$\mathsf{HAA}^{-1}\mathsf{C} = \mathsf{SAA}^{-1}\mathsf{CE} \tag{9-28}$$

Multiplying from the left by $\mathsf{A}^\dagger$ gives

$$\mathsf{A}^\dagger\mathsf{HAA}^{-1}\mathsf{C} = \mathsf{A}^\dagger\mathsf{SAA}^{-1}\mathsf{CE} \tag{9-29}$$

By Eq. (9-27), this reduces to

$$(\mathsf{A}^\dagger\mathsf{HA})(\mathsf{A}^{-1}\mathsf{C}) = (\mathsf{A}^{-1}\mathsf{C})\mathsf{E} \tag{9-30}$$


where the parentheses serve only to make the following discussion clearer. If we define
$\mathsf{A}^\dagger\mathsf{HA}$ to be H′, and $\mathsf{A}^{-1}\mathsf{C}$ to be C′, Eq. (9-30) becomes

$$\mathsf{H}'\mathsf{C}' = \mathsf{C}'\mathsf{E} \tag{9-31}$$


Since we know the matrix H and can compute A from knowledge of S, it is possible
to write down an explicit H′ matrix for a given problem. Then, knowing H′ (which
is just the hamiltonian matrix for the problem in the orthonormal basis), we can seek
the unitary matrix C′ such that $\mathsf{C}'^\dagger\mathsf{H}'\mathsf{C}'$ is diagonal. These diagonal elements are our
orbital energies. (Note that E in Eq. (9-31) is the same as E in HC = SCE.) To find
the coefficients for the MOs in terms of the original basis (i.e., to find C), we use the
relation

$$\mathsf{AC}' = \mathsf{A}(\mathsf{A}^{-1}\mathsf{C}) = \mathsf{1C} = \mathsf{C} \tag{9-32}$$

One nice feature of this procedure is that, even though we use the inverse matrix $\mathsf{A}^{-1}$ in
our formal development, we never need to actually compute it. (A and $\mathsf{A}^\dagger$ are used to
find H′, and A is used to find C.) This is fortunate because calculating inverse matrices
is a relatively slow process.
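The whole procedure can be followed for the smallest possible case, two nonorthogonal basis functions. The Python sketch below is a minimal illustration under stated assumptions: the values of α, β, and the overlap s are hypothetical, the Schmidt matrix `A` is written out by hand for two functions, and the 2 × 2 diagonalization uses the closed-form solution rather than a general routine.

```python
import math

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]

def transpose(a):
    return [list(col) for col in zip(*a)]

# Hypothetical numbers: two normalized basis functions with overlap s.
alpha, beta, s = -1.0, -0.5, 0.25
H = [[alpha, beta], [beta, alpha]]
S = [[1.0, s], [s, 1.0]]

# Schmidt orthogonalization: keep function 1, remove its component from
# function 2, then renormalize. Columns of A hold the new functions.
norm = math.sqrt(1.0 - s * s)
A = [[1.0, -s / norm],
     [0.0, 1.0 / norm]]
A_t = transpose(A)                        # real case: adjoint = transpose
unit = matmul(A_t, matmul(S, A))          # Eq. (9-27): should be the unit matrix

# Hamiltonian matrix in the orthonormal basis, H' = A(dagger) H A.
Hp = matmul(A_t, matmul(H, A))

# Closed-form eigenvalues of the real symmetric 2 x 2 matrix H'.
mean = 0.5 * (Hp[0][0] + Hp[1][1])
half_gap = math.hypot(0.5 * (Hp[0][0] - Hp[1][1]), Hp[0][1])
energies = [mean - half_gap, mean + half_gap]

def unit_eigenvector(e):
    # (H'_12, e - H'_11) solves (H' - e1)v = 0 when H'_12 is nonzero.
    v = [Hp[0][1], e - Hp[0][0]]
    length = math.hypot(v[0], v[1])
    return [v[0] / length, v[1] / length]

Cp = transpose([unit_eigenvector(e) for e in energies])   # C': columns are eigenvectors
E = [[energies[0], 0.0], [0.0, energies[1]]]

# Back-transform to the original basis, Eq. (9-32), and test HC = SCE.
C = matmul(A, Cp)
lhs = matmul(H, C)
rhs = matmul(S, matmul(C, E))
```

For this symmetric two-function problem the energies should equal (α + β)/(1 + s) and (α - β)/(1 - s), the roots of the corresponding 2 × 2 secular determinant, and no inverse matrix is computed at any stage.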

A few more words should be said about the process of diagonalizing a hermitian
matrix H with a unitary transformation. Two methods are currently in wide use. The
older, slower method, known as the Jacobi method, requires a series of steps on the
starting matrix. In the first step, a matrix $\mathsf{O}_1$ is constructed that causes the largest
off-diagonal pair of elements of H to vanish in the transformation $\mathsf{H}_1 = \tilde{\mathsf{O}}_1\mathsf{H}\mathsf{O}_1$. Now
a second transformation matrix $\mathsf{O}_2$ is constructed to force the largest off-diagonal pair
of elements in $\mathsf{H}_1$ to vanish in the transformation $\tilde{\mathsf{O}}_2\mathsf{H}_1\mathsf{O}_2 = \tilde{\mathsf{O}}_2\tilde{\mathsf{O}}_1\mathsf{H}\mathsf{O}_1\mathsf{O}_2$. This
procedure is continued. However, since each transformation affects more elements in
the matrix than just the biggest pair, we eventually "unzero" the pair that was zeroed in
forming $\mathsf{H}_1$ or $\mathsf{H}_2$, etc. This means that many more transformations are required than
there are off-diagonal pairs. Eventually, however, the off-diagonal elements will have
been nibbled away (while the diagonal elements have been building up) until they are
all smaller in magnitude than some preselected value, and so we stop the process. The
transformation matrix C corresponds to the accumulated product $\mathsf{O}_1\mathsf{O}_2\mathsf{O}_3 \cdots$.

A more recently discovered, faster procedure is the Givens-Householder-Wilkinson
method. Here, H is first tridiagonalized, which means that all elements are made to
vanish except those on the main diagonal as well as on the codiagonals above and below
the main diagonal. This similarity transformation can be done in a few steps, each
step zeroing all the necessary elements in an entire row and column. The eigenvalues
for the tridiagonal matrix (and hence for the original matrix) may be found one at a
time as desired. If only the third lowest eigenvalue is of interest, that one alone can be
computed. This is a useful degree of freedom which results in substantial savings of
time. Once an eigenvalue is found, its corresponding eigenvector may be computed.
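The Jacobi procedure described above can be sketched compactly for a real symmetric matrix. The version below is a teaching sketch under stated assumptions: it forms each full matrix product explicitly (production codes update only the affected rows and columns), the function name, tolerance, and step limit are hypothetical, and the test matrix is the allyl HMO matrix with the illustrative values α = 0 and β = -1.

```python
import math

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]

def transpose(a):
    return [list(col) for col in zip(*a)]

def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def jacobi(h, tol=1e-12, max_steps=500):
    """Diagonalize a real symmetric matrix (n >= 2) by successive plane rotations.

    Returns the list of eigenvalues and the accumulated matrix C = O1 O2 O3 ...
    whose columns are the corresponding eigenvectors.
    """
    n = len(h)
    h = [[float(x) for x in row] for row in h]
    c = identity(n)
    for _ in range(max_steps):
        # Locate the largest off-diagonal element (upper triangle suffices).
        p, q = max(((i, j) for i in range(n) for j in range(i + 1, n)),
                   key=lambda ij: abs(h[ij[0]][ij[1]]))
        if abs(h[p][q]) < tol:
            break
        # Rotation angle that zeroes the (p, q) pair: tan(2 theta) = 2h_pq/(h_pp - h_qq).
        theta = 0.5 * math.atan2(2.0 * h[p][q], h[p][p] - h[q][q])
        o = identity(n)
        o[p][p] = o[q][q] = math.cos(theta)
        o[p][q] = -math.sin(theta)
        o[q][p] = math.sin(theta)
        h = matmul(transpose(o), matmul(h, o))   # H_k = O~ H_(k-1) O
        c = matmul(c, o)                         # accumulate the product of O's
    return [h[i][i] for i in range(n)], c

H = [[0.0, -1.0, 0.0],
     [-1.0, 0.0, -1.0],
     [0.0, -1.0, 0.0]]
energies, C = jacobi(H)
```

For this matrix the eigenvalues returned should be -√2, 0, and +√2 in some order, with column i of `C` belonging to eigenvalue i. The order in which they appear depends on the sequence of rotations.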


9-5 Summary

The steps to be performed in a matrix solution for a linear variation calculation are:

1. From the basis set, calculate the overlap matrix S.
2. From the basis set and hamiltonian operator, calculate the hamiltonian matrix H.
3. If S ≠ 1, find an orthogonalization procedure. In the Schmidt method, A is such
that $\mathsf{A}^\dagger\mathsf{SA} = \mathsf{1}$. The matrix equation may now be written in the form $\mathsf{H}'\mathsf{C}' = \mathsf{C}'\mathsf{E}$.
4. Find C′ such that $\mathsf{C}'^\dagger\mathsf{H}'\mathsf{C}'$ is a diagonal matrix. The diagonal elements are the
roots E.
5. If necessary, back transform: AC′ = C. The columns of C contain the MO
coefficients appropriate for the original basis set.


9-5.A Problems 


9-1. Evaluate the following according to the rules of matrix algebra: 


a (6 7 (1 


b) UA a b c) 
7 
—1 3 
d) $\begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix}\begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix}$

2. 5 i 4
6 2b
0 -3

g) cos θ  -sin θ
   sin θ   cos θ

9-2. If $H_{ij} = \int \chi_i^* \hat{H} \chi_j \, d\tau$ and $\hat{H}$ is hermitian, show that H is a hermitian matrix.

9-3. Let

$$
\mathsf{A} = \begin{pmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{pmatrix}, \qquad
\mathsf{B} = \begin{pmatrix} b_{11} & b_{12} \\ b_{21} & b_{22} \end{pmatrix}
$$

Show that, in general, AB ≠ BA.

9-4. Let

$$
\mathsf{A} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 3 \end{pmatrix}, \quad
\mathsf{B} = \begin{pmatrix} 4 & 0 & 0 \\ 0 & 5 & 0 \\ 0 & 0 & 6 \end{pmatrix}, \quad \text{and} \quad
\mathsf{C} = \begin{pmatrix} 1 & 0 & 1 \\ 0 & 1 & 0 \\ 1 & 0 & 1 \end{pmatrix}
$$

Show that AB = BA, but AC ≠ CA. Compare the matrix AC with CA. Do these
matrices show any simple relationship? Can you relate this to properties of A and
C mathematically?

9-5. The "latent roots" $\lambda_i$ of A are solutions to the equation $|\mathsf{A} - \lambda_i\mathsf{1}| = 0$,
$i = 1, 2, \ldots, n$, where n is the dimension of A.

a) Show that, under a similarity transformation $\mathsf{B} = \mathsf{T}^{-1}\mathsf{AT}$, the latent roots are
preserved.
b) Demonstrate that diagonalization of A via a similarity transformation produces
the latent roots as the diagonal elements.

9-6. Show that, if a matrix has any latent roots equal to zero, it has no inverse.

9-7. The trace (or spur) of a matrix is the sum of the elements on the principal diagonal.
Thus, $\operatorname{tr} \mathsf{A} = \sum_{i=1}^{n} a_{ii}$.

a) Show that the trace of a triple product of matrices is invariant under cyclic
permutation. That is, tr(ABC) = tr(CAB) = tr(BCA) but not tr(CBA).
b) Show that the trace of a matrix is invariant under a similarity transformation.

9-8. The norm of a matrix is the positive square root of the sum of the absolute squares
of all the elements. For a real matrix A,

$$
\operatorname{norm} \mathsf{A} = \left( \sum_{i,j=1}^{n} a_{ij}^2 \right)^{1/2}
= \left( \sum_i \sum_j (\mathsf{A})_{ij} (\tilde{\mathsf{A}})_{ji} \right)^{1/2}
$$

Prove that the norm of a real matrix is preserved in an orthogonal transformation
(or, you may prefer to prove that the norm of any matrix is preserved in a unitary
transformation).

9-9. Use the facts that the trace, the determinant, and the norm of a matrix are
invariant under an orthogonal transformation to find the eigenvalues of the following
matrices:

a) $\begin{pmatrix} 0 & 1 & 1 \\ 1 & 0 & 1 \\ 1 & 1 & 0 \end{pmatrix}$

b)
l1 1 1
2 2 2
1 1
0-4
l1 tL 1
2 2 2

9-10. Consider the matrix

$$\begin{pmatrix} \cos\theta & 0 \\ -\sin\theta & 0 \end{pmatrix}$$

] . ] 3

What is the effect of this transformation on 5 ? On ? Can the
transformation be uniquely reversed? (That is, for, say, θ = 0, and given a transformed
vector , can one uniquely determine the vector this was transformed from?)
Does the matrix have an inverse? Evaluate its determinant.

9-11. What are the eigenvectors for the matrix

$$\mathsf{H} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & -3 & 0 \\ 0 & 0 & -2 \end{pmatrix}$$

9-12. Show that, if A and B have "simultaneous eigenvectors" (i.e., both diagonalized
by the same similarity transformation), then A and B commute.

9-13. If HC = CE, and $\mathsf{C}^\dagger\mathsf{C} = \mathsf{1}$, then $\mathsf{C}^\dagger\mathsf{HC} = \mathsf{E}$, and we seek a unitary transformation
that diagonalizes H. If HC = SCE, and $\mathsf{C}^\dagger\mathsf{SC} = \mathsf{1}$, then $\mathsf{C}^\dagger\mathsf{HC} = \mathsf{C}^\dagger\mathsf{SCE} = \mathsf{1E}$,
and $\mathsf{C}^\dagger\mathsf{HC} = \mathsf{E}$. Since this is the same working equation as the one we
found above, why do we not proceed in the same way? Why do we bother
orthogonalizing our basis first?

9-14. We have mentioned that a matrix may be used to represent the rotation of
coordinates by some angle θ. Such a rotation is a geometric operation, so we have,
in effect, represented an operator with a matrix. It is possible to represent other
operators in a similar way. Indeed, an alternative approach to quantum mechanics
exists in which the whole formalism is based on matrices and their properties
(matrix mechanics, as opposed to wave mechanics). A particularly interesting
example is provided by the matrices constructed by Pauli to represent spin
operators and functions. It was mentioned in Chapter 5 that spin functions α and β
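satisfy rules similar to those for orbital angular momentum. Two of these are

$$\hat{S}_z\alpha = \tfrac{1}{2}\alpha, \qquad \hat{S}_z\beta = -\tfrac{1}{2}\beta$$

But it was pointed out that α and β could not be expressed in terms of spherical
harmonics. Pauli represented this operator and functions by

$$
\hat{S}_z = \frac{1}{2}\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \qquad
\alpha = \begin{pmatrix} 1 \\ 0 \end{pmatrix}, \qquad
\beta = \begin{pmatrix} 0 \\ 1 \end{pmatrix}
$$

Using these definitions, show that

$$
\int \alpha^*\beta \, d\omega = \int \beta^*\alpha \, d\omega = 0, \qquad
\int \alpha^*\alpha \, d\omega = \int \beta^*\beta \, d\omega = 1,
$$

$$\hat{S}_z\alpha = \tfrac{1}{2}\alpha, \qquad \hat{S}_z\beta = -\tfrac{1}{2}\beta$$

[Note: since α and β are essentially the Dirac delta functions in the spin
coordinate ω, the process of integration reduces here to scalar multiplication of vectors.]

Chapter 10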


The Extended Hückel Method 


10-1 The Extended Hückel Method


The extended Hückel (EH) method is much like the simple Hückel method in many
of its assumptions and limitations. However, it is of more general applicability since
it takes account of all valence electrons, σ and π, and it is of more recent vintage
because it can only be carried out on a practical basis with the aid of a computer. The
basic methods of extended Hückel calculations have been proposed at several times by
various people. We will describe the method of Hoffmann [1], which, because of its
systematic development and application, is the EHMO method in common use.

The method is described most easily by reference to an example. We will use 
methane (CH4) for this purpose. 


10-1.A Selecting Nuclear Coordinates 


The first choice we must make is the molecular geometry to be used. For methane, we
will take the H-C-H angles to be tetrahedral and C-H bond distances of 1.1 Å. We can
try altering these dimensions later.

Cartesian coordinates for the five atoms are listed in Table 10-1, and the orientation
of the nuclei in Cartesian space is indicated in Fig. 10-1. (Even though the eigenvalues
and MOs one finally obtains are independent of how CH4 is oriented in Cartesian
space,¹ it is generally a good idea to choose an orientation that causes some Cartesian
and symmetry axes to coincide. The resulting expressions for MOs in terms of AOs
are generally much simpler to sketch and interpret.)
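Coordinates of this kind follow directly from the bond length and the tetrahedral angle. The Python sketch below assumes one hydrogen on the +z axis and a second in the xz plane; the function name, the atom labels Ha through Hd, and this choice of orientation are assumptions made for illustration.

```python
import math

def methane_coordinates(bond_length=1.1):
    """Cartesian coordinates (same units as bond_length) for tetrahedral CH4.

    Carbon sits at the origin, Ha lies on the +z axis, and the other three
    hydrogens are spaced 120 degrees apart about the z axis, starting with
    Hb in the xz plane.
    """
    theta = math.acos(-1.0 / 3.0)          # tetrahedral angle, about 109.47 degrees
    r_xy = bond_length * math.sin(theta)   # distance of Hb, Hc, Hd from the z axis
    z = bond_length * math.cos(theta)      # their common z coordinate
    coords = {"C": (0.0, 0.0, 0.0), "Ha": (0.0, 0.0, bond_length)}
    for label, phi_deg in (("Hb", 0.0), ("Hc", 120.0), ("Hd", 240.0)):
        phi = math.radians(phi_deg)
        coords[label] = (r_xy * math.cos(phi), r_xy * math.sin(phi), z)
    return coords

coords = methane_coordinates(1.1)
```

With a bond length of 1.1 the three lower hydrogens come out at z ≈ -0.366667, and Hb at x ≈ 1.03709, so every H-C-H angle is the tetrahedral angle.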


¹It sometimes happens that an approximation is made that causes the solution to depend on orientation. This is
called “loss of rotational invariance.”

TABLE 10-1 » Cartesian Coordinates (in Angstroms) for Atoms of
Methane Oriented as Shown in Fig. 10-1

Atom        x            y            z
C        0.0          0.0          0.0
H_a      0.0          0.0          1.1
H_b      1.03709      0.0         -0.366667
H_c     -0.518545     0.898146    -0.366667
H_d     -0.518545    -0.898146    -0.366667


Figure 10-1 » Orientation of methane in a Cartesian axis system.


10-1.B The Basis Set


Next we must select the basis set of functions with which to express the MOs. The
extended Hückel method uses the normalized valence AOs for this purpose. For CH4,
this means a 1s AO on each hydrogen and a 2s, $2p_x$, $2p_y$, and $2p_z$ AO on carbon. The
inner-shell 1s AO on carbon is not included. The AOs are represented by Slater-type
orbitals (STOs). Except for the 1s AOs on hydrogen, the exponential parameters of the
STOs are determined from Slater’s rules (Chapter 5). Various values for the hydrogen
1s AO exponent have been suggested. These have ranged from the 1.0 given by Slater's
rules to a value of √2. We will use a value of 1.2, which is near the optimal value for
H2 (see Chapter 7). The STOs for methane are listed in Table 10-2.
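
The carbon exponent of 1.625 in Table 10-2 follows from Slater's rules. A minimal sketch of that arithmetic for an ns/np valence electron is given below; the helper name `slater_exponent` and its argument names are hypothetical, and the screening constants (0.35, 0.85, 1.00) are the standard Slater values for an ns/np group.

```python
def slater_exponent(z, n_star, same_group, n_minus_1, deeper):
    """Slater-rule exponent zeta = (Z - s) / n* for an electron in an ns/np group.

    same_group : other electrons in the same (ns, np) group, 0.35 each
    n_minus_1  : electrons in the shell with principal quantum number n - 1, 0.85 each
    deeper     : electrons in shells n - 2 and below, 1.00 each
    (The 1s group uses 0.30 instead of 0.35; that case does not arise for H.)
    """
    shielding = 0.35 * same_group + 0.85 * n_minus_1 + 1.00 * deeper
    return (z - shielding) / n_star

# Carbon 2s or 2p: three other n = 2 electrons, two 1s electrons.
zeta_c = slater_exponent(z=6, n_star=2, same_group=3, n_minus_1=2, deeper=0)
# Hydrogen 1s: no other electrons, so Slater's rules give 1.0 (the text uses 1.2).
zeta_h = slater_exponent(z=1, n_star=1, same_group=0, n_minus_1=0, deeper=0)
print(zeta_c, zeta_h)
```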


10-1.C The Overlap Matrix 


Knowing the AO functions and their relative positions enables us to calculate all of 
their overlaps. This would be a tedious process with pencil and paper. However, the 
formulas have been programmed for automatic computation, so this step is included in 
any EH computer program.² The computed overlap matrix for the methane molecule
is shown in Table 10-3. 

²Several such programs are available from Quantum Chemistry Program Exchange, Chemistry Dept., Room 204,
Indiana University, Bloomington, Indiana 47401. http://www.QCPE.Indiana.edu

TABLE 10-2 » Basis AOs for Methane

AO no.   Atom   Type    nᵃ   lᵃ   mᵃ       exp
1        C      2s      2    0    0        1.625
2        C      2p_z    2    1    0        1.625
3        C      2p_x    2    1    (1)ᵇ     1.625
4        C      2p_y    2    1    (1)ᵇ     1.625
5        H_a    1s      1    0    0        1.200
6        H_b    1s      1    0    0        1.200
7        H_c    1s      1    0    0        1.200
8        H_d    1s      1    0    0        1.200

ᵃ n, l, m are the quantum numbers described in Chapter 4.
ᵇ 2p_x and 2p_y are formed from linear combinations of m = +1 and m = -1 STOs, and
neither of these AOs can be associated with a particular value of m.


TABLE 10-3 » Overlap Matrix for STOs of Table 10-2

        1        2        3        4        5        6        7        8
1    1.0000   0.0      0.0      0.0      0.5133   0.5133   0.5133   0.5133
2    0.0      1.0000   0.0      0.0      0.4855  -0.1618  -0.1618  -0.1618
3    0.0      0.0      1.0000   0.0      0.0      0.4577  -0.2289  -0.2289
4    0.0      0.0      0.0      1.0000   0.0      0.0      0.3964  -0.3964
5    0.5133   0.4855   0.0      0.0      1.0000   0.1805   0.1805   0.1805
6    0.5133  -0.1618   0.4577   0.0      0.1805   1.0000   0.1805   0.1805
7    0.5133  -0.1618  -0.2289   0.3964   0.1805   0.1805   1.0000   0.1805
8    0.5133  -0.1618  -0.2289  -0.3964   0.1805   0.1805   0.1805   1.0000


This matrix is symmetric (since the overlap between two AOs is independent of their
numbering order) and has diagonal elements of unity since the AOs are normalized.
The zero values in the first four rows and columns reflect the orthogonality between the
s and all p AOs on carbon. Other zero values result when hydrogen 1s AOs are centered
in nodal planes of carbon p AOs. The geometry of the system is clearly reflected in the
overlap matrix. For instance, the overlap of the $2p_z$ AO of carbon with the hydrogen
1s AO at $H_a$ is large and positive, while its overlaps with AOs on $H_b$, $H_c$, and $H_d$ are
negative, equal, and of smaller magnitude. The 2s AO of carbon, on the other hand,
overlaps all 1s AOs equally. Also, the overlap between every pair of hydrogen 1s AOs
is the same. Features such as these provide a useful check on the correctness of our
initial Cartesian coordinates.
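
One way to carry out such a check is to note that the overlap of a carbon 2p AO with a hydrogen 1s AO equals the overlap for a 2p AO pointing straight at the hydrogen, multiplied by the direction cosine between the p axis and the C-H vector. The sketch below, assuming NumPy, takes the head-on value 0.4855 from Table 10-3 and the coordinates from Table 10-1 and regenerates rows 2 to 4 of the C-H block; the variable names are illustrative only.

```python
import numpy as np

# Head-on overlap of a carbon 2p STO with a hydrogen 1s STO at 1.1 angstroms,
# read from Table 10-3 (the 2pz / H_a entry; H_a lies on the z axis).
S_SIGMA = 0.4855

# Hydrogen positions from Table 10-1 (carbon is at the origin).
h_xyz = np.array([
    [0.0, 0.0, 1.1],
    [1.03709, 0.0, -0.366667],
    [-0.518545, 0.898146, -0.366667],
    [-0.518545, -0.898146, -0.366667],
])
unit = h_xyz / np.linalg.norm(h_xyz, axis=1, keepdims=True)

# Rows: 2pz, 2px, 2py (the AO order of Table 10-2); columns: H_a, H_b, H_c, H_d.
p_h_overlap = S_SIGMA * unit[:, [2, 0, 1]].T
print(np.round(p_h_overlap, 4))
```

A mistyped coordinate would show up immediately as a mismatch between this block and the corresponding entries of the computed overlap matrix.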


10-1.D The Hamiltonian Matrix 


We have the overlap matrix S. Next we must find the hamiltonian matrix H. Then we
will be in a position to solve the equation HC = SCE for C and E. (See Chapter 9.)
The matrix H is calculated from a very approximate but simple recipe. The basic ideas
are similar in spirit to those described in connection with the interpretations of α and
β in the simple Hückel method. The energy integral $H_{ii}$ in the EH method is taken to
be equal to the energy of an electron in the ith AO of the isolated atom in the appropriate
state. The various ionization energies of atoms are known,³ so this presents no
great difficulty. However, one special problem must be dealt with, namely, finding the
appropriate state.

³See Moore [2].

In the isolated carbon atom, the lowest-energy states are associated
with the configuration $1s^2 2s^2 2p^2$. In a saturated molecule such as methane, however,
carbon shares electrons with four hydrogens, and calculations indicate that the 2s and
all three 2p AOs are about equally involved in forming occupied MOs. That is, in the
molecule, carbon behaves as though it were in the $2s2p^3$ configuration. This configuration
(shortened to $sp^3$) is referred to as the valence state of carbon in this molecule.
Since this is an “open-shell” configuration (i.e., not all the electrons are spin-paired
in filled orbitals), there are several actual physical states (corresponding to different
spin and orbital angular momenta of an isolated carbon atom) that are associated with
this configuration. Thus, there are several real physical states, with different ionization
energies, associated with our mentally constructed $sp^3$ valence state for the atom in a
molecule. The question is, what real ionization energies should we use to evaluate our
valence state ionization energy (VSIE)? The approach that is used is simply to average
the real IEs for loss of a 2p or a 2s electron, the average being taken over all states
associated with the $sp^3$ configuration. Various authors recommend slightly different
sets of VSIEs.⁴ We use here the values tabulated by Pople and Segal [7]. Because of
the rather crude nature of the EH method, the slight variations in VSIE resulting from
different choices are of little consequence.⁵ For methane, we have


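$$(\mathrm{C}_{2s}):\quad H_{11} = -19.44\ \mathrm{eV} = -0.7144\ \mathrm{a.u.} \tag{10-1}$$
$$(\mathrm{C}_{2p}):\quad H_{22} = H_{33} = H_{44} = -10.67\ \mathrm{eV} = -0.3921\ \mathrm{a.u.} \tag{10-2}$$
$$(\mathrm{H}_{1s}):\quad H_{55} = H_{66} = H_{77} = H_{88} = -13.60\ \mathrm{eV} = -0.50000\ \mathrm{a.u.} \tag{10-3}$$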


The off-diagonal elements of H are evaluated according to⁶

$$H_{ij} = K S_{ij}\,\frac{H_{ii} + H_{jj}}{2} \tag{10-4}$$

where K is an adjustable parameter. The rationalization for such an expression is that 
the energy of interaction should be greater when the overlap between AOs is greater, 
and that an overlap interaction energy between low-energy AOs should be lower than 
that produced by an equal amount of overlap between higher-energy AOs. We will 
discuss the energy versus overlap relation in more detail in a later section. The value 
of K suggested by Hoffmann [1] is 1.75. The reasons for choosing this value will be 
discussed shortly. For now, we accept this value and arrive at the hamiltonian matrix 
given in Table 10-4. 
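
The recipe of Eqs. (10-1) to (10-4) is short enough to write out. The following is a minimal sketch, assuming NumPy; the overlaps are typed in from Table 10-3, and the function name `wolfsberg_helmholz` is a hypothetical label rather than a routine from any particular EH program.

```python
import numpy as np

# Carbon-hydrogen block of Table 10-3.
# Rows: 2s, 2pz, 2px, 2py on carbon; columns: 1s on H_a, H_b, H_c, H_d.
S_CH = np.array([
    [0.5133, 0.5133, 0.5133, 0.5133],
    [0.4855, -0.1618, -0.1618, -0.1618],
    [0.0, 0.4577, -0.2289, -0.2289],
    [0.0, 0.0, 0.3964, -0.3964],
])
S_HH = np.full((4, 4), 0.1805)      # every H-H pair has the same overlap
np.fill_diagonal(S_HH, 1.0)
S = np.block([[np.eye(4), S_CH], [S_CH.T, S_HH]])

# Diagonal elements in a.u., Eqs. (10-1) to (10-3).
h_diag = np.array([-0.7144] + [-0.3921] * 3 + [-0.5000] * 4)

def wolfsberg_helmholz(S, h_diag, K=1.75):
    """H_ij = K * S_ij * (H_ii + H_jj) / 2 off the diagonal, Eq. (10-4)."""
    H = K * S * (h_diag[:, None] + h_diag[None, :]) / 2.0
    np.fill_diagonal(H, h_diag)     # the diagonal keeps the VSIE values
    return H

H = wolfsberg_helmholz(S, h_diag)
print(np.round(H, 4))
```

Because the off-diagonal elements are proportional to the overlaps, every zero of S reappears as a zero of H, and the sign pattern of H is the reverse of that of S.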

⁴See Skinner and Pritchard [3], Hinze and Jaffé [4], Basch et al. [5], Anno [6], and Pople and Segal [7].

⁵The proper valence state for carbon in methane differs from that in ethylene, which in turn differs from that
in acetylene. Generally, this is ignored in EHMO calculations and a compromise set of VSIEs is selected for use
over the whole range of molecules.

⁶This formula is often called the Wolfsberg-Helmholz relation.

TABLE 10-4 » The Extended Hückel Hamiltonian Matrix for CH4ᵃ

        1         2         3         4         5         6         7         8
1   -0.7144    0.0       0.0       0.0      -0.5454   -0.5454   -0.5454   -0.5454
2    0.0      -0.3921    0.0       0.0      -0.3790    0.1263    0.1263    0.1263
3    0.0       0.0      -0.3921    0.0       0.0      -0.3573    0.1787    0.1787
4    0.0       0.0       0.0      -0.3921    0.0       0.0      -0.3094    0.3094
5   -0.5454   -0.3790    0.0       0.0      -0.5000   -0.1579   -0.1579   -0.1579
6   -0.5454    0.1263   -0.3573    0.0      -0.1579   -0.5000   -0.1579   -0.1579
7   -0.5454    0.1263    0.1787   -0.3094   -0.1579   -0.1579   -0.5000   -0.1579
8   -0.5454    0.1263    0.1787    0.3094   -0.1579   -0.1579   -0.1579   -0.5000

ᵃAll energies in a.u.


By examining H we can guess in advance some of the qualitative features of the MOs
that will be produced. For instance, the value of $H_{25}$ (-0.3790 a.u.) indicates a strong
energy-lowering interaction between the $2p_z$ AO and the 1s AO on $H_a$. This interaction
refers to AOs with positive and negative lobes as they are assigned in the basis set.
Therefore, we expect a low-energy (bonding) MO to occur where these AOs are mixed
with coefficients of the same sign so as not to affect this AO sign relation. A high-energy
MO should also exist where the mixing occurs through coefficients of opposite sign,
producing an antibonding interaction. The values of $H_{26}$, $H_{27}$, and $H_{28}$ are positive,
due to negative overlap in corresponding positions of S. In this case, energy lowering
will be associated with mixing $2p_z$ with 1s AOs on hydrogens b, c, and d, but now
the mixing coefficients will have signs that reverse the AO sign relations from those
pertaining in the original basis. The AOs that are orthogonal have zero interaction, and
so mixing between such AOs will not affect MO energies. (When such AOs are mixed
in the same MO, it is often the result of arbitrary mixing between degenerate MOs. In
such cases, one can find an orthogonal pair of MOs such that two noninteracting AOs
do not appear in the same MO. Methane will be seen to provide an example of this.)


10-1.E The Eigenvalues and Eigenvectors 


Having H and S, we now can use the appropriate matrix-handling programs to solve 
HC = SCE for the matrix eigenvalues on the diagonal of E and the coefficients for the 
MOs, which are given by the columns of C. The eigenvalues for methane, together 
with their occupation numbers, are given in Table 10-5. The corresponding coefficients 
are given in Table 10-6. 
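
The text does not say which matrix routine is used. One common route, sketched below under the assumption that NumPy is available, is symmetric (Löwdin) orthogonalization: form X = S^(-1/2), diagonalize XHX, and back-transform the eigenvectors. The matrices are rebuilt here from the rounded entries of Table 10-3 and Eqs. (10-1) to (10-4), so the eigenvalues agree with Table 10-5 only to about three decimal places.

```python
import numpy as np

# Overlap matrix of Table 10-3 (AO order: 2s, 2pz, 2px, 2py, 1s_a, 1s_b, 1s_c, 1s_d).
S_CH = np.array([
    [0.5133, 0.5133, 0.5133, 0.5133],
    [0.4855, -0.1618, -0.1618, -0.1618],
    [0.0, 0.4577, -0.2289, -0.2289],
    [0.0, 0.0, 0.3964, -0.3964],
])
S_HH = np.full((4, 4), 0.1805)
np.fill_diagonal(S_HH, 1.0)
S = np.block([[np.eye(4), S_CH], [S_CH.T, S_HH]])

# Hamiltonian from Eqs. (10-1) to (10-4) with K = 1.75.
h_diag = np.array([-0.7144] + [-0.3921] * 3 + [-0.5000] * 4)
H = 1.75 * S * (h_diag[:, None] + h_diag[None, :]) / 2.0
np.fill_diagonal(H, h_diag)

# Loewdin orthogonalization: X = S^(-1/2) turns HC = SCE into an ordinary
# symmetric eigenvalue problem (X H X) C' = C' E, with C = X C'.
s_val, s_vec = np.linalg.eigh(S)
X = s_vec @ np.diag(s_val ** -0.5) @ s_vec.T
E, C_prime = np.linalg.eigh(X @ H @ X)
C = X @ C_prime                      # columns are MO coefficients, C^T S C = 1

occupations = [2, 2, 2, 2, 0, 0, 0, 0]   # eight valence electrons, lowest MOs first
for i, (energy, n) in enumerate(zip(E, occupations), start=1):
    print(f"MO {i}: E = {energy:8.4f} a.u.   occupation {n}")
```

Within each triply degenerate level the columns returned by the eigensolver are an arbitrary orthonormal mixture, so individual coefficients need not match Table 10-6 even though the energies do.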


TABLE 10-5 » Energies for Methane by the Extended Hückel Method

MO no.   Energy (a.u.)   Occ. no.
8           1.1904          0
7           0.2068          0
6           0.2068          0
5           0.2068          0
4          -0.5487          2
3          -0.5487          2
2          -0.5487          2
1          -0.8519          2


TABLE 10-6 » Coefficients Defining MOs for Methane

                                        MO number
AO            1         2         3         4         5         6         7         8
1 (2s)      0.5842    0.0       0.0       0.0       0.0       0.0       0.0       1.6795
2 (2p_z)    0.0       0.5313   -0.0021   -0.0007   -0.0112   -0.0137    1.1573    0.0
3 (2p_x)    0.0       0.0021    0.5313   -0.0021    1.1573   -0.0178    0.0110    0.0
4 (2p_y)    0.0       0.0007    0.0021    0.5313    0.0176    1.1572    0.0139    0.0
5 (1s_a)    0.1858    0.5547   -0.0022   -0.0007    0.0105    0.0128   -1.0846   -0.6916
6 (1s_b)    0.1858   -0.1828    0.5237   -0.0019   -1.0260    0.0114    0.3518   -0.6916
7 (1s_c)    0.1858   -0.1853   -0.2589    0.4542    0.4943   -0.8977    0.3558   -0.6916
8 (1s_d)    0.1858   -0.1865   -0.2626   -0.4516    0.5213    0.8734    0.3770   -0.6916


Only two of the eight MOs are nondegenerate. These two MOs must be symmetric
or antisymmetric for every symmetry operation of the molecule. This is easily
checked by sketching the MOs, referring to the coefficients in the appropriate columns
of Table 10-6. The lowest nondegenerate energy occurs in position 1 of our eigenvalue
list (Table 10-5), and so the coefficients for this MO are to be found in column 1 of
Table 10-6. This column indicates that the MO $\phi_1$ is equal to $0.5842\,2s + 0.1858\,1s_a +
0.1858\,1s_b + 0.1858\,1s_c + 0.1858\,1s_d$, where the symbols $2s$, $1s_a$, etc., stand for AOs
on carbon, $H_a$, etc. A sketch of this MO appears in Fig. 10-2. It is obviously symmetric
for all rotations and reflections of a tetrahedron. Notice that this MO is bonding in all
four C-H bond regions since the 2s STO on carbon is in phase agreement with all the
hydrogen 1s AOs. The higher-energy, nondegenerate MO $\phi_8$ is qualitatively similar to
$\phi_1$ except that the signs are reversed on the 1s AOs (see Table 10-6). Hence, this MO
has the same symmetry properties as $\phi_1$, but is antibonding in the C-H regions.


Figure 10-2 » A drawing of the lowest-energy nondegenerate EHMO for methane. The AOs are
drawn as though they do not overlap. This is done only to make the drawing simpler. Actually, the
AOs overlap strongly.

Since these two MOs resemble the 2s STO in being symmetric for all the symmetry
operations of a tetrahedron, we will refer to them as s-type MOs.

The remaining six MOs are grouped into two energy levels, each level being triply
degenerate. (It is possible to predict from symmetry considerations alone that the energy
levels resulting from this calculation will be nondegenerate and triply degenerate. This
is discussed in a later chapter.) Because they are degenerate, the MOs cannot be
expected to show all the symmetry of the molecule, but it should be possible for them to
show some symmetry. Consider $\phi_2$, as given by column 2 of Table 10-6. This is mainly
constructed from the $2p_z$ AO on carbon and 1s AOs on the four hydrogens. Small
contributions from $2p_x$ and $2p_y$ are also present, however. It would be nice to remove
these small contributions and “clean up” the MO. We can do this, as mentioned earlier,
by mixing $\phi_2$ with appropriate amounts of $\phi_3$ and $\phi_4$ since these are all degenerate.
The equations that our cleaned-up MO, $\phi_2'$, must satisfy are

$$\phi_2' = d_2\phi_2 + d_3\phi_3 + d_4\phi_4 \tag{10-5}$$

where (in order to cause all $2p_x$ and $2p_y$ contributions to vanish)

$$\begin{aligned} 0.0021\,d_2 + 0.5313\,d_3 - 0.0021\,d_4 &= 0.0 \\ 0.0007\,d_2 + 0.0021\,d_3 + 0.5313\,d_4 &= 0.0 \\ d_2^2 + d_3^2 + d_4^2 &= 1 \end{aligned} \tag{10-6}$$

As a result,

$$\phi_2' = 0.9999\,\phi_2 - 0.0040\,\phi_3 - 0.0013\,\phi_4 \tag{10-7}$$

A similar procedure to produce an orbital with no $p_z$ or $p_y$ contribution ($\phi_3'$), and one
with no $p_z$ or $p_x$ contribution ($\phi_4'$) gives

$$\phi_3' = 0.9999\,\phi_3 + 0.0040\,\phi_2 - 0.0040\,\phi_4 \tag{10-8}$$
$$\phi_4' = 0.9999\,\phi_4 + 0.0013\,\phi_2 + 0.0039\,\phi_3 \tag{10-9}$$


The coefficients for these MOs appear in Table 10-7. 
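
Equations (10-6) ask for a unit vector orthogonal to two given rows, which in three dimensions is simply a normalized cross product. The sketch below, assuming NumPy, applies this to the $2p_x$ and $2p_y$ rows of Table 10-6 for MOs 2, 3, and 4; the variable names are illustrative.

```python
import numpy as np

# Coefficients of phi_2, phi_3, phi_4 on the carbon 2px and 2py AOs (Table 10-6).
row_2px = np.array([0.0021, 0.5313, -0.0021])
row_2py = np.array([0.0007, 0.0021, 0.5313])

# A vector orthogonal to both rows satisfies the two homogeneous equations;
# dividing by its length enforces d2^2 + d3^2 + d4^2 = 1.
d = np.cross(row_2px, row_2py)
d /= np.linalg.norm(d)
if d[0] < 0:            # choose the overall sign so that phi_2' resembles phi_2
    d = -d

print("d2, d3, d4 =", np.round(d, 4))
```

The same three lines, with the $2p_z$ row substituted for one of the two rows, give the mixing coefficients of Eqs. (10-8) and (10-9).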


TABLE 10-7 » Coefficients for MOs $\phi_2'$, $\phi_3'$, $\phi_4'$

AO        φ2′        φ3′        φ4′
2s       0.0        0.0        0.0
2p_z     0.5313     0.0        0.0
2p_x     0.0        0.5313     0.0
2p_y     0.0        0.0        0.5313
1s_a     0.5547     0.0        0.0
1s_b    -0.1849     0.5228     0.0
1s_c    -0.1849    -0.2614     0.4529
1s_d    -0.1849    -0.2614    -0.4529


Figure 10-3 » The three lowest-energy degenerate MOs of methane (panels $\phi_2'$, $\phi_3'$, $\phi_4'$).


There is no fundamental change produced by intermixing degenerate MOs in this
way. The total electronic density and the orbital energies are uninfluenced. The only
advantage is that the cleaned-up MOs are easier to sketch and visualize. The MOs $\phi_2'$,
$\phi_3'$, and $\phi_4'$ are sketched in Fig. 10-3.

Each of the MOs in Fig. 10-3 is symmetric or antisymmetric for some of the operations
that apply to a tetrahedron. $\phi_2'$ is symmetric for rotations about the z axis by $2\pi/3$,
and also for reflection through the xz plane. This same reflection plane is a symmetry
plane for $\phi_3'$ and $\phi_4'$, but neither of these MOs shows symmetry or antisymmetry for
rotation about the z axis. Each MO contains one p AO and, perforce, has the symmetry
of that AO. We shall refer to these as p-type MOs. Note that hydrogen 1s AOs lying in
the nodal plane of a p AO do not mix with that p AO in formation of MOs. This results
from zero interaction elements in H, which, in turn, results from zero overlap elements
in S. Note also that the MO $\phi_2'$ is the MO that we anticipated earlier on the basis of
inspection of the matrix H. Because of phase agreements between the hydrogen 1s
AOs and the adjacent lobes of the p AOs, $\phi_2'$, $\phi_3'$, and $\phi_4'$ are C-H bonding MOs.

A similar “cleaning up” procedure can be performed on $\phi_5$, $\phi_6$, and $\phi_7$. These turn
out to be the C-H antibonding mates to the MOs in Fig. 10-3.

The broad results of this calculation are that there are four occupied C-H bonding 
MOs, one of s type and three of p type. At higher energies are four unoccupied C-H 
antibonding MOs, again one of s type and three of p type. 

Note that the s- and the three p-type MOs fall into the same energy pattern as the s and 
p AOs of isolated carbon. Because of their highly symmetric tetrahedral geometry, the 
hydrogen atoms do not lift the degeneracy of the p AOs. There are many molecules and 
complexes in which a cluster of atoms or molecules surrounds a central atom in such 
a highly symmetric way that the degeneracies among certain AOs on the central atom 
are retained. 


10-1.F The Total Energy 


The total EH energy is taken as the sum of the one-electron energies. For methane, this
is 2 × (-0.8519) + 6 × (-0.5487), or -4.9963 a.u. There is some ambiguity as to how
this energy is to be interpreted. For instance, does it include any of the internuclear
repulsion energy? Also, what problems will arise from our neglect of inner-shell
electrons? By comparing EH total energy changes with experimental energy changes,
it has been decided⁷ that the change in EH total energy upon change of geometry is
approximately the same as the actual change in total electronic plus nuclear repulsion
energy for the system. Thus, our value of -4.9963 a.u. is not a realistic value for
the total (nonrelativistic) energy of methane (the actual value is -40.52 a.u.), but it
is meaningful when compared to EH energies for methane at other geometries. For
example, if we uniformly lengthen or shorten all the C-H bonds in methane and repeat
our EH calculation several times, we can generate an energy curve versus $R_{\mathrm{C-H}}$ for
the symmetrical stretch vibrational mode of methane. The resultant plot is given in
Fig. 10-4. The EH total energy is minimized at about $R_{\mathrm{C-H}}$ = 1 Å, reasonably close to
the experimentally observed 1.1-Å distance for the minimum total energy of methane.
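
The sum of one-electron energies can be reproduced from the occupied entries of Table 10-5. This short sketch uses only the rounded table values; the small difference from the -4.9963 a.u. quoted above presumably reflects eigenvalues carried to more digits in the original calculation.

```python
# Occupied MO energies (a.u.) and occupation numbers from Table 10-5.
orbital_energies = [-0.8519, -0.5487, -0.5487, -0.5487]
occupations = [2, 2, 2, 2]

e_total = sum(n * e for n, e in zip(occupations, orbital_energies))
print(f"EH total energy = {e_total:.4f} a.u.")
```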

The appearance of the curve in Fig. 10-4 does encourage us to equate EH total energy 
changes to changes in actual electronic-plus-nuclear-repulsion energies. As we shall 
see later, this procedure fails for some molecules (notably H2) and for methane may be 
regarded as fortuitous. 


Figure 10-4 » Total extended Hückel energy for CH4 as a function of C-H bond length.
(Axes: E in a.u. versus $R_{\mathrm{C-H}}$ in Å, plotted from 0.0 to 2.8 Å.)


⁷See Hoffmann [1].

Figure 10-5 » Staggered ethane MO energies versus K. C-H distances are 1.1 Å; C-C distance
is 1.54 Å; all angles are tetrahedral.


10-1.G Fixing the Parameter K 


We mentioned earlier that Hoffmann suggested a value of 1.75 for K. We will now 
indicate the considerations behind this suggestion. 

Hoffmann used the ethane molecule C2H6 to evaluate K. A plot of the orbital
energies of staggered ethane as a function of K is shown in Fig. 10-5. The energies are
linearly dependent on K at values of K greater than about 1.5. At lower values, the
lines curve and some crossing occurs. Hence, in order that the MO energy order not be
highly sensitive to K, its value should exceed 1.5. A plot of the amount of electronic
charge in a bond or at an atom in ethane, as calculated by the EHMO method, versus K
is shown in Fig. 10-6. We will describe the details of such calculations shortly, but for
now, we merely note that the disposition of charge in ethane becomes rather insensitive
to K at values of K greater than about 1.5. In Fig. 10-7 a plot of the EH total energy
difference between staggered and eclipsed ethanes versus K is given. This energy
difference has an experimentally determined value of 2.875 ± 0.025 kcal/mole, the
staggered form being more stable. To give reasonable agreement with this experimental
value, K should be about 1.75. Thus, the value K = 1.75 is selected because the MO
energy order and charge distribution are not sensitive to K in this region and because
this value of K gives the correct total energy change for a known physical process. We
have also seen earlier that this same value of K leads to a reasonable prediction for the
equilibrium bond length in CH4. Notice that the evaluation of K comes by matching the
EH total energy change to the total (nuclear repulsion plus electronic) observed energy
change for internal rotation in ethane. This is consistent with our earlier interpretation
of EH total energy.

Figure 10-6 » Mulliken gross population on H [N(H)] and Mulliken overlap populations in C-C and
C-H bonds [n(C-C) and n(C-H)] of ethane as calculated by the EH method with various values of K.

Figure 10-7 » Extended Hückel energy difference between staggered and eclipsed ethanes as a
function of K.


10-2 Mulliken Populations


We found that the electron densities and bond orders calculated in the simple Hückel
method were extremely useful for relating theory to observable molecular properties 
such as electron spin-resonance splittings or bond lengths. Hence, it is desirable that 
we find analogous quantities to describe the distribution of electrons in an all-valence- 
electron method like the EH method. A number of suggestions have been made. The 
one we use is due to Mulliken [8]. It is the most widely used, and, as we shall see, it 
has an especially direct and useful connection with the EH method. 

Consider a real, normalized MO, $\phi_i$, made up from two normalized AOs, $\chi_j$ and $\chi_k$:

$$\phi_i = c_{ji}\chi_j + c_{ki}\chi_k \tag{10-10}$$

We square this MO to obtain information about electronic distribution:

$$\phi_i^2 = c_{ji}^2\chi_j^2 + c_{ki}^2\chi_k^2 + 2c_{ji}c_{ki}\chi_j\chi_k \tag{10-11}$$

If we integrate Eq. (10-11) over the electronic coordinates, we obtain (since $\phi_i$, $\chi_j$, and
$\chi_k$ are normalized)

$$1 = c_{ji}^2 + c_{ki}^2 + 2c_{ji}c_{ki}S_{jk} \tag{10-12}$$

where $S_{jk}$ is the overlap integral between $\chi_j$ and $\chi_k$. Mulliken suggested that one
electron in $\phi_i$ should be considered to contribute $c_{ji}^2$ to the electron net AO population
of $\chi_j$, $c_{ki}^2$ to the population of $\chi_k$, and $2c_{ji}c_{ki}S_{jk}$ to the overlap population between
$\chi_j$ and $\chi_k$. If there are two electrons in $\phi_i$, then these populations should be doubled.
Let $q_j^i$ symbolize the net AO population of $\chi_j$ due to one electron in MO $\phi_i$, and
$p_{jk}^i$ symbolize the overlap population between $\chi_j$ and $\chi_k$ due to this same electron. The
above example leads to the following general definitions:

$$q_j^i = c_{ji}^2 \tag{10-13}$$
$$p_{jk}^i = 2c_{ji}c_{ki}S_{jk} \tag{10-14}$$

We can now sum the contributions due to all the electrons present in the model system,
obtaining a Mulliken net AO population $q_j$ for each AO $\chi_j$, and a Mulliken overlap
population $p_{jk}$ for each distinct AO pair $\chi_j$ and $\chi_k$:

$$q_j = \sum_i^{\mathrm{MOs}} n_i c_{ji}^2 \equiv \sum_i^{\mathrm{MOs}} n_i q_j^i \tag{10-15}$$
$$p_{jk} = 2\sum_i^{\mathrm{MOs}} n_i c_{ji}c_{ki}S_{jk} \equiv \sum_i^{\mathrm{MOs}} n_i p_{jk}^i \tag{10-16}$$


Notice that the sum of all the net AO and overlap populations must be equal to the total
number of electrons in the model system. (In the EH method, this is the total number
of valence electrons.) This contrasts with the situation in the simple Hückel method
where the sum of electron densities alone, exclusive of bond orders, equals the total
number of π electrons.

The Mulliken populations are useful indices of the location of electronic charge in the
molecule and its bonding or antibonding nature. The contributions to such populations
from one electron in MO $\phi_4'$ of methane are given below (data taken from Tables 10-3
and 10-7):

$$q_{2p_y}^{4'} = (0.5313)^2 = 0.2823 \tag{10-17}$$
$$q_{1s_c}^{4'} = q_{1s_d}^{4'} = (0.4529)^2 = 0.2051 \tag{10-18}$$
$$p_{2p_y\text{-}1s_c}^{4'} = p_{2p_y\text{-}1s_d}^{4'} = 2(0.5313)(0.4529)(0.3964) = 0.1908 \tag{10-19}$$
$$p_{1s_c\text{-}1s_d}^{4'} = 2(0.4529)(-0.4529)(0.1805) = -0.0740 \tag{10-20}$$

All the other populations for $\phi_4'$ are zero. These numbers show that an electron in this
MO contributes about 28% of an electron to the carbon $2p_y$ net AO population, 20-21%
to each of two hydrogen AOs, 19% to each of two C-H bonds, and a negative 7% to
the region between $H_c$ and $H_d$. The last corresponds to an antibonding interaction, as
is obvious from an examination of the sketch of $\phi_4'$ in Fig. 10-3. (A negative overlap
population is interpreted to mean that the amount of charge in the overlap region is less
than what would exist if one squared the two AOs and then combined them.)
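
As a check on these one-electron contributions, the net AO and overlap populations of a normalized MO should add up to one electron, Eq. (10-12). The following sketch, assuming NumPy, repeats the arithmetic of Eqs. (10-17) to (10-20) with the three AOs that appear in $\phi_4'$; the variable names are illustrative.

```python
import numpy as np

# AOs present in phi_4' (Table 10-7): carbon 2py, 1s on H_c, 1s on H_d.
c = np.array([0.5313, 0.4529, -0.4529])

# Matching overlaps from Table 10-3.
S = np.array([
    [1.0, 0.3964, -0.3964],
    [0.3964, 1.0, 0.1805],
    [-0.3964, 0.1805, 1.0],
])

net = c**2                               # Eq. (10-13)
overlap = 2.0 * np.outer(c, c) * S       # Eq. (10-14); only j != k is used
pairs = {(j, k): overlap[j, k] for j in range(3) for k in range(j + 1, 3)}

total = net.sum() + sum(pairs.values())  # should be one electron
print("net AO populations:", np.round(net, 4))
print("overlap populations:", {jk: round(float(p), 4) for jk, p in pairs.items()})
print("sum:", round(float(total), 4))
```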

By summing over the contributions due to all eight valence electrons, we obtain the 
Mulliken populations shown in Table 10-8. These data are part of the normal output of 
an EH computer program. The matrix is symmetric, and only the unique elements are 
tabulated. Such a matrix is named a Mulliken overlap population matrix. 
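
For the full molecule, Eqs. (10-15) and (10-16) amount to forming the matrix $P_{jk} = \sum_i n_i c_{ji} c_{ki}$ and multiplying it element by element with S. The sketch below, assuming NumPy, rebuilds S and H from the rounded table entries, solves HC = SCE by symmetric orthogonalization (one possible route, not necessarily the one used in any given EH program), and then forms the populations. Results should agree with Table 10-8 to about three decimal places.

```python
import numpy as np

# Overlap matrix of Table 10-3 (AO order: 2s, 2pz, 2px, 2py, 1s_a, 1s_b, 1s_c, 1s_d).
S_CH = np.array([
    [0.5133, 0.5133, 0.5133, 0.5133],
    [0.4855, -0.1618, -0.1618, -0.1618],
    [0.0, 0.4577, -0.2289, -0.2289],
    [0.0, 0.0, 0.3964, -0.3964],
])
S_HH = np.full((4, 4), 0.1805)
np.fill_diagonal(S_HH, 1.0)
S = np.block([[np.eye(4), S_CH], [S_CH.T, S_HH]])

# Hamiltonian from Eqs. (10-1) to (10-4) with K = 1.75.
h_diag = np.array([-0.7144] + [-0.3921] * 3 + [-0.5000] * 4)
H = 1.75 * S * (h_diag[:, None] + h_diag[None, :]) / 2.0
np.fill_diagonal(H, h_diag)

# Solve HC = SCE via X = S^(-1/2).
s_val, s_vec = np.linalg.eigh(S)
X = s_vec @ np.diag(s_val ** -0.5) @ s_vec.T
E, C = np.linalg.eigh(X @ H @ X)
C = X @ C

n_occ = np.array([2, 2, 2, 2, 0, 0, 0, 0])   # occupation numbers, lowest MO first
P = (C * n_occ) @ C.T                        # P_jk = sum_i n_i c_ji c_ki

net_ao = np.diag(P).copy()                   # Eq. (10-15), since S_jj = 1
overlap_pop = 2.0 * P * S                    # Eq. (10-16) for j != k
np.fill_diagonal(overlap_pop, 0.0)

n_electrons = net_ao.sum() + np.triu(overlap_pop, k=1).sum()
print("net AO populations:", np.round(net_ao, 4))
print("total electrons   :", round(float(n_electrons), 6))
```

Although the individual degenerate MOs returned by the eigensolver are arbitrary mixtures, the populations summed over a filled degenerate level do not depend on that choice.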

The data in Table 10-8 indicate that the hydrogens have positive overlap populations 
with the 2s and 2p AOs of carbon but have small negative overlap populations with each 
other. This corresponds to saying that the hydrogens interact with the carbon AOs in a 
bonding way and in a weakly antibonding way with each other. The symmetry equiva- 
lence of the four hydrogen atoms results in their having identical net AO populations. 

TABLE 10-8 » Mulliken Net AO and Overlap Populations for Methane as Computed by the
Extended Hückel Method

          2s       2p_z     2p_x     2p_y     1s_a     1s_b     1s_c     1s_d
2s      0.6827   0.0      0.0      0.0      0.2229   0.2229   0.2229   0.2229
2p_z             0.5645   0.0      0.0      0.5723   0.0636   0.0636   0.0636
2p_x                      0.5645   0.0      0.0      0.5087   0.1272   0.1272
2p_y                               0.5645   0.0      0.0      0.3815   0.3815
1s_a                                        0.6844  -0.0491  -0.0491  -0.0491
1s_b                                                 0.6844  -0.0491  -0.0491
1s_c                                                          0.6844  -0.0491
1s_d                                                                   0.6844


Often we are interested in knowing how the hydrogen atoms interact with the carbon
in toto, rather than with the 2s and 2p AOs separately. This can be obtained simply
by summing all the carbon atom AO contributions together to give a reduced overlap
population matrix, shown in Table 10-9. The reduced population matrix makes evident
the equivalence of the four C-H bonds, all of which have a total overlap population of
0.7952 electrons.

TABLE 10-9 » Reduced Net AO and Overlap Population Matrix for Methane

         C        H_a      H_b      H_c      H_d
C      2.3762   0.7952   0.7952   0.7952   0.7952
H_a             0.6844  -0.0491  -0.0491  -0.0491
H_b                      0.6844  -0.0491  -0.0491
H_c                               0.6844  -0.0491
H_d                                        0.6844

The Mulliken population scheme described above assigns some electronic charge to 
AOs, the rest to overlap regions. An alternative scheme, which assigns all the charge 
to AOs, was also proposed by Mulliken. One simply divides each overlap population 
in half, assigning half of the charge to each of the two participating AOs. When all 
the overlap populations have been reassigned in this way, the electronic charge is all 
in the AOs, and the sum of these AO charges still equals the total number of electrons. 
Mulliken called the AO populations resulting from this procedure gross AO populations. 
We will use the symbol N(X) for the gross population in X, where X can be an AO or
an atom (i.e., the sum of all gross AO populations on one atom). Clearly

$$N(X) = q_X + \frac{1}{2}\sum_{j \neq X} p_{Xj} \tag{10-21}$$


The gross AO populations, gross atomic populations, and the resultant atomic charges
(obtained by combining electronic and nuclear charges) for methane are listed in
Table 10-10. These data suggest that the carbon has lost a very small amount of charge
to hydrogen upon formation of the molecule. It would be very risky, however, to place
much faith in such an interpretation. It turns out that populations are rather sensitive to
choice of VSIEs. For example, Hoffmann’s original choice of VSIEs differs from that
used here, and he obtained gross populations for hydrogen atoms of around 0.9, giving
net positive charges of 0.1. Thus, absolute values are not very useful, but changes
in gross population as we go from one hydrogen to another in the same hydrocarbon
molecule or in closely related molecules do appear to be rather insensitive to VSIE
choice and are often in accord with results of more accurate calculations.

TABLE 10-10 » Gross AO Populations, Gross Atomic
Populations, and Net Atomic Charges for Methane

          Gross AO      Gross atom     Net atomic
          population    population     charge
C 2s      1.128
C 2pᵃ     0.946         3.966          +0.0334
Hᵃ        1.008         1.008          -0.0083

ᵃAll 2p AOs and all H AOs have identical values because they are
equivalent through symmetry.
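
The reduction to Table 10-9 and the gross populations of Table 10-10 are simple sums over the entries of Table 10-8. The sketch below, assuming NumPy, types in the upper triangle of Table 10-8 and applies Eq. (10-21); the array names and the atom-index bookkeeping are illustrative choices.

```python
import numpy as np

# Upper triangle of Table 10-8 (AO order: 2s, 2pz, 2px, 2py, 1s_a, 1s_b, 1s_c, 1s_d).
upper = np.array([
    [0.6827, 0.0, 0.0, 0.0, 0.2229, 0.2229, 0.2229, 0.2229],
    [0.0, 0.5645, 0.0, 0.0, 0.5723, 0.0636, 0.0636, 0.0636],
    [0.0, 0.0, 0.5645, 0.0, 0.0, 0.5087, 0.1272, 0.1272],
    [0.0, 0.0, 0.0, 0.5645, 0.0, 0.0, 0.3815, 0.3815],
    [0.0, 0.0, 0.0, 0.0, 0.6844, -0.0491, -0.0491, -0.0491],
    [0.0, 0.0, 0.0, 0.0, 0.0, 0.6844, -0.0491, -0.0491],
    [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.6844, -0.0491],
    [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.6844],
])
net = np.diag(upper)
pair = upper + upper.T - 2.0 * np.diag(net)      # symmetric overlap populations, zero diagonal

# Reduced matrix (Table 10-9): lump the four carbon AOs together.
c_net = net[:4].sum() + np.triu(pair[:4, :4], k=1).sum()
c_h = pair[:4, 4:].sum(axis=0)                   # one C-H entry per hydrogen

# Gross populations, Eq. (10-21): each overlap population is split in half.
gross_ao = net + 0.5 * pair.sum(axis=1)
atom_of_ao = np.array([0, 0, 0, 0, 1, 2, 3, 4])  # 0 = C, 1..4 = H_a..H_d
gross_atom = np.bincount(atom_of_ao, weights=gross_ao)

valence_electrons = np.array([4, 1, 1, 1, 1])    # neutral-atom valence electron counts
charge = valence_electrons - gross_atom

print("C net, C-H overlaps :", round(float(c_net), 4), np.round(c_h, 4))
print("gross AO populations:", np.round(gross_ao, 3))
print("gross atom / charge :", np.round(gross_atom, 3), np.round(charge, 4))
```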


10-3 Extended Hückel Energies and Mulliken Populations


In Chapter 8 it was shown that a simple quantitative relation exists between the energy 
of a simple Hückel MO and the contributions of the MO to bond orders [Eq. (8-56)]. 
The more bonding such an MO is, the lower is its energy. A similar relationship will 
now be shown to hold for extended Hückel energies and Mulliken populations. 

The orbital energy for the real MO $\phi_i$ is (ignoring the spin variable)

$$E_i = \frac{\int \phi_i \hat{H} \phi_i \, dv}{\int \phi_i^2 \, dv} \tag{10-22}$$

$$= \frac{\sum_{j,k}^{\mathrm{AOs}} c_{ji} c_{ki} H_{jk}}{\sum_{j,k}^{\mathrm{AOs}} c_{ji} c_{ki} S_{jk}} \tag{10-23}$$

Assume that $\phi_i$ is normalized, so that the denominator is unity. The numerator of
Eq. (10-23) contains diagonal and off-diagonal terms. If we separate these, and
substitute the relation (10-4) for the off-diagonal terms, we obtain

$$E_i = \sum_{j}^{\mathrm{AOs}} c_{ji}^2 H_{jj} + 2\sum_{j<k}^{\mathrm{AOs}} c_{ji} c_{ki} K S_{jk}\,\frac{H_{jj} + H_{kk}}{2} \tag{10-24}$$

Comparison with Eqs. (10-13) and (10-14) shows that the first sum contains contributions
to net AO populations and the second sum contains overlap population contributions.
That is,

$$E_i = \sum_{j}^{\mathrm{AOs}} q_j^i H_{jj} + \sum_{j<k}^{\mathrm{AOs}} p_{jk}^i K\,\frac{H_{jj} + H_{kk}}{2} \tag{10-25}$$

This equation indicates that the energy of an extended Hückel MO is equal to its net
contributions to AO populations times AO energy weighting factors plus its contributions
to overlap populations times overlap energy weighting factors.

By summing over all MOs times occupation numbers, we arrive at a relation between
total EH energy and net AO and overlap populations:

$$E = \sum_{j}^{\mathrm{AOs}} q_j H_{jj} + \sum_{j<k}^{\mathrm{AOs}} p_{jk} K\,\frac{H_{jj} + H_{kk}}{2} \tag{10-26}$$

J j«k 
Equations (10-25) and (10-26) are useful because they permit us to understand
computed energies in terms of electron distributions. This is helpful when we seek to
understand energy changes which occur as molecules are stretched, bent, or twisted.

There is an important difference between the extended Hückel formulas (10-25) and
(10-26) and their simple Hückel counterparts. In a simple Hückel MO, the charge
density contributions $q^i$ always add up to unity, contributing α to the MO energy
(omitting heteroatom cases). The deviation of the MO energy from α is thus due only to
the bond-order contributions $p^i$. As a result, for hydrocarbons, the simple Hückel MO
that is the more bonding of a pair is always lower in energy. This is not so, however, in
extended Hückel MOs. Since the sum of AO and overlap populations must equal the
number of electrons present, we can increase the total amount of overlap population
only at the expense of net AO populations. Therefore, energy lowering due to the
second sum of Eq. (10-25) is purchased at the expense of that due to the first sum. This
complicates the situation and requires that we take a more detailed look before issuing
any blanket statements.
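
Equation (10-26) can be checked against the methane numbers already tabulated. The sketch below, assuming NumPy, combines the populations of Table 10-8 with the diagonal elements of Eqs. (10-1) to (10-3) and K = 1.75. Because every input is rounded to four decimals, the result matches the total EH energy only to about three decimal places.

```python
import numpy as np

# Upper triangle of Table 10-8 (AO order: 2s, 2pz, 2px, 2py, 1s_a, 1s_b, 1s_c, 1s_d).
upper = np.array([
    [0.6827, 0.0, 0.0, 0.0, 0.2229, 0.2229, 0.2229, 0.2229],
    [0.0, 0.5645, 0.0, 0.0, 0.5723, 0.0636, 0.0636, 0.0636],
    [0.0, 0.0, 0.5645, 0.0, 0.0, 0.5087, 0.1272, 0.1272],
    [0.0, 0.0, 0.0, 0.5645, 0.0, 0.0, 0.3815, 0.3815],
    [0.0, 0.0, 0.0, 0.0, 0.6844, -0.0491, -0.0491, -0.0491],
    [0.0, 0.0, 0.0, 0.0, 0.0, 0.6844, -0.0491, -0.0491],
    [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.6844, -0.0491],
    [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.6844],
])
h_diag = np.array([-0.7144] + [-0.3921] * 3 + [-0.5000] * 4)   # a.u.
K = 1.75

net = np.diag(upper)                       # q_j
pair_upper = np.triu(upper, k=1)           # p_jk for j < k
weights = K * (h_diag[:, None] + h_diag[None, :]) / 2.0

e_net = float(net @ h_diag)                        # first sum of Eq. (10-26)
e_overlap = float((pair_upper * weights).sum())    # second sum of Eq. (10-26)
e_total = e_net + e_overlap

print(f"net AO term  = {e_net:.4f} a.u.")
print(f"overlap term = {e_overlap:.4f} a.u.")
print(f"total        = {e_total:.4f} a.u.")
```

Splitting the total this way shows that, for methane, the net AO term and the overlap term contribute almost equally to the EH total energy.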

The following simple example provides a convenient reference point for discussion.
Consider two identical 1s AOs, $\chi_a$ and $\chi_b$, on identical nuclei separated by a distance R.
These AOs combine to form MOs of $\sigma_g$ and $\sigma_u$ symmetry. The $\sigma_g$ MO has an associated
positive overlap population and equal positive net AO populations for $\chi_a$ and $\chi_b$. If R
now is decreased slightly, the overlap population increases slightly (say by 2δ), so the
net AO populations must each decrease by δ. The energy change for the MO, then, is

$$\Delta E_{\sigma_g} = -\delta\,(H_{aa} + H_{bb}) + 2\delta K\,\frac{H_{aa} + H_{bb}}{2} \tag{10-27}$$

Since the molecule is homonuclear, $H_{aa} = H_{bb}$, and

$$\Delta E_{\sigma_g} = 2\delta H_{aa}(K - 1) \tag{10-28}$$

If K > 1, $\Delta E_{\sigma_g}$ is negative (since $H_{aa}$ is negative). This analysis shows that the choice
of 1.75 as the value for K has the effect of making the increase in overlap population
dominate the energy change. If K were less than unity, the net AO population changes
would dominate.
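
The two-orbital example can be made numerical. For the normalized $\sigma_g$ MO the squared coefficient on each AO is 1/[2(1 + S)], so the populations and the energy of Eq. (10-25) depend only on the overlap S. The sketch below uses illustrative values ($H_{aa}$ = -0.5 a.u., S = 0.30 and 0.35) that are assumptions chosen for the demonstration, not data from the text.

```python
def sigma_g(S, h_aa=-0.5, K=1.75):
    """One electron in the sigma_g MO of two identical 1s AOs with overlap S.

    Returns (net AO population on each AO, overlap population, EH energy).
    """
    c2 = 1.0 / (2.0 * (1.0 + S))          # squared coefficient from normalization
    net_each = c2                          # Eq. (10-13)
    overlap_pop = 2.0 * c2 * S             # Eq. (10-14)
    energy = 2.0 * net_each * h_aa + overlap_pop * K * h_aa   # Eq. (10-25)
    return net_each, overlap_pop, energy

for K in (0.5, 1.0, 1.75):
    e_far = sigma_g(0.30, K=K)[2]          # larger R, smaller overlap
    e_near = sigma_g(0.35, K=K)[2]         # smaller R, larger overlap
    print(f"K = {K:4.2f}: E(S=0.30) = {e_far:.5f}, E(S=0.35) = {e_near:.5f}, "
          f"change = {e_near - e_far:+.5f} a.u.")
```

With K = 1.75 the energy falls as the overlap grows, with K = 1 it does not change, and with K = 0.5 it rises, in line with Eq. (10-28).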

The above example suggests that the EHMO method lowers the energy by maxi- 
mizing weighted overlap populations at the expense of net AO populations. It also 
suggests that the EH energy should be lowered whenever a molecule is distorted in a 
way that enables overlap population to increase. These are useful rules of thumb, but 
some caution must be exercised since the existence of several different kinds of atom 
in a molecule leads to a more complicated relation than that in the above example. 

Our methane example illustrates the above ideas. We have already seen that the total 
EH energy for methane goes through a minimum around $R_{\mathrm{C-H}} = 1$ Å. Let us see how 
the individual MO energies change as a function of $R_{\mathrm{C-H}}$ and try to rationalize their 
behaviors in terms of the overlap population changes. A plot of the MO energies is given 
in Fig. 10-8. The lowest-energy MO is s type and C-H bonding. The overlap between H 
1s AOs and the C 2s STO increases as $R_{\mathrm{C-H}}$ decreases. As a result, the energy of this MO 
decreases as $R_{\mathrm{C-H}}$ decreases, favoring formation of the united atom. The second-lowest 
energy level is triply degenerate and belongs to p-type C-H bonding MOs. A glance at 
Fig. 10-3 indicates that overlap between AOs will first increase in magnitude as $R_{\mathrm{C-H}}$ 
decreases. But ultimately, at small $R_{\mathrm{C-H}}$ this behavior must be reversed because the 1s 
and 2p AOs are orthogonal when they are isocentric. Therefore, as $R_{\mathrm{C-H}}$ decreases, the 
overlap population first increases, then decreases toward zero. Consequently, the EH 
energy of this level first decreases, then increases. Since there are six electrons in this 
level and only two in the lowest-energy MO, this one dominates the total energy and is 
responsible for the eventual rise in total energy at small R that creates the minimum in 
Fig. 10-4. The two remaining (unoccupied) levels belong to antibonding MOs, and the 
overlap population becomes more negative as $R_{\mathrm{C-H}}$ decreases. The p-type level rises 
less rapidly and eventually passes through a maximum (not shown), again because the 
overlap population must ultimately approach zero as $R_{\mathrm{C-H}}$ approaches zero. 

Figure 10-8 » Extended Hückel MO energies for methane as a function of $R_{\mathrm{C-H}}$. 

It is important to remember that all these remarks apply to the EH method only. The 
relationship between the EH method and other methods or experimental energies 
is yet to be discussed. 


10-4 Extended Hückel Energies and Experimental Energies 


We have seen that a change in EH energy reflects certain changes in calculated Mulliken 
populations. We now consider the circumstances that should exist in order that such 
EH energy changes agree roughly with actual total energy changes for various systems. 

In essence, there are two requirements. First, the population changes calculated 
by the EH method ought to be in qualitative agreement with the charge shifts that 
actually occur in the real system. This condition is not always met. Some systems, 
especially those having unpaired electrons, are not accurately representable by a single- 
configuration wavefunction (that is, by a single product or a single Slater determinant). 
An example of this was seen for some 1s2s states of helium (see Chapter 5). Since the 
EHMO method is based on a single configuration, it is much less reliable for treating 
such systems. Special methods exist for handling such systems, but they normally are 
not applied at the Hückel level of approximation. 

The second requirement is that, as the charge shifts in the real system, the total 
energy should change in the way postulated by the EH method. For instance, if the 
charge increases in bond regions, this should tend to lower the total energy. In actuality, 
this does not always happen. We can see this by returning to our example of two 
identical 1s AOs separated by R. The associated $\sigma_g$ MO increases its overlap population 
monotonically as R decreases. Therefore, the EH postulate predicts that the total energy 
should decrease monotonically. It is clear, however, that this does not happen in any 
real system. If our system is H$_2$, we know that the experimental energy decreases 
with decreasing R until $R = R_e$, and increases thereafter (Fig. 7-18). If our system is 
He$_2^{2+}$, the experimental total energy increases as R decreases, due to the dominance 
of internuclear repulsion.⁸ Consideration of many examples such as these indicates 
that the postulate is usually qualitatively correct when we are dealing with interactions 
between neutral, fairly nonpolar systems that are separated by a distance greater than 
a bond length typical for the atoms involved. These, then, are circumstances under 
which the EHMO method might be expected to be qualitatively correct. 

In brief, we find two kinds of condition limiting the applicability of the EH method. 
The first is that the system must be reasonably well represented by a single configuration. 
Hence, closed-shell systems are safest. The second condition is that we apply the 
method to uncharged nonpolar systems where the nuclei that are undergoing relative 
motion are not too close to each other. 

These conditions are often satisfied by molecules undergoing internal rotation about 
single bonds. Thus, it is indeed appropriate that the internal rotation barrier in ethane 
was used to help calibrate the method. A comparison of other EH calculated barriers 
with experimental values (Table 10-11) gives us some idea of the capabilities and 
limitations of the method. For all the molecules listed, the most stable conformation 
predicted by the EH method agrees with experiment. This suggests that one can place 
a fair amount of reliance on the conformational predictions of the method, at least 
for threefold symmetric rotors. Also, certain gross quantitative trends, such as the 
significant reduction as we proceed down the series ethane, methylamine, methanol, are 
displayed in the EH results, but it is evident that the quantitative predictions of barrier 
height are not very good. This is not too surprising since the method was calibrated on 
hydrocarbons. Introduction of halogens, nitrogen, or oxygen produces greater polarity 
and might be expected to require a different parametrization. 
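
The quality of the barrier predictions can be summarized with a few lines of arithmetic. The sketch below simply transcribes the calculated and experimental values of Table 10-11 and computes the signed errors and their mean absolute value. The dictionary name `BARRIERS` and the helper functions are illustrative choices, not part of any EH program.

```python
# (calculated, experimental) barriers in kcal/mole, transcribed from Table 10-11.
BARRIERS = {
    "CH3-CH3": (3.04, 2.88),
    "CH3-NH2": (1.66, 1.98),
    "CH3-OH": (0.45, 1.07),
    "CH3-CH2F": (2.76, 3.33),
    "CH3-CHF2": (2.39, 3.18),
    "CH3-CF3": (2.17, 3.25),
    "CH3-CH2Cl": (4.58, 3.68),
    "CH3-CHCH2": (1.20, 1.99),
    "cis-CH3-CHCHCl": (0.11, 0.62),
    "CH3-CHO": (0.32, 1.16),
    "CH3-NCH2": (0.44, 1.97),
}


def barrier_errors(table):
    """Signed error (calculated minus experimental) for each molecule."""
    return {name: calc - exp for name, (calc, exp) in table.items()}


def mean_absolute_error(table):
    """Mean of the absolute errors over all molecules in the table."""
    errors = barrier_errors(table)
    return sum(abs(e) for e in errors.values()) / len(errors)


if __name__ == "__main__":
    errors = barrier_errors(BARRIERS)
    for name, err in errors.items():
        print(f"{name:16s} {err:+.2f}")
    print("mean absolute error:", round(mean_absolute_error(BARRIERS), 2))
    print("underestimated:", sum(1 for e in errors.values() if e < 0), "of", len(errors))
```

The errors are comparable in size to many of the barriers themselves, and most of the calculated barriers fall below experiment, which is consistent with the remark that the quantitative predictions are not very good.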

For reasons outlined earlier, we expect even less accuracy in EH calculations of 
molecular deformations involving bond-angle or bond-length changes. This expectation 
is generally reinforced by calculation. Molecular shape predictions become poorer 
for more polar molecules (water is calculated to be most stable when it is linear), and 
bond-length predictions are quite poor too. 


8 Actually, He$_2^{2+}$ has a minimum at short R in its energy curve due to an avoidance of curve crossing (discussed 
in Chapter 14), but this minimum is unstable with respect to the separated ions. See Pauling [9]. 


TABLE 10-11 » Energy Barriers for Internal Rotation about Single Bonds (a) 

                        Barrier (kcal/mole) (b) 
Molecule              Calculated    Experiment 
CH3-CH3                  3.04          2.88 
CH3-NH2                  1.66          1.98 
CH3-OH                   0.45          1.07 
CH3-CH2F                 2.76          3.33 
CH3-CHF2                 2.39          3.18 
CH3-CF3                  2.17          3.25 
CH3-CH2Cl                4.58          3.68 
CH3-CHCH2                1.20          1.99 
cis-CH3-CHCHCl           0.11          0.62 
CH3-CHO                  0.32          1.16 
CH3-NCH2                 0.44          1.97 

(a) Calculated barriers are for rigid rotation, where no bond length or angle changes occur 
except for the torsional angle change about the internal axis. 

(b) The stable form for the first seven molecules has the methyl C-H bonds staggered with 
respect to bonds across the rotor axis. For the last four molecules, the stable form has 
a C-H methyl bond eclipsing the double bond. 

Results such as these have tended to restrict use of the EH method to qualitative 
predictions of conformation in molecules too large to be conveniently treated by more 
accurate methods. However, just as the simple Hückel method underwent various 
refinements (such as the $\omega$ technique) to patch up certain inadequacies, so has the EH 
method been refined. Such refinements⁹ have been shown to give marked improvement 
in numerical predictions of various properties. The EH method has been overtaken in 
popularity by a host of more sophisticated computational methods. (See Chapter 11.) 
However, it is still sometimes used as a first step in such methods as a way to produce a 
starting set of approximate MOs. The EHMO method also continues to be important as 
the computational equivalent of qualitative MO theory (Chapter 14), which continues 
to play an important role in theoretical treatments of inorganic and organic chemistries 
(as, for example, in Walsh’s Rules and in Woodward-Hoffmann Rules). 


10-4.A Problems 


The following output is produced by an EH calculation on the formaldehyde molecule, 
and is referred to in Problems 10-1 to 10-10. 


9 See Kalman [10], Boyd [11], and the references cited in these papers. Also, see Anderson and Hoffmann [12] 
and Anderson [13]. 

Formaldehyde (Ground State) Orbital Numbering 


Orbital  Atom  n  l  m       x           y         z      exp      Hii 
   1     H-1   1  0  0   -0.550000    0.952600    0.0    1.200   -13.60 
   2     H-2   1  0  0   -0.550000   -0.952600    0.0    1.200   -13.60 
   3     C-3   2  0  0    0.0         0.0         0.0    1.625   -19.44 
   4     C-3   2  1  0    0.0         0.0         0.0    1.625   -10.67 
   5     C-3   2  1  1    0.0         0.0         0.0    1.625   -10.67 
   6     C-3   2  1  1    0.0         0.0         0.0    1.625   -10.67 
   7     O-4   2  0  0    1.220000    0.0         0.0    2.275   -32.38 
   8     O-4   2  1  0    1.220000    0.0         0.0    2.275   -15.85 
   9     O-4   2  1  1    1.220000    0.0         0.0    2.275   -15.85 
  10     O-4   2  1  1    1.220000    0.0         0.0    2.275   -15.85 


Hij = K Sij (Hii + Hjj)/2. with K = 1.75. 


Distance Matrix (a.u.) 


         1         2         3         4 

1     0.0       3.6004    2.0787    3.7985 
2     3.6004    0.0       2.0787    3.7985 
3     2.0787    2.0787    0.0       2.3055 
4     3.7985    3.7985    2.3055    0.0 

Total effective nuclear repulsion = 17.69537317 a.u. 


Formaldehyde Eigenvalues 


Eigenvalues (a.u.)     Occ. no.     Eigenvalues (a.u.)     Occ. no. 

E(1) =  1.039011          0         E(6)  = -0.587488         2 
E(2) =  0.472053          0         E(7)  = -0.597185         2 
E(3) =  0.314551          0         E(8)  = -0.611577         2 
E(4) = -0.342162          0         E(9)  = -0.755816         2 
E(5) = -0.517925          2         E(10) = -1.242836         2 

Sum = -8.625654 a.u. 


Reduced Overlap Population Matrix Atom by Atom 

          1          2          3          4 

1      0.6876    -0.0702     0.7945    -0.0615 
2     -0.0702     0.6876     0.7945    -0.0615 
3      0.7945     0.7945     1.4866     0.8148 
4     -0.0615    -0.0615     0.8148     6.9277 


Orbital Charges                     Net Charges 

 1   1.018973     6   0.879574      1   -0.018973     3    1.311555 
 2   1.018973     7   1.767672      2   -0.018973     4   -1.273609 
 3   1.026764     8   1.782529 
 4   0.217471     9   1.749778 
 5   0.564637    10   1.973630 

Total charge = 0.000000. 

10-1. Use the output to determine the orientation of the molecule with respect to Carte- 
sian coordinates. Sketch the molecule in relation to these axes and number the 
atoms in accord with their numbering in the output. 

10-2. Use the orbital numbering data together with the overlap matrix to figure out 
which of the labels 1s, 2s, 2px, 2py, 2pz goes with each of the ten AOs (i.e., it is 
obvious that AO 1 is a 1s AO, but it is not so obvious what AO 5 is). 

10-3. Use your conclusions from above, together with coefficients in the eigenvector 
matrix, to sketch the MOs having energies of —0.756, —0.611, and —0.597 a.u. 
Which of these are π MOs? Which are σ MOs? 

10-4. Label each of the ten MOs “π” or “σ” by inspecting the coefficient matrix. 

10-5. What is the Mulliken overlap population between C and O $2p_\pi$ AOs in this 
molecule? Should removal of an electron from MO 7 cause the C—O bond to 
shorten or to lengthen? 

10-6. Demonstrate that MO 7 satisfies Eq. (10-24). (Note that $H_{ii}$ values in the first 
table are in units of electron volts, while orbital energies are in atomic units.) 

10-7. Use the reduced overlap population matrix to verify that the sum of AO and 
overlap populations is equal to the number of valence electrons. 

10-8. Using MO 7 as your example, verify that the charge matrix table is a tabulation 
of the contributions of each MO to gross atom populations. 

10-9. In the list of “orbital charges,” are the “orbitals” MOs or AOs? Demonstrate how 
these numbers are derived from those in the charge matrix. 

10-10. What is the physical meaning of the “net charges” in the data? Would you characterize 
these results as indicative of low polarity? Which end of the molecule 
should correspond to the negative end of the dipole moment? 


10-11. How many MOs will be produced by an EHMO calculation on butadiene? 


Chapter 11 


The SCF-LCAO-MO Method and Extensions 


11-1 Ab Initio Calculations 


A rigorous variational calculation on a system involves the following steps: 
1. Write down the hamiltonian operator H for the system. 


2. Select some mathematical functional form $\psi$ as the trial wavefunction. This form 
should have variable parameters. 


3. Minimize 

$$E = \frac{\int \psi^* \hat{H} \psi \, d\tau}{\int \psi^* \psi \, d\tau} \qquad (11\text{-}1)$$

with respect to variations in the parameters. 
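
As a minimal illustration of the three steps, consider the hydrogen atom with a single normalized gaussian trial function proportional to $\exp(-\alpha r^2)$, where α is the one variable parameter. For this trial function, Eq. (11-1) has the closed form $E(\alpha) = 3\alpha/2 - 2(2\alpha/\pi)^{1/2}$ in atomic units. This is a standard textbook result that is quoted here rather than derived. The golden-section search and all function names below are illustrative choices, one of many ways to carry out step 3.

```python
import math


def energy(alpha):
    """E(alpha) in hartrees for a normalized exp(-alpha*r**2) trial function on H."""
    kinetic = 1.5 * alpha
    potential = -2.0 * math.sqrt(2.0 * alpha / math.pi)
    return kinetic + potential


def golden_section_minimize(f, lo, hi, tol=1e-9):
    """Locate the minimum of a unimodal function f on the interval [lo, hi]."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    while abs(b - a) > tol:
        c = b - inv_phi * (b - a)  # left interior point
        d = a + inv_phi * (b - a)  # right interior point
        if f(c) < f(d):
            b = d  # minimum lies in [a, d]
        else:
            a = c  # minimum lies in [c, b]
    return 0.5 * (a + b)


if __name__ == "__main__":
    alpha_opt = golden_section_minimize(energy, 0.01, 2.0)
    print("optimal alpha:", alpha_opt)
    print("minimum energy (hartree):", energy(alpha_opt))
```

The search should land near α = 8/(9π) ≈ 0.283, where E = -4/(3π) ≈ -0.424 hartree. This lies above the exact ground-state energy of -0.5 hartree, as the variation principle requires.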


The simple and extended Hückel methods are not rigorous variational calculations. 
Although they both make use of the secular determinant technique from linear variation 
theory, no hamiltonian operators are ever written out explicitly and the integrations in 
$H_{ij}$ are not performed. These are semiempirical methods because they combine the 
theoretical form with parameters fitted from experimental data. 

The term ab initio (“from the beginning") is used to describe calculations in which 
no use is made of experimental data. In an ab initio variational method, all three steps 
listed above are explicitly performed. In this chapter we describe a certain kind of 
ab initio calculation called the self-consistent field (SCF) method. This is one of the most 
commonly encountered types of ab initio calculation for atoms or molecules. We also 
describe a few popular methods for proceeding beyond the SCF level of approximation. 

The SCF method and extensions to it are mathematically and physically consider- 
ably more complicated than the one-electron methods already discussed. Thus, one 
normally does not perform such calculations with pencil and paper, but rather with 
complicated computer programs. Therefore, in this chapter we are not concerned with 
how one does such calculations because, in most cases, they are done by acquiring a 
program written by a group of specialists. Rather we are concerned with a description 
of the mathematical and physical underpinnings of the method. Because the method 
is simultaneously complicated and rigorously defined, a special jargon has developed. 
Terms like “Hartree-Fock,” or “correlation energy" have specific meanings and are 
pervasive in the literature. Hence, a good deal of emphasis in this chapter is put on 
defining some of these important terms. 


11-2 The Molecular Hamiltonian 


In practice, one usually does not use the complete hamiltonian for an isolated molecular 
system. The complete hamiltonian includes nuclear and electronic kinetic energy oper- 
ators, electrostatic interactions between all charged particles, and interactions between 
all magnetic moments due to spin and orbital motions of nuclei and electrons. The 
complete hamiltonian also accounts for the fact that a moving particle experiences a 
change in mass due to relativistic effects. The resulting hamiltonian 
is much too complicated to work with. Usually, relativistic mass effects are ignored, 
the Born-Oppenheimer approximation is made (to remove nuclear kinetic energy oper- 
ators), and all magnetic interactions are ignored (except in special cases where we are 
interested in spin coupling). The resulting hamiltonian for the electronic energy is, in 
atomic units, 


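$$\hat{H} = -\frac{1}{2}\sum_{i=1}^{n} \nabla_i^2 - \sum_{\mu=1}^{N}\sum_{i=1}^{n} \frac{Z_\mu}{r_{\mu i}} + \sum_{i=1}^{n-1}\sum_{j=i+1}^{n} \frac{1}{r_{ij}} \qquad (11\text{-}2)$$

where i and j are indices for the n electrons and $\mu$ is an index for the N nuclei. The 
nuclear repulsion energy $V_{nn}$ is 

$$V_{nn} = \sum_{\mu=1}^{N-1}\sum_{\nu=\mu+1}^{N} \frac{Z_\mu Z_\nu}{r_{\mu\nu}} \qquad (11\text{-}3)$$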


In choosing this hamiltonian, we are in effect electing to seek an energy of an 
idealized nonexistent system—a nonrelativistic system with clamped nuclei and no 
magnetic moments. If we wish to make a very accurate comparison of our computed 
results with experimentally measured energies, it is necessary to modify either the 
experimental or the theoretical numbers to compensate for the omissions in $\hat{H}$. 
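
The double sum of Eq. (11-3) is easy to evaluate directly. The sketch below applies it to the formaldehyde geometry listed with the Chapter 10 problems. Two assumptions are made here and are not stated in that output: that the listed coordinates are in ångströms, and that the "effective" nuclear charges are the valence-core charges 1, 4, and 6 for H, C, and O. The function and constant names are illustrative.

```python
import math

ANGSTROM_TO_BOHR = 1.8897261  # conversion factor, bohr per angstrom

# Assumed effective (valence-core) charges and coordinates in angstroms.
CHARGES = [1.0, 1.0, 4.0, 6.0]  # H-1, H-2, C-3, O-4
COORDS_ANGSTROM = [
    (-0.55, 0.9526, 0.0),
    (-0.55, -0.9526, 0.0),
    (0.0, 0.0, 0.0),
    (1.22, 0.0, 0.0),
]


def nuclear_repulsion(charges, coords_bohr):
    """Sum of Z_mu * Z_nu / r_mu_nu over unique nuclear pairs, Eq. (11-3)."""
    total = 0.0
    n_nuclei = len(charges)
    for mu in range(n_nuclei - 1):
        for nu in range(mu + 1, n_nuclei):
            r = math.dist(coords_bohr[mu], coords_bohr[nu])
            total += charges[mu] * charges[nu] / r
    return total


COORDS_BOHR = [
    tuple(ANGSTROM_TO_BOHR * x for x in xyz) for xyz in COORDS_ANGSTROM
]

if __name__ == "__main__":
    print("V_nn (a.u.):", nuclear_repulsion(CHARGES, COORDS_BOHR))
```

Under these assumptions the result should agree closely with the "total effective nuclear repulsion" printed in that output. Small differences in the later decimals can arise from the conversion factor used.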


11-3 The Form of the Wavefunction 


The wavefunction for an SCF calculation is one or more antisymmetrized products of 
one-electron spin-orbitals. We have already seen (Chapter 5) that a convenient way to 
produce an antisymmetrized product is to use a Slater determinant. Therefore, we take 
the trial function $\psi$ to be made up of Slater determinants containing spin-orbitals $\phi$. 
If we are dealing with an atom, then the $\phi$'s are atomic spin-orbitals. For a molecule, 
they are molecular spin-orbitals. 

In our discussion of many-electron atoms (Chapter 5), we noted that certain atoms 
in their ground states are fairly well described by assigning two electrons, one of each 
spin, to each AO, starting with the lowest-energy AO and working up until all the 
electrons are assigned. If the last electron completes the filling of all the AOs having a 
given principal quantum number, n, we have a closed shell atomic system. Examples 
are He($1s^2$) and Ne($1s^2 2s^2 2p^6$). Atoms wherein the last electron completes the filling 
of all AOs having a given $l$ quantum number are said to have a closed subshell. An 
example is Be($1s^2 2s^2$). Both types of system tend to be well approximated by a single 
determinantal wavefunction if the highest filled level is not too close in energy to the 
lowest empty level. (Beryllium is the least successfully treated of these three at this level 
of approximation because the 2s level is fairly close in energy to the 2p level.) A similar 
situation holds for molecules; that is, the wavefunctions of many molecules in their 
ground states are well represented by single determinantal wavefunctions with electrons 
of paired spins occupying identical MOs. Such molecules are said to be closed-shell 
systems. We can represent a trial wavefunction for a 2n-electron closed-shell system as 


$$\psi_{\text{closed shell}} = |\phi_1(1)\,\bar{\phi}_1(2)\,\phi_2(3)\,\bar{\phi}_2(4) \cdots \phi_n(2n-1)\,\bar{\phi}_n(2n)| \qquad (11\text{-}4)$$


where we have used the shorthand form for a Slater determinant described in Chapter 5. 
For the present, we restrict our discussion to closed-shell single-determinantal wave- 
functions. 


11-4 The Nature of the Basis Set 


Some functional form must be chosen for the MOs $\phi$. The usual choice is to approximate 
$\phi$ as a linear combination of “atomic orbitals” (LCAO), these AOs being located on the 
nuclei. The detailed nature of these AOs, as well as the number to be placed on each 
nucleus, is still open to choice. We consider these choices later. For now, we simply 
recognize that we are working within the familiar LCAO-MO level of approximation. 
If we represent the basis AOs by $\chi$, we have, for the ith MO, 

$$\phi_i = \sum_j c_{ji}\,\chi_j \qquad (11\text{-}5)$$

where the constants $c_{ji}$ are as yet undetermined. 


11-5 The LCAO-MO-SCF Equation 


Having a hamiltonian and a trial wavefunction, we are now in a position to use the linear 
variation method. The detailed derivation of the resulting equations is complicated 
and notationally clumsy, and it has been relegated to Appendix 7. Here we discuss the 
results of the derivation. 

For our restricted case of a closed-shell single-determinantal wavefunction, the variation 
method leads to 

$$\hat{F}\phi_i = \epsilon_i \phi_i \qquad (11\text{-}6)$$

These equations are sometimes called the Hartree-Fock equations, and $\hat{F}$ is often 
called the Fock operator. The detailed formula for $\hat{F}$ is (from Appendix 7) 

$$\hat{F}(1) = -\frac{1}{2}\nabla_1^2 - \sum_{\mu} \frac{Z_\mu}{r_{\mu 1}} + \sum_{j=1}^{n} \left(2\hat{J}_j - \hat{K}_j\right) \qquad (11\text{-}7)$$

The symbols $\hat{J}_j$ and $\hat{K}_j$ stand for operators related to the $1/r_{ij}$ operators in $\hat{H}$. $\hat{J}_j$ is 
called a coulomb operator because it leads to energy terms corresponding to charge 
cloud repulsions. It is possible to write $\hat{J}_j$ explicitly: 


$$\hat{J}_j = \int \phi_j^*(2)\,(1/r_{12})\,\phi_j(2)\,d\tau(2) \qquad (11\text{-}8)$$

$\hat{K}_j$ leads ultimately to the production of exchange integrals, and so it is called an 
exchange operator. It is written explicitly in conjunction with a function on which it 
is operating, viz. 

$$\hat{K}_j\,\phi_i(1) = \int \phi_j^*(2)\,(1/r_{12})\,\phi_i(2)\,d\tau(2)\;\phi_j(1) \qquad (11\text{-}9)$$

Notice that an index exchange has been performed. It is not difficult to see that the 
expression (see Appendix 9 for bra-ket notation) 

$$\langle \phi_i | \hat{F} | \phi_i \rangle = \epsilon_i \qquad (11\text{-}10)$$

will lead to integrals such as 

$$\langle \phi_i | \hat{J}_j | \phi_i \rangle = \langle \phi_i(1)\phi_j(2) | 1/r_{12} | \phi_i(1)\phi_j(2) \rangle \equiv J_{ij} \qquad (11\text{-}11)$$

$$\langle \phi_i | \hat{K}_j | \phi_i \rangle = \langle \phi_i(1)\phi_j(2) | 1/r_{12} | \phi_i(2)\phi_j(1) \rangle \equiv K_{ij} \qquad (11\text{-}12)$$

which are formally the same as the coulomb and exchange terms encountered in 
Chapter 5 in connection with the helium atom. Notice that, if the spins associated with 
spin-orbitals $\phi_i$ and $\phi_j$ differ, $K_{ij}$ must vanish. This arises because integrations over 
space and spin coordinates of electron 1 (or 2) in Eq. (11-12) lead to integration over 
two different (and orthogonal) spin functions. On the other hand, $J_{ij}$ is not affected by 
such spin agreement or disagreement. 

It would appear from Eq. (11-6) that the MOs $\phi$ are eigenfunctions of the Fock 
operator and that the Fock operator is, in effect, the hamiltonian operator. There is an 
important qualitative difference between $\hat{F}$ and $\hat{H}$, however. The Fock operator is itself 
a function of the MOs $\phi$. Since the summation index j in Eq. (11-7) includes i, the 
operators $\hat{J}_i$ and $\hat{K}_i$ must be known in order to write down $\hat{F}$, but $\hat{J}_i$ and $\hat{K}_i$ involve $\phi_i$, 
and $\phi_i$ is an eigenfunction of $\hat{F}$. Hence, we need $\hat{F}$ to find $\phi_i$, and we need $\phi_i$ to know 
$\hat{F}$. To circumvent this problem, an iterative approach is used. One makes an initial 
guess at the MOs $\phi$. (One could use a semiempirical method to produce this starting 
set.) Then these MOs are used to construct an operator $\hat{F}$, which is used to solve for 
the new MOs $\phi'$. These are then used to construct a new Fock operator, which is in 
turn used to find new MOs, which are used for a new $\hat{F}$, etc., until at last no significant 
change is detected in two successive steps of this procedure. At this point, the $\phi$'s 
produced by $\hat{F}$ are the same as the $\phi$'s that produce the coulomb-and-exchange fields 
in $\hat{F}$. The solutions are said to be self-consistent, and the method is referred to as the 
self-consistent-field (SCF) method. 
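
The iterative cycle can be sketched with a deliberately simplified model. The code below is not a Hartree-Fock program. It assumes two orthonormal basis functions, two electrons in one doubly occupied MO, and a model Fock matrix whose diagonal elements depend on the current populations through a single made-up repulsion parameter `u`. All names and numerical values (`eps1`, `eps2`, `t`, `u`) are hypothetical. It is meant only to show the loop: guess, build the operator, solve, compare, repeat.

```python
import math


def lowest_mo(f11, f22, f12):
    """Lowest eigenvalue and normalized eigenvector of [[f11, f12], [f12, f22]].

    Assumes f12 is nonzero, so the eigenvector below is well defined.
    """
    mean = 0.5 * (f11 + f22)
    root = math.hypot(0.5 * (f11 - f22), f12)
    e = mean - root
    c1, c2 = f12, e - f11  # satisfies (f11 - e)*c1 + f12*c2 = 0
    norm = math.hypot(c1, c2)
    return e, c1 / norm, c2 / norm


def toy_scf(eps1=-1.0, eps2=-0.5, t=0.5, u=0.5, tol=1e-10, max_iter=200):
    """Iterate a population-dependent 2x2 model Fock matrix to self-consistency."""
    n1, n2 = 1.0, 1.0  # initial guess: the two electrons shared equally
    for iteration in range(1, max_iter + 1):
        # Build the model Fock matrix from the current populations.
        f11 = eps1 + 0.5 * u * n1
        f22 = eps2 + 0.5 * u * n2
        # Solve for the occupied MO and form the new populations.
        e, c1, c2 = lowest_mo(f11, f22, -t)
        new_n1, new_n2 = 2.0 * c1 * c1, 2.0 * c2 * c2
        change = max(abs(new_n1 - n1), abs(new_n2 - n2))
        n1, n2 = new_n1, new_n2
        if change < tol:
            return n1, n2, e, iteration
    raise RuntimeError("toy SCF did not converge")


if __name__ == "__main__":
    n1, n2, e, iterations = toy_scf()
    print("populations:", n1, n2)
    print("orbital energy:", e, "after", iterations, "iterations")
```

At convergence, the populations used to build the model Fock matrix are the same as those obtained from its lowest eigenvector, which is the sense of "self-consistent" used above. Real SCF programs follow the same cycle with the full coulomb and exchange operators of Eq. (11-7).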


11-6 Interpretation of the LCAO-MO-SCF Eigenvalues 

The physical meaning of an eigenvalue $\epsilon_i$ is best understood by expanding the integral 

$$\epsilon_i = \langle \phi_i | \hat{F} | \phi_i \rangle \qquad (11\text{-}13)$$

with $\hat{F}$ given by Eq. (11-7). We obtain 

$$\epsilon_i = \langle \phi_i | -\tfrac{1}{2}\nabla_1^2 | \phi_i \rangle - \sum_{\mu} \langle \phi_i | Z_\mu / r_{\mu 1} | \phi_i \rangle + \sum_{j=1}^{n} (2J_{ij} - K_{ij}) \qquad (11\text{-}14)$$

It is common practice to combine the first two terms of Eq. (11-14), which depend 
only on the nature of $\phi_i$, into a single expectation value of the one-electron part of the 
hamiltonian, symbolized $H_{ii}$. Thus, 

$$\epsilon_i = H_{ii} + \sum_{j=1}^{n} (2J_{ij} - K_{ij}) \qquad (11\text{-}15)$$

The quantity $H_{ii}$ is the average kinetic plus nuclear-electronic attraction energy for the 
electron in $\phi_i$. 

The sum of coulomb and exchange integrals in Eq. (11-15) contains all the electronic 
interaction energy. Observe that the index j runs over all the occupied MOs. For a 
particular value of j, say $j = k \neq i$, this gives $2J_{ik} - K_{ik}$ as an interaction energy. This 
means that an electron in $\phi_i$ experiences an interaction energy with the two electrons 
in $\phi_k$ of 

$$2\langle \phi_i(1)\phi_k(2) | 1/r_{12} | \phi_i(1)\phi_k(2) \rangle - \langle \phi_i(1)\phi_k(2) | 1/r_{12} | \phi_k(1)\phi_i(2) \rangle \qquad (11\text{-}16)$$

The first part is the classical repulsion between the electron having an orbital charge 
cloud given by $|\phi_i|^2$ and the two electrons having charge cloud $|\phi_k|^2$. The second part is 
the exchange term which, as we saw in Chapter 5, arises from the antisymmetric nature 
of the wavefunction. It enters (11-16) only once because the electron in $\phi_i$ agrees in 
spin with only one of the two electrons in $\phi_k$. [Equation (11-15) applies because we 
have restricted our discussion to closed-shell systems.] 

The summation over $2J_{ij} - K_{ij}$ includes the case j = i. Here we get $2J_{ii} - K_{ii}$. 
However, examination of Eqs. (11-11) and (11-12) shows that $J_{ii} = K_{ii}$, and so we are 
left with $J_{ii}$. This corresponds to the repulsion between the electron in $\phi_i$ (the energy 
of which we are calculating) and the other electron in $\phi_i$. Because these electrons must 
occur with opposite spin, there is no exchange energy for this interaction. 

In brief, then, the quantity $\epsilon_i$, often referred to as an orbital energy or a one-electron 
energy, is to be interpreted as the energy of an electron in $\phi_i$ resulting from its kinetic 
energy, its energy of attraction for the nuclei, and its repulsion and exchange energies 
due to all the other electrons in their charge clouds $|\phi_j|^2$. 


11-7 The SCF Total Electronic Energy 


It is natural to suppose that the total electronic energy is merely the sum of the one-electron 
energies, but this is not the case in SCF theory. Consider a two-electron system. 
The energy of electron 1 includes its kinetic and nuclear attraction energies and its 
repulsion and exchange energies for electron 2. The energy of electron 2 includes its 
kinetic and nuclear attraction energies and its repulsion and exchange energies for electron 1. 
If we sum these, we have accounted properly for kinetic and nuclear attraction 
energies, but we have included the interelectronic interactions twice as much as they 
actually occur. (The energy of repulsion, say, between two charged particles, 1 and 2, is 
given by the repulsion of 1 for 2 or of 2 for 1, but not by the sum of these.) Therefore, if 
we sum one-electron energies, we get the total electronic energy plus an extra measure 
of electron repulsion and exchange energy. We can correct this by subtracting this extra 
measure away. Thus, for our closed-shell system 

$$E_{\mathrm{elec}} = \sum_{i=1}^{n} \left[ 2\epsilon_i - \sum_{j=1}^{n} (2J_{ij} - K_{ij}) \right] \qquad (11\text{-}17)$$

where the summation is over the occupied orbitals. Comparing Eq. (11-17) with (11-15) 
makes it evident that we can also write 

$$E_{\mathrm{elec}} = \sum_{i=1}^{n} \left[ 2H_{ii} + \sum_{j=1}^{n} (2J_{ij} - K_{ij}) \right] \qquad (11\text{-}18)$$

or 

$$E_{\mathrm{elec}} = \sum_{i=1}^{n} (\epsilon_i + H_{ii}) \qquad (11\text{-}19)$$

To obtain the total (electronic plus nuclear) energy, we add the internuclear repulsion 
energy for the N nuclei: 

$$E_{\mathrm{tot}} = E_{\mathrm{elec}} + V_{nn} \qquad (11\text{-}20)$$

$$V_{nn} = \sum_{\mu=1}^{N-1}\sum_{\nu=\mu+1}^{N} \frac{Z_\mu Z_\nu}{r_{\mu\nu}} \qquad (11\text{-}21)$$
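
The equivalence of Eqs. (11-17), (11-18), and (11-19), and the double counting in a plain sum of orbital energies, can be checked with a few lines of arithmetic. The sketch below uses made-up values of $H_{ii}$, $J_{ij}$, and $K_{ij}$ for two doubly occupied MOs. The numbers are hypothetical and chosen only to satisfy $J_{ii} = K_{ii}$, and the function names are illustrative.

```python
def coulomb_exchange_sum(i, j_int, k_int):
    """Sum over occupied MOs m of (2*J_im - K_im)."""
    return sum(2.0 * j_int[i][m] - k_int[i][m] for m in range(len(j_int)))


def orbital_energies(h, j_int, k_int):
    """Orbital energies from Eq. (11-15)."""
    return [h[i] + coulomb_exchange_sum(i, j_int, k_int) for i in range(len(h))]


def e_elec_11_17(h, j_int, k_int):
    """Twice the orbital energies minus the doubly counted interactions."""
    eps = orbital_energies(h, j_int, k_int)
    return sum(
        2.0 * eps[i] - coulomb_exchange_sum(i, j_int, k_int) for i in range(len(h))
    )


def e_elec_11_18(h, j_int, k_int):
    """One-electron energies plus interactions counted once."""
    return sum(
        2.0 * h[i] + coulomb_exchange_sum(i, j_int, k_int) for i in range(len(h))
    )


def e_elec_11_19(h, j_int, k_int):
    """Sum of (orbital energy + H_ii) over occupied MOs."""
    eps = orbital_energies(h, j_int, k_int)
    return sum(eps[i] + h[i] for i in range(len(h)))


# Hypothetical integrals in atomic units, with J_ii = K_ii on the diagonal.
H = [-4.0, -2.5]
J = [[0.9, 0.6], [0.6, 0.7]]
K = [[0.9, 0.1], [0.1, 0.7]]

if __name__ == "__main__":
    eps = orbital_energies(H, J, K)
    print("sum of orbital energies (2 electrons each):", 2.0 * sum(eps))
    print("Eq. (11-17):", e_elec_11_17(H, J, K))
    print("Eq. (11-18):", e_elec_11_18(H, J, K))
    print("Eq. (11-19):", e_elec_11_19(H, J, K))
```

The three expressions agree with each other, while the plain sum of orbital energies lies higher because the repulsive interactions have been counted twice.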


11-8 Basis Sets 


A great deal of research effort has gone into devising and comparing basis sets for 
ab initio calculations. There are essentially two important criteria: 


1. We want a basis set that is capable of describing the actual wavefunction well enough 
to give chemically useful results. 


2. We want a basis set that leads to integrals $F_{ij}$ and $S_{ij}$ that we can evaluate reasonably 
accurately and quickly on a computer. 


Many types of basis set have been examined and two of these have come to dominate 
the area of ab initio molecular calculations. These two, which we refer to as the gaussian 
and the Slater-type-orbital (STO) basis sets, are actually very similar in many important 
respects. 

Let us consider the STO basis set first. The essence of this basis choice is to place 
on each nucleus one or more STOs. The number of STOs on a nucleus and the orbital 
exponent of each STO remain to be chosen. Generally, the larger the number of STOs 
and/or the greater the care taken in selecting orbital exponents, the more accurate the 
final wavefunction and energy will be. 

At the least sophisticated end of the spectrum of choices is the minimal basis set of 
STOs, which we encountered in Chapter 7. This includes only those STOs that correspond 
to occupied AOs in the separated atom limit. If we choose a minimal basis set, 
then we must still decide how to evaluate the orbital exponents in the STOs. One way 
is to use Slater's rules, which are actually most appropriate for isolated atoms. Another 
way is to vary the orbital exponents until the energy of the molecular system is mini- 
mized. This amounts to performing a nonlinear variational calculation along with the 
linear variational calculation. For molecules of more than a few atoms, this procedure 
consumes much computer time, for reasons we will describe shortly, but for small 
molecules (two or three first-row atoms plus a few hydrogens) it is possible to accom- 
plish this task. From this, one discovers what orbital exponent best suits an STO in a 
molecular environment. This leads us to the third way of choosing orbital exponents— 
choose the values that were found best for each type of atom in nonlinear variational 
calculations in smaller molecules. 

One may improve the basis by adding additional STOs to various nuclei. Suppose, 
for example, each carbon 2p AO were represented as a linear combination of two p-type 
STOs, each having a different orbital exponent. An example of the basic principle 
involved is indicated in Fig. 11-1. If we treat these functions independently and do a 
linear variational calculation, they will both be mixed into the final wavefunction to 
some degree. If the linear coefficient for the “inner” STO is much larger, it means that 
the p-type charge cloud around this atom in the molecule is calculated to be fairly con- 
tracted around the nucleus. To describe a more diffuse charge cloud, the wavefunction 
would contain quite a lot of the “outer” STO, and not so much of the “inner” STO. 

Thus, we have a linear variation procedure that, in effect, allows for AO expansion 
and contraction. It is akin to optimizing an orbital exponent, but it does not require 
nonlinear variation. Of course, one still has to choose the values of the “larger $\zeta$” and 
“smaller $\zeta$” of Fig. 11-1. This is normally done by optimizing the fit to very accurate 
atomic wavefunctions or by a nonlinear variation on atoms. A basis set in which every 
minimal basis AO is represented by an "inner-outer" pair of STOs is often referred to 
as a “double-zeta” basis set. 

A further kind of extension is frequently made. In addition to the above types of STO, 
one includes STOs with symmetries different from those present in the minimal basis. 
This has the effect of allowing charge to be shifted in or out of bond regions in new 
ways. For example, one could add p-type STOs on hydrogen nuclei. By mixing this 
with the s-type STOs there, one can describe a skewed charge distribution in the regions 
of the protons. We have already seen (Chapter 7) that a hydrogen atom in a uniform 
electric field is polarized in a way that is reasonably well described by an s-p linear 
combination. Since the hydrogen atom in a molecule experiences an electric field due 
to the remainder of the molecule, it is not surprising that such p functions are indeed 
mixed into the wavefunction by the variational procedure if we provide them in the 
basis set. Similarly, d-type STOs may be added to atoms that, in the minimal basis set, 
carried only s- and p-type STOs. Functions of this nature are often called polarization 
functions because they allow charge polarization to occur within the molecule as a 
result of the internally generated electric field. 

Figure 11-1 » Radial functions $R(r) = r \exp(-\zeta r)$ for 2p-type STOs (curves labeled larger $\zeta$, “inner” STO, and smaller $\zeta$, “outer” STO, plotted against r). The larger $\zeta$ value gives an 
STO more contracted around the nucleus. Hence, it is sometimes called the “inner” STO. 

It should be evident that one could go on indefinitely, adding more and more STOs to 
the basis, even placing some of them in bonds, rather than on nuclei. This is not normally 
done because the computing task goes up enormously as we add more basis functions. 
In fact, the number of integrals to be calculated eventually increases as $N^4$, where N is 
the number of basis functions. The evaluation of integrals can be a logistic bottleneck in 
ab initio calculations, and for this reason nonlinear variations (of orbital exponents) are 
impractical for any but smaller molecules. Each new orbital exponent value requires 
re-evaluation of all the integrals involving that orbital. In essence, a change of orbital 
exponent is a change of basis set. In linear variations, the basis functions are mixed 
together but they do not change. Once all the integrals between various basis functions 
have been evaluated, they are usable for the remainder of the calculation. 
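
The N^4 growth mentioned above can be made concrete with a short counting sketch. It assumes real basis functions, for which a two-electron integral (ij|kl) is unchanged by swapping i with j, k with l, or the pair ij with the pair kl. The function names are illustrative only.

```python
def n_two_electron_integrals(n_basis: int) -> int:
    """Naive count of two-electron integrals (ij|kl): one per index quadruple."""
    return n_basis ** 4


def n_unique_two_electron_integrals(n_basis: int) -> int:
    """Count of distinct (ij|kl) for real basis functions.

    First count unordered index pairs ij, then unordered pairs of pairs.
    """
    n_pairs = n_basis * (n_basis + 1) // 2
    return n_pairs * (n_pairs + 1) // 2


if __name__ == "__main__":
    for n in (2, 10, 50, 100):
        print(n, n_two_electron_integrals(n), n_unique_two_electron_integrals(n))
```

Even with symmetry taken into account, the count still grows roughly as N^4/8, so doubling the basis multiplies the integral work by about sixteen.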

The STO basis would probably be the standard choice if it were not for the fact 
that the many integrals encountered in calculating F_ij elements are extremely time 
consuming to evaluate, even on a computer. This problem has led to the development of 
an alternative basis set class that is based on gaussian-type functions. Gaussian functions 
include an exponential term of the form exp(−αr²). The radial dependence of 
such a function is compared to that for a hydrogenlike 1s function (which is identical to 
a 1s STO) in Fig. 11-2. There are two obvious problems connected with using gaussian 
functions as basis functions: 


1. They do not have cusps at r = 0 as s-type hydrogen-like AOs do. 
2. They decay faster at larger r than do hydrogen-like AOs. 


Both of these deficiencies are relevant in molecules because, at r = 0 (on a nucleus) 
and at r → ∞, the molecular potential is like that in an atom, so similar cusp and asymptotic 
behavior are expected for molecular and atomic wavefunctions. Balanced against 
these deficiencies is an advantage: gaussian functions have mathematical properties 
that make it extremely easy to compute the integrals they produce in F_ij. This has led 
to a practice of replacing each STO in a basis set by a number of gaussian functions. 


Figure 11-2 ► Radial dependence, R(r) versus r, of hydrogen-like and gaussian functions. 

By choosing several values of α in exp(−αr²), one can create a set of “primitive” 
gaussian functions ranging from very compact to very diffuse, and then take a linear 
combination of these to build up an approximation to the radial part of an STO function. 
Multiplication by the standard θ and φ dependences (spherical harmonics) generates 
p, d, etc. functions. Once this approximation is optimized, the linear combination of 
gaussian functions is “frozen,” being treated thereafter as a single function insofar as 
the subsequent molecular variational calculation is concerned. This linear combination 
of primitive gaussian functions is called a contracted gaussian function. 
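
As a hedged illustration of a contracted gaussian, the sketch below compares a 1s STO (ζ = 1) with a three-primitive contraction. The exponents and coefficients are the widely tabulated STO-3G values for ζ = 1; they are quoted here as an assumption and do not come from this text. The helper names and the simple trapezoid integration are illustrative choices.

```python
import math

# Assumed STO-3G parameters for a 1s STO with zeta = 1 (widely tabulated values).
EXPONENTS = (0.109818, 0.405771, 2.22766)
COEFFS = (0.444635, 0.535328, 0.154329)


def sto_1s(r: float, zeta: float = 1.0) -> float:
    """Normalized 1s Slater-type orbital."""
    return math.sqrt(zeta ** 3 / math.pi) * math.exp(-zeta * r)


def primitive(r: float, alpha: float) -> float:
    """Normalized s-type primitive gaussian."""
    return (2.0 * alpha / math.pi) ** 0.75 * math.exp(-alpha * r * r)


def contracted(r: float) -> float:
    """Contracted gaussian: a frozen linear combination of primitives."""
    return sum(c * primitive(r, a) for c, a in zip(COEFFS, EXPONENTS))


def overlap(f, g, r_max: float = 30.0, n: int = 30000) -> float:
    """Integral of f*g over all space for spherical functions (trapezoid rule)."""
    h = r_max / n
    total = 0.0
    for i in range(1, n + 1):          # the integrand vanishes at r = 0
        r = i * h
        weight = 0.5 if i == n else 1.0
        total += weight * r * r * f(r) * g(r)
    return 4.0 * math.pi * h * total


if __name__ == "__main__":
    print("norm of contraction:", overlap(contracted, contracted))
    print("overlap with STO   :", overlap(sto_1s, contracted))
    print("value at nucleus   :", sto_1s(0.0), contracted(0.0))
    print("value at r = 10    :", sto_1s(10.0), contracted(10.0))
```

The contraction overlaps strongly with the STO, yet it is too small at the nucleus (no cusp) and falls off too quickly at large r, which are the two deficiencies listed earlier.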

Once we have a contracted gaussian function corresponding to each STO, we can go 
through the same hierarchy of approximations as before (minimal basis set, double-ζ 
basis set, double-ζ plus polarization functions), only now using contracted gaussian 
functions in place of STOs. Typically, ab initio calculations on systems involving only 
light elements, e.g., H-Ne, involve anywhere from 1 to 15 primitive gaussian functions 
for each contracted gaussian function. Basis sets for heavier elements, however, can 
contain more than 30 primitive gaussians for each contracted gaussian. 

We have already described a certain amount of quantum-chemical jargon. Some of 
the basis set descriptions that one commonly encounters in the modern literature are as 
follows: 


• DZP [double-ζ gaussian basis with polarization] 

• STO-3G [each STO approximated as a linear combination of three gaussian primitives] 

• 6-31G [each inner shell STO represented by a sum of six gaussians and each valence 
shell STO split into inner and outer parts (i.e., double-ζ) described by three and 
one gaussian primitives, respectively] 

• 6-31G* [the 6-31G basis set augmented with six d-type gaussian primitives on each 
heavy (Z > 2) atom, to permit polarization] 

• 6-31G** [same as 6-31G* but with a set of gaussian p-type functions on H and He 
atoms. Good for systems where hydrogen is a bridging atom, as in diborane or in 
hydrogen bonds] 

• 6-31+G* [the 6-31G* basis set augmented with a set of diffuse s- and p-type gaussian 
functions on each heavy atom, to permit representation of diffuse electronic 
distribution, as in anions] 

• cc-pVnZ, n = D, T, Q, 5 [correlation consistent polarized valence n-ζ gaussian basis 
sets. The inner shell STOs are described by single contracted gaussian functions 
while the valence STOs are described by n contracted gaussian functions, n = D 
for double-ζ, n = T for triple-ζ, etc. Both the number and angular momentum 
symmetry type of the polarization functions are increased with each successive 
correlation consistent basis set in a systematic manner. For example, the cc-pVDZ 
basis set has a set of 5 d-type gaussian primitive functions on each heavy atom, while 
the cc-pVTZ basis set has 2 sets of d-type gaussian functions and one set of 7 f-type 
gaussian primitives. These families of basis sets are designed to converge the total 
energy to the complete basis set limit for the SCF method and its extensions.] 

• aug-cc-pVnZ [cc-pVnZ basis sets augmented with one set of diffuse primitive 
gaussian functions for each angular momentum symmetry present in the cc-pVnZ 
basis set, to provide an accurate description of anions and weak interactions, e.g., 
van der Waals forces and hydrogen bonding.] 
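
The bookkeeping implied by these labels can be sketched as a small counting routine. It covers only H and the atoms Li-Ne and only the five Pople-style names listed in the code, and it assumes six-component (cartesian) d sets, as in the 6-31G* entry above. The function names are hypothetical.

```python
FIRST_ROW = {"Li", "Be", "B", "C", "N", "O", "F", "Ne"}
SUPPORTED = {"STO-3G", "6-31G", "6-31G*", "6-31G**", "6-31+G*"}


def atom_counts(symbol: str, basis: str) -> tuple:
    """Return (contracted functions, primitive gaussians) for one atom."""
    if basis not in SUPPORTED:
        raise ValueError(f"unsupported basis: {basis}")
    heavy = symbol in FIRST_ROW
    if not heavy and symbol != "H":
        raise ValueError(f"unsupported atom: {symbol}")
    n_core = 1 if heavy else 0          # 1s core on Li-Ne
    n_valence = 4 if heavy else 1       # 2s, 2px, 2py, 2pz or 1s
    if basis == "STO-3G":
        n = n_core + n_valence
        return n, 3 * n
    contracted = n_core + 2 * n_valence            # valence split in two
    primitives = 6 * n_core + (3 + 1) * n_valence  # 6 / 3 + 1 primitives
    stars = basis.count("*")
    if heavy and stars >= 1:            # six d-type primitives
        contracted += 6
        primitives += 6
    if not heavy and stars == 2:        # p set on hydrogen
        contracted += 3
        primitives += 3
    if heavy and "+" in basis:          # diffuse s and p set
        contracted += 4
        primitives += 4
    return contracted, primitives


def molecule_counts(atoms, basis: str) -> tuple:
    """Sum the per-atom counts over a list of element symbols."""
    pairs = [atom_counts(a, basis) for a in atoms]
    return sum(p[0] for p in pairs), sum(p[1] for p in pairs)


if __name__ == "__main__":
    for name in ("STO-3G", "6-31G", "6-31G*", "6-31G**", "6-31+G*"):
        print(name, molecule_counts(["O", "H", "H"], name))
```

For water this gives 7 basis functions in STO-3G and 25 in 6-31G**, which shows how quickly the basis grows as splitting and polarization are added.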


11-9 The Hartree-Fock Limit 


It should be apparent that different choices of basis set will produce different SCF 
wavefunctions and energies. Suppose that we do an SCF calculation on some molecule, 
using a minimal basis set and obtain a total electronic energy E_1. If we now choose 
a double-ζ basis and do a new SCF calculation, we will obtain an energy E_2 that 
normally will be lower than E_1. (If one happens to choose the first basis wisely and 
the second unwisely, it is possible to find E_2 higher than E_1. We assume here that 
each improvement to the basis extends the mathematical flexibility while including the 
capabilities of all preceding bases.) If we now add polarization functions and repeat 
the SCF procedure, we will find E_3 to be lower than E_2. We can continue in this way, 
adding new functions in bonds and elsewhere, always increasing the capabilities of our 
basis set, but always requiring that the basis describe MOs in a single determinantal 
wavefunction. The electronic energy will decrease with each basis set improvement, but 
eventually this decrease will become very slight for any improvement; that is, the energy 
will approach a limiting value as the basis set approaches mathematical completeness. 
This limiting energy value is the lowest that can be achieved for a single determinantal 
wavefunction. It is called the Hartree-Fock energy. The MOs that correspond to this 
limit are called Hartree-Fock orbitals (HF orbitals), and the determinant is called the 
HF wavefunction. 
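
The lowering of the energy as a basis is extended can be seen in a deliberately simple analog: the hydrogen atom, a one-electron system in which the SCF problem reduces to a linear variation, treated with one and then two s-type gaussians. This is a sketch under stated assumptions, not a molecular SCF calculation. The second exponent (1.3) is an arbitrary choice, and the matrix-element formulas are the standard closed forms for normalized s gaussians on a single center, in atomic units.

```python
import math


def matrix_elements(a: float, b: float, z: float = 1.0) -> tuple:
    """Overlap and hamiltonian elements between normalized s gaussians
    exp(-a r^2) and exp(-b r^2) centered on a nucleus of charge z."""
    p = a + b
    s = (2.0 * math.sqrt(a * b) / p) ** 1.5
    kinetic = 3.0 * a * b / p * s
    attraction = -2.0 * z * math.sqrt(p / math.pi) * s
    return s, kinetic + attraction


def energy_one(a: float) -> float:
    """Energy expectation value for a single gaussian."""
    return matrix_elements(a, a)[1]


def energy_two(a: float, b: float) -> float:
    """Lowest root of det(H - E S) = 0 for a basis of two gaussians."""
    _, h11 = matrix_elements(a, a)
    _, h22 = matrix_elements(b, b)
    s12, h12 = matrix_elements(a, b)
    qa = 1.0 - s12 ** 2
    qb = -(h11 + h22 - 2.0 * h12 * s12)
    qc = h11 * h22 - h12 ** 2
    return (-qb - math.sqrt(qb * qb - 4.0 * qa * qc)) / (2.0 * qa)


if __name__ == "__main__":
    a_best = 8.0 / (9.0 * math.pi)      # optimum exponent for one gaussian
    print("one gaussian :", energy_one(a_best))
    print("two gaussians:", energy_two(a_best, 1.3))
    print("exact        :", -0.5)
```

Because the second basis contains the first, the energy can only go down, and it remains above the exact value of -0.5 hartree.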

Sometimes the term restricted Hartree-Fock (RHF) is used to emphasize that the 
wavefunction is restricted to be a single determinantal function for a configuration 
wherein electrons of α spin occupy the same space orbitals as do the electrons of 
β spin. When this restriction is relaxed, and different orbitals are allowed for electrons 
with different spins, we have an unrestricted Hartree-Fock (UHF) calculation. This 
refinement is most likely to be important when the numbers of α- and β-spin electrons 
differ. We encountered this concept in Section 8-13, where we noted that the unpaired 
electron in a radical causes spin polarization of other electrons, possibly leading to 
negative spin density. 


11-10 Correlation Energy 


The Hartree-Fock energy is not as low as the true energy of the system. The mathematical 
reason for this is that our requirement that ψ be a single determinant is restrictive 
and we can introduce additional mathematical flexibility by allowing ψ to contain many 
determinants. Such additional flexibility leads to further energy lowering. 

There is a corresponding physical reason for the HF energy being too high. It is connected 
with the independence of the electrons in a single determinantal wavefunction. 
To understand this, consider the four-electron wavefunction 

    \psi = |\phi_1(1)\,\bar{\phi}_1(2)\,\phi_2(3)\,\bar{\phi}_2(4)|    (11-22)

Recall from Chapter 5 that the numbers in parentheses stand for the spatial coordinates 
of an electron; that is, φ_1(1) really means φ_1(x_1, y_1, z_1)α(1) or φ_1(r_1, θ_1, φ_1)α(1).¹ In 
other words, if we pick values of r, θ, and φ for each of the four electrons and insert 
them into Eq. (11-22) we will be able to evaluate each function and we will obtain 
a determinant of numbers which can be evaluated to give a numerical value for ψ 
and ψ². The latter number (times dv) can be taken as the probability for finding one 
electron in the volume element around r_1, θ_1, and φ_1, another electron simultaneously 
in dv_2 at r_2, θ_2, and φ_2, etc. The important point to notice is that the effect on ψ² of 
a particular choice of r_1, θ_1, and φ_1 is not dependent on choices of r, θ, φ for other 
electrons because the form of the wavefunction is products of functions of independent 
coordinates. Physically, this corresponds to saying that the probability for finding an 
electron in dv_1 at some instant is not influenced by the presence or absence of another 
electron in some nearby element dv_2 at the same instant. This is consistent with the 
fact that the Fock operator F̂ [Eq. (11-7)] treats each electron as though it were moving 
in the time-averaged potential field due to the other electrons. 

Because electrons repel each other, there is a tendency for them to keep out of 
each other's way. That is, in reality, their motions are correlated. The HF energy 
is higher than the true energy because the HF wavefunction is formally incapable of 
describing correlated motion. The energy difference between the HF and the "exact" 
(for a simplified nonrelativistic hamiltonian) energy for a system is referred to as the 
correlation energy. 
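
A small numerical illustration of this definition, using the helium atom: the two energies below are commonly quoted literature values (Hartree-Fock limit and nonrelativistic exact energy, in hartrees) and are not taken from this text. The conversion factor and the names are assumptions made for the sketch.

```python
HARTREE_TO_EV = 27.2114   # approximate conversion factor

# Commonly quoted literature values for He, in hartrees (assumed here).
E_HF_HE = -2.86168        # Hartree-Fock limit
E_EXACT_HE = -2.90372     # exact nonrelativistic energy


def correlation_energy(e_exact: float, e_hf: float) -> float:
    """Correlation energy: exact (nonrelativistic) energy minus HF energy."""
    return e_exact - e_hf


if __name__ == "__main__":
    e_corr = correlation_energy(E_EXACT_HE, E_HF_HE)
    print("E_corr (hartree):", e_corr)
    print("E_corr (eV)     :", e_corr * HARTREE_TO_EV)
    print("fraction of total energy:", e_corr / E_EXACT_HE)
```

The correlation energy is a small fraction of the total energy, but at roughly 1 eV it is comparable to the energy changes of chemical interest.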


11-11 Koopmans' Theorem 


Despite the fact that the total electronic energy is not given by the sum of SCF one-electron 
energies, it is still possible to relate the ε_i's to physical measurements. If 
certain assumptions are made, it is possible to equate orbital energies with molecular 
ionization energies or electron affinities. This identification is related to a theorem due 
to Koopmans. 

Koopmans [1] proved² that the wavefunction obtained by removing one electron 
from φ_k, or adding one electron to the virtual (i.e., unoccupied) MO φ_j, in a Hartree-Fock 
wavefunction is stable with respect to any subsequent variation in φ_k or φ_j. 
Notice that this ignores the question of subsequent variation of all of the MOs φ with 
unchanged occupations. It is not necessarily true that they remain optimized, since 
the potential they experience is changed by addition or removal of an electron. Nevertheless, 
Koopmans' theorem suggests a model. It suggests that we approximate the 
wavefunction for a positive ion by removing an electron from one of the occupied HF 
MOs for a neutral molecule without reoptimizing any of the MOs. Let us do this and 
compare the electronic energies for the two wavefunctions. 

For the neutral molecule, which we assume is a closed-shell system, 

    E = \sum_i \left[ 2H_{ii} + \sum_j (2J_{ij} - K_{ij}) \right]    (11-23)

¹ Note that φ_1 in parentheses represents a coordinate of electron 1, whereas φ_1 outside the parentheses represents 
an MO. 
² See also Smith and Day [2]. 

For the cation, produced by removing an electron from φ_k, 

    E_k^+ = \sum_{i \neq k} \left[ 2H_{ii} + \sum_{j \neq k} (2J_{ij} - K_{ij}) \right] + H_{kk} + \sum_{i \neq k} (2J_{ik} - K_{ik})    (11-24)

The first sum in Eq. (11-24) gives the total electronic energy due to all but the unpaired 
electron in φ_k. H_kk gives the kinetic and nuclear attraction energies for the unpaired 
electron and the final sum gives the repulsion and exchange energy between this electron 
and all the others. Now we note that the last sum is exactly equal to the void produced in 
the first sum due to the restriction j ≠ k. Therefore, we can combine these by removing 
the index restriction and deleting the last sum. This gives 

    E_k^+ = \sum_{i \neq k} \left[ 2H_{ii} + \sum_j (2J_{ij} - K_{ij}) \right] + H_{kk}    (11-25)

To compare this with E of (11-23) we should remove the remaining index restriction. 
We do this by allowing i to equal k in the sum and simultaneously subtracting the new 
terms thus produced: 

    E_k^+ = \sum_i \left[ 2H_{ii} + \sum_j (2J_{ij} - K_{ij}) \right] - H_{kk} - \sum_j (2J_{kj} - K_{kj})    (11-26)

But, by virtue of Eqs. (11-15) and (11-23), this is 

    E_k^+ = E - \epsilon_k    (11-27)

Hence, the ionization energy I_k^0 for ionization from φ_k is 

    I_k^0 = E_k^+ - E = -\epsilon_k    (11-28)


This illustrates that, within the context of this simplified model, the negative of the 
orbital energies for occupied HF MOs are to be interpreted as ionization energies. 

Another way to see the relation between I_k^0 and −ε_k is to recognize that the physical 
interactions lost upon removal of an electron from φ_k are precisely those that constitute 
ε_k. [See Eq. (11-15).] 

A similar result holds for orbital energies of unoccupied HF MOs and electron 
affinities. (However, this is less successful in practice; see Problem 11-3.) 

In actuality, the relation (11-28) is only approximately obeyed. One reason for this 
has to do with our assumption that doubly occupied SCF MOs produced by a variational 
procedure on the neutral molecule will be suitable for the doubly occupied MOs of the 
cation as well. These MOs minimize the energy of the neutral molecule but give an 
energy for the cation that is higher than what would be produced by an independent 
variational calculation. For this mathematical reason, we expect the Koopmans' theorem 
prediction for the ionization energy to be higher than the value predicted by taking the 
difference between separate SCF calculations on the molecule and cation (which we 
will symbolize ΔSCF). The corresponding physical argument is that use of Eq. (11-28) 
views ionization as removal of an electron without any reorganization of the remaining 
electronic charge. This neglects a process that stabilizes the cation and lowers the 
ionization energy. Whichever argument we choose, we have here a reason for expecting 
−ε_k to be an overestimate of the value obtained by independent calculations, ΔSCF. 

TABLE 11-1 ► Ionization Energies (in electron volts) of Water as 
Measured Experimentally and as Predicted from SCF Calculations 

                              SCF (near HF limit)^b 
Cation state    Observed^a    Koopmans    ΔSCF 
²B₁             12.62         13.79       11.08 
²A₁             14.74         15.86       13.34 
²B₂             18.51         19.47       17.61 

^a From Potts and Price [3]. 
^b From Dunning et al. [4]. 

Another error results from the neglect of change in correlation energy. We have seen 
that the total SCF energy for the molecule is too high because the single determinantal 
form of the wavefunction cannot allow for correlated electronic motion. The SCF energy 
for the cation is too high for the same reason, but the error is different for the two 
cases because there are fewer electrons in the cation. We expect the neutral molecule 
to have the greater correlation energy (since it has more electrons)³ so that proper 
inclusion of this feature would lower the energy of the neutral molecule more than the 
cation, making the true I_k^0 larger than that obtained by neglect of correlation. Hence, 
this leads us to expect ΔSCF to underestimate I_k^0. Since −ε_k overestimates ΔSCF, and 
ΔSCF underestimates the ionization energy, we can expect some cancellation of errors 
in using Eq. (11-28). 

An illustration of these relations is provided in Table 11-1, where observed vertical 
ionization energies (i.e., no nuclear relaxation), the appropriate values of −ε, and the 
values of ΔSCF are compared. 
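
The trends in Table 11-1 can be checked with a few lines of arithmetic. The numbers are those of the table; the state labels ²B₁, ²A₁, ²B₂ are written here as plain strings, and the function name is illustrative.

```python
# (observed, Koopmans, delta-SCF) ionization energies of water in eV, Table 11-1.
TABLE_11_1 = {
    "2B1": (12.62, 13.79, 11.08),
    "2A1": (14.74, 15.86, 13.34),
    "2B2": (18.51, 19.47, 17.61),
}


def errors(observed: float, koopmans: float, delta_scf: float) -> tuple:
    """Signed errors (prediction minus observation) of the two SCF estimates."""
    return koopmans - observed, delta_scf - observed


if __name__ == "__main__":
    for state, values in TABLE_11_1.items():
        err_koopmans, err_delta_scf = errors(*values)
        print(f"{state}: Koopmans {err_koopmans:+.2f} eV, "
              f"delta-SCF {err_delta_scf:+.2f} eV")
```

In every row the Koopmans value lies above the observed ionization energy and the ΔSCF value lies below it, as the two arguments above lead one to expect.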


11-12 Configuration Interaction 


There are several techniques for going beyond the SCF method and thereby including 
some effects of electron correlation. Some extremely accurate calculations on small 
atoms and molecules, making explicit use of interparticle coordinates, were described 
in Section 7-8. There is one general technique, however, that has traditionally been used 
for including effects of correlation in many-electron systems. This technique is called 
configuration interaction (CI). 

The mathematical idea of CI is quite obvious. Recall that we restricted our SCF 
wavefunction to be a single determinant for a closed-shell system. To go beyond the 
optimum (restricted Hartree-Fock) level, then, we allow the wavefunction to be a linear 
combination of determinants. Suppose we choose two determinants D_1 and D_2, each 
corresponding to a different orbital occupation scheme (i.e., different configurations). 
Then we can let 

    \psi = c_1 D_1 + c_2 D_2    (11-29)

and minimize Ē as a function of the linear mixing coefficients c_1 and c_2. 

³ This reasoning is rather naive. Significant correlation energy contribution can result from a small energy-level 
separation between filled and empty MOs (rather than from merely the number of electrons), but production of a 
cation should normally increase this gap and lead to reduced correlation. 

If we go through the mathematical formalism and express Ē as ⟨ψ|Ĥ|ψ⟩/⟨ψ|ψ⟩, 
expand this as integrals over D_1 and D_2, and require ∂Ē/∂c_i = 0, we obtain the same 
sort of 2 × 2 determinantal equation that we find when minimizing an MO energy as a 
function of mixing of two AOs. That is, we obtain 

    \begin{vmatrix} H_{11} - E S_{11} & H_{12} - E S_{12} \\ H_{12} - E S_{12} & H_{22} - E S_{22} \end{vmatrix} = 0    (11-30)

where now 

    H_{ij} = \langle D_i | \hat{H} | D_j \rangle    (11-31)
    S_{ij} = \langle D_i | D_j \rangle    (11-32)

We see that, whereas before we might have had two AOs interacting to form two 
MOs, here we have two configurations (i.e., two determinantal functions) interacting to 
form two approximate wavefunctions. Our example involves only two configurations, 
but there is no limit to the number of configurations that can be mixed in this way. 
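
For two orthonormal configurations (S_ij = δ_ij, the usual case in practice) the 2 × 2 determinantal equation can be solved in closed form. The sketch below does so; the function name and the numerical matrix elements in the usage lines are hypothetical values chosen only for illustration.

```python
import math


def two_configuration_ci(h11: float, h22: float, h12: float) -> tuple:
    """Solve the 2x2 CI problem for orthonormal determinants.

    Returns (lower root, upper root, (c1, c2)), where c1 and c2 are the
    normalized mixing coefficients of the lower-energy wavefunction.
    """
    mean = 0.5 * (h11 + h22)
    root = math.hypot(0.5 * (h22 - h11), h12)
    e_low, e_high = mean - root, mean + root
    if h12 == 0.0:
        # No interaction: the lower root is simply the lower diagonal element.
        coeffs = (1.0, 0.0) if h11 <= h22 else (0.0, 1.0)
    else:
        # From (h11 - E) c1 + h12 c2 = 0, take (c1, c2) proportional to (h12, E - h11).
        c1, c2 = h12, e_low - h11
        norm = math.hypot(c1, c2)
        coeffs = (c1 / norm, c2 / norm)
    return e_low, e_high, coeffs


if __name__ == "__main__":
    # Hypothetical matrix elements in arbitrary energy units.
    e_low, e_high, (c1, c2) = two_configuration_ci(-1.0, 0.0, 0.2)
    print("roots:", e_low, e_high)
    print("coefficients of lower state:", c1, c2)
```

Any nonzero H_12 pushes the lower root below H_11, which is the CI analog of two AOs interacting to form a bonding MO.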

Since each configuration D contains products of MOs, each of which is typically a 
sum of AOs, the integrals H_ij and S_ij can result in very large numbers of integrals over 
basis functions when they are expanded. This is the sort of situation where a computer 
is essential, and CI calculations on atoms and molecules, while still expensive compared 
to SCF, have become routine on modern computers. 

Our purpose in this chapter is not to describe how to carry out a CI calculation, but 
rather to convey what a CI calculation is and what its predictive capabilities are. Therefore, 
we will not concern ourselves with the mathematical complexities of evaluating 
H_ij and S_ij.⁴ But we will consider one practical aspect of CI calculations, namely, how 
one goes about choosing which configurations should be mixed together, and which 
ones may be safely ignored. 

We begin by considering the H2 molecule. The LCAO-MO-SCF method expresses 
the ground state wavefunction for H2 as 

    \psi(1, 2) = \begin{vmatrix} 1\sigma_g(1)\alpha(1) & 1\sigma_g(2)\alpha(2) \\ 1\sigma_g(1)\beta(1) & 1\sigma_g(2)\beta(2) \end{vmatrix}    (11-33)

that is, as the configuration 1σ_g². The SCF procedure mixes the AO basis functions 
together in the optimum way to produce the 1σ_g MO. 

We have noted at several points in this book that, if one begins with a basis set of n 
linearly independent functions, one ultimately arrives at n independent MOs. Hence, 
the 1σ_g MO of Eq. (11-33) is but one of several MOs produced by the SCF procedure. 
It is called an occupied MO because it is occupied with electrons in this configuration. 
All the other MOs in this case are unoccupied or virtual MOs. The virtual MOs of 
H2 have symmetry properties related to the molecular hamiltonian, just as does the 
occupied MO. Thus, we can refer to 1σ_u, 2σ_g, 2σ_u, 1π_u, 1π_g, etc., virtual MOs of H2. 
Which of these virtual MOs are produced by an SCF calculation depends on the number 
and nature of the AO basis set provided at the outset. If no π-type AOs are provided, 
no π-type MOs will be produced. If only a minimal basis (1s_A and 1s_B) is provided, 
1σ_u will be the only virtual MO produced. 

⁴ In most actual calculations, the D's are orthonormal, and S_ij = δ_ij. 

It is important to distinguish between the physical content of occupied versus virtual 
SCF MOs. The SCF procedure finds the set of occupied MOs for a system leading to the 
lowest SCF electronic energy. The virtual orbitals are the residue of this process. The 
virtual MOs span that part of the basis set function space that the SCF procedure found 
least suitable for describing ψ_0. The subspace is sometimes referred to as the orthogonal 
complement of the occupied orbital subspace. (Note that this situation differs from that 
pertaining to Hückel-type calculations, where MOs and energy levels are calculated 
without regard for electron occupancy. Only after the variational procedure are electrons 
added.) 

Our concern with virtual MOs is due to the fact that they provide a ready means for 
constructing new configurations to mix with our 1σ_g² configuration for H2. Thus, using 
some of the above-mentioned virtual MOs, we could write determinantal functions 
corresponding to the excited configurations 1σ_g1σ_u, 1σ_g2σ_g, 1σ_g2σ_u, 1σ_g1π_u, etc.⁵ These 
are commonly referred to as singly excited configurations because one electron has 
been promoted from a ground-state-occupied MO to a virtual MO. (This is not meant 
to imply that the orbital energy difference is equal to the expected spectroscopic energy 
of the transition.) It is also possible to construct doubly excited configurations, such as 
1σ_u², 1σ_u2σ_g, 2σ_g², 1σ_u2σ_u, 1σ_u1π_u, etc. For systems having more electrons, one can 
write determinants corresponding to triple, quadruple, etc., excitations. If one has a 
reasonably large number, say 50, of virtual orbitals and, say, 10 electrons to distribute 
among them, then there is an enormous number of possible configurations. A major 
step in doing a CI calculation is deciding which configurations might be important in 
affecting the results and ought therefore to be included. 
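
How enormous that number is can be estimated with binomial coefficients. The sketch below counts determinants at each excitation level for 10 electrons in 10 occupied spin orbitals and 100 virtual spin orbitals (50 virtual space orbitals, each with two spins). It ignores all spin and symmetry restrictions, so it gives an upper bound rather than the number of configurations a real calculation would keep. The function name is illustrative.

```python
from math import comb


def n_excited_determinants(n_occ: int, n_virt: int, level: int) -> int:
    """Determinants with `level` electrons promoted from occupied to virtual
    spin orbitals, with no spin or symmetry restrictions."""
    return comb(n_occ, level) * comb(n_virt, level)


if __name__ == "__main__":
    n_occ, n_virt = 10, 100
    total = 0
    for level in range(n_occ + 1):
        count = n_excited_determinants(n_occ, n_virt, level)
        total += count
        print(f"{level:2d}-fold excitations: {count}")
    print("all determinants:", total)
```

Singles and doubles number only in the hundreds of thousands here, while the full list runs to tens of trillions, which is why the choice of configurations matters so much.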

⁵ As was shown in Chapter 5, the symmetry requirements of the wavefunction require that each of these open 
shell configurations be expressed as a linear combination of two 2 × 2 determinants; for example, 1σ_g2σ_u stands 
for the combination 

    (1/\sqrt{2}) \{ |1\sigma_g(1)\,2\bar{\sigma}_u(2)| \pm |1\bar{\sigma}_g(1)\,2\sigma_u(2)| \}

We can gain insight into this problem by considering our minimal basis set H2 
problem in more detail. We have 

    1\sigma_g = N_g (1s_A + 1s_B)    (11-34)
    1\sigma_u = N_u (1s_A - 1s_B)    (11-35)

where N_g and N_u are normalization constants. The spatial part of the ground 
configuration is 

    \psi_{space} = 1\sigma_g(1)\,1\sigma_g(2)    (11-36)

which expands to 

    \psi_{space} = N_g^2 [1s_A(1)1s_A(2) + 1s_B(1)1s_B(2) + 1s_A(1)1s_B(2) + 1s_B(1)1s_A(2)]


If both electrons are near nucleus A, the first term is quite large. This may be 
rephrased to say that ψ² gives a sizable probability for finding both electrons near 
nucleus A. The second term gives a similar likelihood for finding both electrons near B. 
These two terms are referred to as ionic terms because they become large whenever the 
instantaneous electronic dispositions correspond to H_A⁻H_B⁺ and H_A⁺H_B⁻, respectively. 
The last two terms cause ψ² to be sizable whenever an electron is near each nucleus. 
Hence, these are called covalent terms, and their presence means that ψ contains 
significant “covalent character.” In fact, because all four terms have the same coefficient, 
the configuration 1σ_g² is said to have 50% covalent and 50% ionic character. 

Is this bad? It turns out to be no problem at all when the nuclei are close together. 
Indeed, in the united-atom (helium) limit, the ionic-covalent distinction vanishes. But 
at large internuclear separations it is very inaccurate to describe H2 as 50% ionic. In 
reality, H2 dissociates to two neutral ground state H atoms, that is, 100% “covalent,” 
with an electron near each nucleus. In short, the SCF-MO description does not properly 
describe the molecule as it dissociates. This means that the calculation of E versus 
R_AB for H2 will deviate from experiment more and more as R_AB increases. This defect 
in the SCF treatment of H2 occurs for many other molecular species also. 

Can we correct this defect through use of CI? We ask the question this way: “What 
configuration could we mix with 1σ_g² in order to make the mixture of covalent and ionic 
character variable?” Since 1σ_g² expands to give us covalent and ionic terms of the same 
sign, we need an additional configuration that will give them with opposite sign. Then 
admixture of the two configurations will affect the two kinds of term differently. The 
configuration that will accomplish this is 1σ_u²: 

    1\sigma_u(1)\,1\sigma_u(2) = N_u^2 [1s_A(1)1s_A(2) + 1s_B(1)1s_B(2) - 1s_A(1)1s_B(2) - 1s_B(1)1s_A(2)]    (11-37)


Mixing these two configurations together gives 

    \psi(1, 2) = c_1\,1\sigma_g(1)\,1\sigma_g(2) + c_2\,1\sigma_u(1)\,1\sigma_u(2)
               = (c_1 N_g^2 + c_2 N_u^2) [1s_A(1)1s_A(2) + 1s_B(1)1s_B(2)]
               + (c_1 N_g^2 - c_2 N_u^2) [1s_A(1)1s_B(2) + 1s_B(1)1s_A(2)]    (11-38)

If c_1/c_2 is readjusted at each value of R_AB to minimize E, it is evident that the 
relative weights of covalent and ionic character in Eq. (11-38) will change to suit 
the circumstances. Actual calculations on this system show that, as R_AB gets large, 
c_1/c_2 approaches a value such that c_1N_g² + c_2N_u² approaches zero, so that the ionic 
component of ψ vanishes. 
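
The ionic and covalent coefficients of Eq. (11-38) are easy to tabulate once the normalization constants are written in terms of the AO overlap S = ⟨1s_A|1s_B⟩, namely N_g² = 1/[2(1 + S)] and N_u² = 1/[2(1 − S)]. The sketch below assumes normalized, real 1s AOs. The function names and the sample overlap of 0.5 are illustrative.

```python
def ionic_covalent_coefficients(c1: float, c2: float, overlap: float) -> tuple:
    """Coefficients of the ionic and covalent brackets in Eq. (11-38)."""
    ng2 = 1.0 / (2.0 * (1.0 + overlap))   # N_g squared
    nu2 = 1.0 / (2.0 * (1.0 - overlap))   # N_u squared
    return c1 * ng2 + c2 * nu2, c1 * ng2 - c2 * nu2


def ratio_for_pure_covalent(overlap: float) -> float:
    """Value of c2/c1 at which the ionic coefficient vanishes."""
    return -(1.0 - overlap) / (1.0 + overlap)


if __name__ == "__main__":
    s = 0.5
    print("SCF only      :", ionic_covalent_coefficients(1.0, 0.0, s))
    print("purely covalent:", ionic_covalent_coefficients(
        1.0, ratio_for_pure_covalent(s), s))
    print("c2/c1 as S -> 0:", ratio_for_pure_covalent(0.0))
```

With c_2 = 0 the two coefficients are equal (the 50:50 SCF mixture). As the overlap goes to zero at large R_AB, removing the ionic terms requires c_2 = −c_1, an equal-weight mixture of the two configurations.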

This example illustrates that CI of this sort has an associated physical picture. 
It suggests that, in any CI calculation involving the dissociation (or extensive stretching) 
of a covalent bond, important configurations are likely to include double excitations 
into the antibonding virtual “mates” of occupied bonding MOs. 

What about other configurations for H2? What will 1σ_g2σ_g do for the calculation, 
assuming now an extended basis set has produced a 2σ_g MO? Suppose we take as our 
trial function 

    \psi = c_1\,1\sigma_g^2 + c_2\,1\sigma_g 2\sigma_g    (11-39)

where the configurations are understood to stand for determinants. If the 1σ_g MO has 
been produced by an SCF calculation on the ground state, and 2σ_g is a virtual MO from 
that SCF calculation, then it is possible to show that the CI energy minimum occurs 
when c_2 in Eq. (11-39) is zero. In other words, these determinants will not mix when 
they are combined in this way. An equivalent statement is that the mixing element 
H_12 = ⟨1σ_g²|Ĥ|1σ_g2σ_g⟩ vanishes. Hence, the CI determinant (11-30) is already in 
diagonal form, and no variational mixing will occur. This is an example of Brillouin's 
theorem, which may be stated as follows: 


EXAMPLE 11-1 If D_1 is an optimized single determinantal function and D_j is a 
determinant corresponding to any single excitation out of an orbital φ_i occupied in 
D_1 and into the virtual subspace (orthogonal complement) of D_1, then no improvement 
in energy is possible by taking ψ = c_1 D_1 + c_2 D_j. 

The proof of Brillouin's theorem is very simple. We start with a basis set that 
spans a function space. An SCF calculation is performed, which produces the best 
single-determinantal wavefunction we can possibly get within this function space. 
This is D_1. D_j differs from D_1 in only one orbital, which means they differ in only 
one row. A general property of determinants is that, if two of them differ in only 
one row or column, any linear combination of the two can be written as a single 
determinant (see Problem 11-4). This means that any combination c_1 D_1 + c_2 D_j 
is still expressible as a single determinant. Since D_j makes no use of functions 
outside our original basis set, c_1 D_1 + c_2 D_j is a single determinant within our 
original function space. However, D_1 is already known to be the single determinant 
within this function space that gives the lowest energy, and c_1 D_1 + c_2 D_j cannot 
do better. QED. 
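
The determinant property used in the proof can be checked numerically. The sketch below builds two 3 × 3 determinants that differ only in their middle row and confirms that their linear combination equals the single determinant whose middle row is the same linear combination of the two rows. The matrices and coefficients are arbitrary integers chosen for illustration.

```python
def det3(m) -> int:
    """Determinant of a 3x3 matrix by cofactor expansion along the first row."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def with_row(m, index: int, row):
    """Copy of matrix m with the row at `index` replaced by `row`."""
    return [list(row) if k == index else list(r) for k, r in enumerate(m)]


def combine(c1: int, row_a, c2: int, row_b):
    """Elementwise linear combination c1*row_a + c2*row_b."""
    return [c1 * x + c2 * y for x, y in zip(row_a, row_b)]


if __name__ == "__main__":
    base = [[1, 2, 3], [0, 0, 0], [7, 8, 10]]
    row_a, row_b = [4, 5, 6], [2, -1, 5]
    c1, c2 = 3, -2
    lhs = c1 * det3(with_row(base, 1, row_a)) + c2 * det3(with_row(base, 1, row_b))
    rhs = det3(with_row(base, 1, combine(c1, row_a, c2, row_b)))
    print(lhs, rhs, lhs == rhs)
```

The same check fails if the two determinants differ in two rows, which is why the argument does not extend to double excitations.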


A doubly excited configuration differs from D_1 in two rows, and mixing such 
a configuration with D_1 produces a result that cannot be expressed as a single 
determinant. 

Because of Brillouin's theorem, one might decide to omit all single excitations from 
CI calculations. But it is important to recognize that singly excited configurations can 
affect the results of CI calculations in the presence of doubly excited configurations. 
This comes about because nonzero mixing elements can occur between singly and 
doubly excited configurations in the CI determinant. To illustrate, let ψ_0 be an SCF 
single determinant, ψ_1 be a singly excited configuration, and ψ_2 be a “double.” Then 
the CI determinant could be, assuming orthogonal determinants, 

    \begin{vmatrix} H_{00} - E & 0 & H_{02} \\ 0 & H_{11} - E & H_{12} \\ H_{02} & H_{12} & H_{22} - E \end{vmatrix} = 0    (11-40)

The zeros result from Brillouin’s theorem. However, H_12 does not necessarily vanish, 
and solution of this 3 × 3 determinantal equation leads to a wavefunction of the form 

    \psi = c_0 \psi_0 + c_1 \psi_1 + c_2 \psi_2    (11-41)

with c_1 not zero. ψ_1 comes in on the coattails of ψ_2 and is referred to as a second-order 
correction. This is not a guarantee that it will be unimportant, however. 
(See Example 7-4 for similar behavior in a different context.) 

Another rule that is useful for recognizing configurations that may be omitted is 
the rather obvious one that each configuration must share the same set of eigenvalues 
for operators commuting with the hamiltonian. That is, if ψ is to be associated 
with a particular symmetry, angular momentum, spin angular momentum, etc., then 
each configuration in ψ must have that same symmetry, angular momentum, etc. This 
means that, for the ground state of H2, 1σ_g² will not mix with 1σ_g1σ_u because the 
latter has overall u symmetry. 1σ_u2σ_u could contribute, but the symmetrized combination 
corresponding to the singlet state (|1σ_u 2σ̄_u| − |1σ̄_u 2σ_u|) must be used rather than 
the (positive) triplet state combination. The configuration 1σ_u1π_u will not contribute 
because it has the wrong total angular momentum. 

Even with the aid of all these rules, a calculation on a molecule such as N2 or O2 
using a reasonably extended basis set gives rise to an enormous number of possible 
configurations. Additional rules of thumb have been found to help choose the major 
configurations. It has been found, for example, that triply or higher excited configura- 
tions are usually of lesser importance than doubly excited configurations. [Since the 
hamiltonian contains only one- and two-electron operators, interaction elements must 
vanish between the ground-state configuration and all triply or higher-excited configu- 
rations. But, like singly excited configurations, these can, in principle, come in on the 
coattails of doubly (or other) excited configurations.] In addition, a study of the energy 
change in some process involving primarily the valence electrons (e.g., stretching N2) 
really does not require calculation of the correlation energy of the 1s electrons since 
they are fairly unaffected by the change. Any correlation energy for these electrons 
tends to cancel itself when initial and final state energies are subtracted. Therefore, in a 
CI calculation of such a process, it is reasonable to omit configurations corresponding 
to excitation of a 1s electron unless high accuracy is desired. 

The acronym CID refers to a CI calculation in which only all doubly excited con- 
figurations are included. Inclusion of all singly and doubly excited configurations is 
referred to as a CISD calculation. Full CI (FCI) means all excited configurations have 
been included, and this is the limit that gives all of the correlation energy within the 
chosen basis set. The combination of full CI and a complete basis gives the exact energy 
(generally nonrelativistic and within the Born-Oppenheimer approximation). 


11-13 Size Consistency and the Møller-Plesset 
and Coupled Cluster Treatments of Correlation 


Whenever certain parts of a well-defined procedure are omitted, as when full CI is trun- 
cated to CID or CISD, one must consider whether systematic errors are introduced. This 
is indeed the case in the above example. Suppose CID calculations are made for the 
energy of N$_2$ as a function of internuclear distance. At short distances, we treat the
system as a 14-electron molecule, including configurations in which 12 of the electrons
are in their HF-occupied MOs. At very large distances we have two nitrogen atoms,
which we normally treat as having twice the energy of one atom. Now CID on atom A
includes the HF configuration, $D_0^A$, as well as doubly excited configurations in which
five of the seven electrons are in their HF-occupied AOs. Let $D_2^A$ represent this class
of configuration. Then $\psi^A = c_0 D_0^A + c_2 D_2^A$. Atom B has a similar CID wavefunction:
$\psi^B = c_0 D_0^B + c_2 D_2^B$. The wavefunction for the overall, noninteracting system is
the antisymmetrized product of these wavefunctions. It will contain terms like $D_0^A D_0^B$,
$D_2^A D_0^B$, and $D_0^A D_2^B$. There are no terms present corresponding to a single excitation
at each atom, $D_1^A D_1^B$, and such terms would be present in a CID treatment of the
combined system. Also, there is a class of terms present corresponding to four promoted
electrons, $D_2^A D_2^B$, and these terms would not be present in a CID treatment of the
combined system. The dilemma is that, if we treat the system as a single 14-electron
"molecule," which is appropriate at small R, we mix in different terms than if we treat it
as two separate atoms, which is appropriate at large R. If we choose some fairly large R
value to redefine N$_2$ as two separate atoms, we change the nature of the CI in a
discontinuous way at an arbitrary point. This feature of truncated CI is called the problem of size
consistency; CID and CISD methods are not size consistent. Doing separate calculations
on each of two separated atoms and combining the energies yields a different result from
doing a calculation on one system made up of two separated atoms.
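
The size-consistency failure of CID can be seen in a few lines with a toy model. The sketch below is not the N$_2$ calculation discussed above: it assumes each noninteracting fragment has only a reference state and one doubly excited state, with hypothetical values for the excitation energy (`2*delta`) and the coupling (`k`). CID on the pair omits the product of two double excitations (a quadruple excitation), and its energy no longer equals twice the fragment energy.

```python
import numpy as np

# Toy model (an assumption for illustration): each fragment has a reference
# state |0> at energy 0 and one doubly excited state |2> at energy 2*delta,
# coupled to the reference by the matrix element k. Values are hypothetical.
delta, k = 0.5, 0.1  # hartree

h = np.array([[0.0, k],
              [k, 2.0 * delta]])
e_fragment = np.linalg.eigvalsh(h)[0]  # CID equals full CI for one fragment

# Two noninteracting fragments in the product basis |00>, |02>, |20>, |22>.
eye = np.eye(2)
h_pair = np.kron(h, eye) + np.kron(eye, h)
e_pair_full = np.linalg.eigvalsh(h_pair)[0]

# CID on the combined system drops |22>, a quadruple excitation from |00>.
h_pair_cid = h_pair[:3, :3]
e_pair_cid = np.linalg.eigvalsh(h_pair_cid)[0]

size_consistency_error = e_pair_cid - 2.0 * e_fragment
print(f"2 x E(fragment, CID) = {2.0 * e_fragment:.6f}")
print(f"E(pair, full CI)     = {e_pair_full:.6f}")
print(f"E(pair, CID)         = {e_pair_cid:.6f}")
print(f"size-consistency error = {size_consistency_error:.6f}")
```

In this model the full CI energy of the pair is exactly additive, while the CID energy of the pair lies above twice the fragment energy; the gap is the contribution of the omitted $D_2^A D_2^B$-type term.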

A correlation method that is size consistent has been developed by Pople and
co-workers.6 It is based on perturbation theory that was introduced many years ago
by Møller and Plesset.7 This approach divides the process of treating correlation into
a series of corrections to an unperturbed starting point. If one chooses to do such a
calculation to, say, third order (MP3, standing for Møller-Plesset to third order), then
the set of configurations to be included is determined by the perturbation formulas. It
does not require further decision by the person doing the calculation and can be wholly
managed by a computer program. Møller-Plesset perturbation theory is different from
standard CI in at least two important respects: It is size consistent, and it is not varia- 
tionally bound. One cannot assume, therefore, that going to higher and higher orders 
of perturbation will cause the calculated energy to approach closer and closer to the 
true energy from above. 

Because of the way MP theory defines the unperturbed system, the starting point
energy (MP0) is the sum of HF one-electron energies. The first-order correction to
the energy (MP1) brings in the appropriate electronic coulomb and exchange integrals,
giving the correct HF energy. MP2 brings in contributions wherein doubly excited
configurations "interact with" (i.e., occur in the same integral with) the ground
configuration. MP3 adds contributions due to doubly excited configurations interacting
with each other. MP4 brings in interactions involving also single, double, triple, and 
quadruple excitations. The selection of interaction terms by the perturbation formalism 
is what produces size consistency, but it leaves out certain terms at each level that would 
be included in a variational calculation. 
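
For orientation, the lowest-order Møller-Plesset energies can be written compactly as below. This is the standard textbook form for a single-determinant HF reference expressed in spin orbitals, given here as a sketch rather than as a formula from this chapter: the index convention (i, j occupied; a, b virtual), the orbital energies $\epsilon$, and the antisymmetrized two-electron integrals $\langle ij \| ab \rangle$ are notation assumed for this summary, and nuclear repulsion is left out.

```latex
\begin{aligned}
E^{(0)} &= \sum_{i}^{\mathrm{occ}} \epsilon_i \\
E^{(0)} + E^{(1)} &= \sum_{i}^{\mathrm{occ}} \epsilon_i
   - \frac{1}{2} \sum_{i,j}^{\mathrm{occ}} \langle ij \| ij \rangle = E_{\mathrm{HF}} \\
E^{(2)} &= \frac{1}{4} \sum_{i,j}^{\mathrm{occ}} \sum_{a,b}^{\mathrm{virt}}
   \frac{|\langle ij \| ab \rangle|^{2}}{\epsilon_i + \epsilon_j - \epsilon_a - \epsilon_b}
\end{aligned}
```

Each term in $E^{(2)}$ pairs the ground configuration with one doubly excited configuration, and every denominator is negative, so the second-order correction lowers the energy.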

In coupled cluster (CC) approaches, which are also size consistent and generally 
not variationally bound, instead of including all configurations to a particular order as 


6See Binkley and Pople [5].
7See Møller and Plesset [6].
8Perturbation theory is presented in Chapter 12. The present discussion avoids mathematical details.
in MP theory, each class of excited configurations is included to infinite order. This is
accomplished via an exponential excitation operator,


$$\psi_{CC} = e^{\hat{T}} \psi_{HF} = \left(1 + \hat{T} + \frac{\hat{T}^2}{2!} + \frac{\hat{T}^3}{3!} + \cdots\right) \psi_{HF}$$  (11-42)


where $\psi_{HF}$ is the HF determinant for an N-electron system, and
$\hat{T} = \hat{T}_1 + \hat{T}_2 + \hat{T}_3 + \cdots + \hat{T}_N$. $\hat{T}_1$ produces singly excited determinants, $\hat{T}_2$ doubly excited ones, and
so on. Because of the exponential nature of the excitation operator, each class of
excitations is included to all orders, e.g., terms in $\hat{T}^2$ would include products of double
excitations ($\hat{T}_2^2$) that would be considered a subset of the possible quadruple excitations
in CI. This is what makes CC theory size consistent. Usually coupled cluster theory
is truncated to include just $\hat{T}_1$ and $\hat{T}_2$, i.e., CCSD. One of the most accurate post-HF
methods has been shown to be the CCSD(T) method, in which a CCSD calculation is
followed by a contribution due to triple excitations ($\hat{T}_3$) via perturbation theory.
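
How the exponential operator supplies the product-of-doubles term can be illustrated with a toy model. The sketch below assumes two noninteracting fragments, each with a reference state and a single doubly excited state; the parameters `delta` and `k` are hypothetical, and the model is an illustration of the idea rather than a general coupled cluster code. The series for $e^{\hat{T}}$ terminates after the quadratic term, which generates the quadruple excitation with coefficient $t^2$.

```python
import numpy as np

# Toy model (an assumption for illustration): per fragment, |0> at energy 0
# and |2> at energy 2*delta, coupled by k. Values are hypothetical.
delta, k = 0.5, 0.1  # hartree

h = np.array([[0.0, k],
              [k, 2.0 * delta]])
eye = np.eye(2)
h_pair = np.kron(h, eye) + np.kron(eye, h)  # basis |00>, |02>, |20>, |22>

# Fragment amplitude: projecting (H - E) e^T |0> onto |2> gives
# k + 2*delta*t - k*t**2 = 0; take the root that vanishes as k -> 0.
t = (delta - np.sqrt(delta**2 + k**2)) / k

# Excitation operator for one fragment (|0> -> |2>) and cluster operator
# for the pair, T = t * (x_A + x_B).
x = np.array([[0.0, 0.0],
              [1.0, 0.0]])
T = t * (np.kron(x, eye) + np.kron(eye, x))

# exp(T) = 1 + T + T^2/2 exactly, because T^3 = 0 in this model.
exp_T = np.eye(4) + T + T @ T / 2.0

ref = np.array([1.0, 0.0, 0.0, 0.0])      # |00>
psi_cc = exp_T @ ref                      # coefficients of |00>, |02>, |20>, |22>
e_cc_pair = ref @ h_pair @ psi_cc         # projected CC energy
residual = h_pair @ psi_cc - e_cc_pair * psi_cc

print("CC coefficients:", psi_cc)
print("E(pair, CC) =", e_cc_pair, " 2 x E(fragment) =", 2.0 * k * t)
```

The coefficient of the quadruply excited state comes out as the product of the two double-excitation amplitudes, and the pair energy is exactly twice the fragment energy, which is the size-consistent behavior described above.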


11-14 Multideterminant Methods


Up to this point, the methods that have been presented for describing electron cor- 
relation effects have been constructed with the single determinant SCF wavefunction 
as a starting point. For most molecules near their equilibrium geometries, this is a
very good zeroth-order approximation, but as we saw earlier for the H2 molecule, as 
covalent bonds are stretched towards dissociation multiple determinants are required 
for even a qualitative description. This puts much stronger demands on these so-called 
single reference methods, and their accuracy can be much degraded or even unphysical 
in these regions. In a multiconfigurational SCF (MCSCF) calculation one writes the 
wavefunction as a linear combination of determinants exactly as in a CI calculation, 
and the energy is minimized as a function of the linear CI coefficients. However, in 
an MCSCF calculation one also simultaneously optimizes the MO coefficients of the 
orbitals that are used to construct the determinants, using methods analogous to SCF 
theory. Because this greatly adds to the complexity of the calculation, the number of 
determinants used in MCSCF is generally much smaller than in a standard HF-based CI 
calculation. In the simplest case, only the additional determinants that allow for a qual- 
itative treatment of the process under study are included, e.g., one would include only 
the determinants corresponding to excitations of bonding electrons into their respective 
antibonding orbitals when stretching the triple bond of N$_2$. This procedure results in
a set of MCSCF molecular orbitals (some strongly occupied, some weakly occupied) 
that smoothly changes in character from equilibrium to dissociation. 
In multireference CISD (MRCISD) calculations, the wavefunction is written as


$$\psi = \sum_i c_i \psi_i + \sum_s c_s \psi_s + \sum_d c_d \psi_d$$  (11-43)


where $\sum_i c_i \psi_i$ is the set of MCSCF reference determinants, $\psi_s$ are new determinants
formed by single excitations into the virtual orbitals relative to all of the reference
determinants, and $\psi_d$ are doubly excited determinants. An MRCISD calculation of this
type can yield a very balanced and accurate description of a molecule's potential energy 
surface, but often at a relatively steep cost in terms of computational requirements.


11-15 Density Functional Theory Methods


The wavefunction $\psi$ for an n-electron molecule is a function of 3n spatial coordinates
and n spin coordinates. From $\psi$ we can produce the molecule's spin-free electron
density function, $\rho(1)$, by integrating $\psi^*\psi$ over all of the spin coordinates and all of
the space coordinates except those for one of the electrons:9


$$\rho(1) = n \int \psi^*(1, 2, \ldots, n)\, \psi(1, 2, \ldots, n)\, d\sigma_1\, d\tau_2 \cdots d\tau_n$$  (11-44)


which is a function of only the three spatial coordinates. We have seen that, in the 
early days of quantum chemistry, a major challenge was the evaluation of integrals 
over the interelectronic-repulsion term in the hamiltonian, as well as dealing with the 
related problem of electron correlation. Several methods were devised that attempted 
to approximate these quantities from the density function $\rho(1)$, with moderate success.
However, the continuing progress in computer speed and the development of sophis- 
ticated ab initio methods gradually shifted attention away from approaches using the 
density function. 

In 1964, proof by Hohenberg and Kohn [7] of a connection between the ground
state energy, $E_0$, for a system and $\rho_0$, the ground state density function,11 sparked
new interest in finding a rigorous way to go from knowledge of the attractively simple
three-dimensional density function to a value for $E_0$.

Recall that, for a system having n electrons and N nuclei, the hamiltonian operator
for the electronic energy is


$$\hat{H} = -\frac{1}{2} \sum_{i=1}^{n} \nabla_i^2 + \sum_{i=1}^{n} \sum_{\alpha=1}^{N} \frac{-Z_\alpha}{r_{i\alpha}} + \sum_{i=1}^{n-1} \sum_{j=i+1}^{n} \frac{1}{r_{ij}}$$  (11-45)


The first and last terms can be written down immediately if we know how many
electrons are present, but the middle term depends on $-\sum_{\alpha} Z_\alpha / r_{i\alpha}$, which is a function of
nuclear charges and locations. This quantity is called the external potential, symbolized
$v_{ext}(\mathbf{r})$, because it results from the presence of fields produced by particles not included
in the group of electrons.

Hohenberg and Kohn were able to prove that there is a uniqueness relation between $\rho_0$
and the external potential: No two external potentials could give the same $\rho_0$. This raises
the possibility that one could work backwards from $\rho_0$ to find $v_{ext}(\mathbf{r})$ and then $E_0$. The
following route comes first to mind: Integrate $\rho_0$ to get the number of electrons n. Figure
out $v_{ext}(\mathbf{r})$ from $\rho_0$. This would allow one to write down the hamiltonian operator.
Then, using ab initio methods, one could get to an accurate $E_0$ and $\psi_0$, and from $\psi_0$
one could calculate $T_0$, $V_{ne,0}$, $V_{ee,0}$, and all the other properties of interest for the system.
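
The first step of this route, integrating the density to count electrons, is easy to check numerically. The sketch below assumes a helium-like atom with both electrons in one normalized 1s Slater orbital; the exponent 27/16 is the familiar variational value for helium and is used here only as an illustrative choice, and the helper `trapezoid` is a hypothetical name introduced for this example.

```python
import numpy as np

def trapezoid(y, x):
    """Composite trapezoid rule for samples y on grid x."""
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))

zeta = 27.0 / 16.0                      # illustrative 1s exponent (assumption)
r = np.linspace(0.0, 20.0, 20001)       # radial grid in bohr

phi_1s = np.sqrt(zeta**3 / np.pi) * np.exp(-zeta * r)  # normalized 1s STO
rho = 2.0 * phi_1s**2                   # two electrons in the same orbital

# The density is spherically symmetric, so d^3r reduces to 4*pi*r^2 dr.
n_electrons = trapezoid(4.0 * np.pi * r**2 * rho, r)
print(f"integrated density = {n_electrons:.6f}")
```

The integral recovers the electron count because the density is built from normalized orbitals; nothing in this step reveals the external potential, which is the hard part of the route.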

Two problems exist with this scenario. First, there is no generally applicable
procedure known for getting from $\rho_0$ to $v_{ext}(\mathbf{r})$. We can posit that $v_{ext}(\mathbf{r})$ is a functional of
$\rho_0$, which we symbolize $v_{ext}[\rho_0]$, but we don't know what the functional relationship
is. Second, even if we could get back to the hamiltonian operator, it would simply 


9Because $\psi$ is antisymmetric for exchange of electrons, the density function is independent of our choice as to
which electron's coordinates should be spared from integration.

10Recall that $\sigma$ is the spin coordinate and $\tau$ is the coordinate for space and spin. $d\tau = dv\, d\sigma$.

11We henceforth suppress the electron index in $\rho$.
land us back on square one: We would still have to solve the whole problem in the
traditional way. Nevertheless, the hopes raised by this uniqueness theorem have led to
the current goal of density functional theory, which is to find a procedure that takes us
from $\rho$ to E in a rigorous way that avoids the complexities of landing on square one
and proceeding using standard ab initio methods. [8]

A subsequent relation proved by Hohenberg and Kohn [7] indicated a way to proceed.
They proved that an approximate density function, $\rho_{0,approx}$, when subjected to the
(unknown) procedure that relates the exact $\rho_0$ to the exact $E_0$, must yield an energy
at least as high as the exact $E_0$: $E_{0,approx} \ge E_0$, so a variational bound exists. Note that the
unknown process referred to here is one that assumes $v_{ext}(\mathbf{r})$ to be the same for the
analysis of $\rho_0$ and $\rho_{0,approx}$, which means that the same nuclear framework applies in
both cases.

If a procedure were known for finding E from $\rho$, then the existence of a variational
bound would allow a variational procedure analogous to what we have applied earlier.
One would start with a trial $\rho$, calculate its energy, and vary $\rho$ to locate the $\rho$ that gives
the lowest energy.

The barrier to proceeding is the lack of a way to get E from $\rho$. Hence, the
development of approximate functionals that relate the energy to the electron density is an
extremely active area of current research and probably will be for some time to come. 

In analogy to wavefunction methods, the functional that connects E to $\rho$, $E[\rho]$, can
be separated into an electronic kinetic energy contribution, $T[\rho]$, a contribution due
to nuclear-electron attractions, $E_{ne}[\rho]$, and the electron-electron repulsions, $E_{ee}[\rho]$.
The latter term can be further decomposed into Coulomb and exchange terms, $J[\rho]$
and $K[\rho]$. Both the nuclear-electron attraction and the interelectronic Coulomb terms
can be easily written in terms of the density using their classical expressions as in
wavefunction methods. For an accurate treatment of the electronic kinetic energy term,
however, one must differentiate a wavefunction,12 and this has led to the practice first
proposed by Kohn and Sham [9] of expressing the density in terms of one-electron
orbitals $\phi$ (constructed numerically or from a basis set of Slater or gaussian functions).
These orbitals serve two purposes. They allow us to calculate a value of the kinetic
energy within a single Slater determinant framework similar to Hartree-Fock theory,


$$T_s = \sum_{i=1}^{n} \langle \phi_i | -\tfrac{1}{2} \nabla^2 | \phi_i \rangle$$  (11-46)


and to obtain the electron density, defined in terms of these Kohn-Sham orbitals as

$$\rho = \sum_{i}^{n} |\phi_i|^2$$  (11-47)


The final DFT energy expression is then written as


$$E_{DFT}[\rho] = T_s[\rho] + E_{ne}[\rho] + J[\rho] + E_{xc}[\rho]$$  (11-48)


where the exchange correlation functional $E_{xc}[\rho]$ contains the difference between the
exact kinetic energy and $T_s$, the nonclassical (exchange) part of electron-electron
repulsions, $K[\rho]$, and correlation contributions to both $K[\rho]$ and $J[\rho]$. The Kohn-Sham


12As far as we know, we must differentiate a wavefunction to get kinetic energy. If there is a functional that
permits us to obtain kinetic energy directly from the density function, we might avoid having to use orbitals.

orbitals are eigenfunctions of an effective one-electron hamiltonian that is nearly identical
in form to the Fock operator in the SCF equations. In the Kohn-Sham case, however,
the HF exchange operators are replaced by the functional derivative of the exchange
correlation energy. Assuming the existence of $E_{xc}[\rho]$ and an initial guess for the electron
density, one then solves the Kohn-Sham eigenvalue equations for the orbitals,
which can then be used to define a new electron density and effective hamiltonian.
These iterations continue until the density is converged to within a specified threshold.

The exact form of $E_{xc}[\rho]$ is not currently known, however, and a rapidly growing
list of approximate exchange correlation functionals has appeared in the literature.
Because these are all estimates of a part of the overall energy, the total energy
finally calculated is not an upper bound to the true energy. Also, DFT is not
size-consistent.

Generally, most existing exchange correlation functionals are split into a pure
exchange and correlation contribution, $E_x[\rho]$ and $E_c[\rho]$, and the current functional
nomenclature often reflects this with two-part acronyms, e.g., the BLYP DFT method
uses an exchange functional from Becke (B) [10] and a correlation functional by Lee,
Yang, and Parr (LYP) [11]. In principle, the exchange contribution could be calculated
exactly (for a single determinant) in the same manner as $T_s$, but this is generally not
done since this disturbs the balance between $E_x[\rho]$ and $E_c[\rho]$. In hybrid DFT, a
percentage of this exact exchange is included in $E_x[\rho]$.

The great benefit of present day DFT methods is computational cost. With the
exchange correlation functionals commonly used, the computational effort is similar
to an SCF calculation, but since $E_{xc}[\rho]$ implicitly includes some amount of electron
correlation, the accuracy of DFT (depending on the chosen functional) is often similar
to that obtained with MP2 or better. The great weakness of DFT at the present time,
however, is the inability to systematically improve upon $E_{xc}[\rho]$ and converge towards
the exact Born-Oppenheimer energy as one might conceptually do in a
wavefunction-based CI or CC calculation, e.g., SCF, CCSD, CCSDT, CCSDTQ, etc. with sequences
of correlation consistent basis sets.

One of the simplest DFT methods is the local density approximation (LDA), which
assumes the density behaves locally like a uniform electron gas. Generally this does
not lead to an accurate description of molecular properties, but if one makes $E_x[\rho]$ and
$E_c[\rho]$ depend also on the gradient of the density, yielding gradient corrected DFT or
the generalized gradient approximation (GGA), the results are much more accurate.
Finally, the definition of $E_{xc}[\rho]$ also lends itself to semiempirical contributions. One
such parameterization that has been very successful is the B3LYP hybrid DFT method,
which includes 20% exact exchange and involves three semiempirical parameters that
were obtained by fits to experimental thermochemical data (heats of formation, etc.)
of small molecules [12].
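
To make the local density approximation concrete, the sketch below evaluates the uniform-electron-gas (Dirac) exchange energy, $E_x^{LDA} = -\frac{3}{4}(3/\pi)^{1/3} \int \rho^{4/3}\, dv$, for the exact hydrogen-atom ground-state density. This formula is the standard spin-unpolarized LDA exchange expression and is not written out in the text above; the choice of test density, the grid, and the helper name `trapezoid` are assumptions made for illustration.

```python
import numpy as np

def trapezoid(y, x):
    """Composite trapezoid rule for samples y on grid x."""
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))

r = np.linspace(0.0, 30.0, 60001)       # radial grid in bohr
rho = np.exp(-2.0 * r) / np.pi          # exact H-atom 1s density (a.u.)

# Sanity check: the density integrates to one electron.
n_electrons = trapezoid(4.0 * np.pi * r**2 * rho, r)

# Spin-unpolarized Dirac (uniform electron gas) exchange.
c_x = 0.75 * (3.0 / np.pi) ** (1.0 / 3.0)
e_x_lda = -c_x * trapezoid(4.0 * np.pi * r**2 * rho ** (4.0 / 3.0), r)

# Closed form of the same integral for this density, used as a cross-check.
e_x_closed = -c_x * 4.0 * np.pi ** (-1.0 / 3.0) * 2.0 / (8.0 / 3.0) ** 3

print(f"electrons = {n_electrons:.6f}")
print(f"E_x(LDA)  = {e_x_lda:.4f} hartree (closed form {e_x_closed:.4f})")
```

For this one-electron density the exact exchange energy must cancel the classical self-repulsion, which is 5/16 hartree, so the exact value is -0.3125 hartree. The local formula recovers only about two-thirds of it, an example of why LDA alone is often not accurate enough for molecular properties.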


11-16 Examples of Ab Initio Calculations


Self-consistent-field and correlated calculations have now been made for a very large 
number of systems. The best way to judge the capabilities of these methods is to survey 
some of the results. 13 


13For extensive surveys, see Schaefer [13], Hehre et al. [14], and Raghavachari [15].

Table 11-2 provides information on energies for a number of atoms in their ground
states. Self-consistent-field energies are presented for three levels of basis set
complexity. In the STO single-$\zeta$ level, a minimal basis set of one STO per occupied AO is
used, and the energy is minimized with respect to independent variation of every orbital
exponent $\zeta$. The STO double-$\zeta$ basis set is similar except that there are two STOs for
each AO, the only restriction being that the STOs have the same spherical harmonics
as the AOs to which they correspond.

The Hartree—Fock energies are estimated by extrapolating from more extensive basis 
sets, and represent the limit achievable for the SCF approach using a complete basis 
set. We can make the following observations: 


1. The improvement in energy obtained when one goes from a single-$\zeta$ to a double-$\zeta$
STO basis set is substantial, especially for atoms of higher Z.


2. The agreement between the optimized double-$\zeta$ data and the HF energies is quite
good. Even for neon, the error is only about $10^{-2}$ a.u. (0.27 eV). Thus, for atoms,
the double-$\zeta$ basis is capable of almost exhausting the energy capabilities of a
single-configuration wavefunction.


3. The disagreement between HF and "exact" energies (i.e., the correlation energy)
grows progressively larger down the list. For neon it is almost 0.4 a.u. (10 eV),
which is an unacceptable error in chemical measurements.
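
The correlation energies behind observation 3 follow directly from the definition "exact" minus Hartree-Fock. The short sketch below recomputes them from the Hartree-Fock and "exact" columns of Table 11-2; the conversion factor 27.2114 eV per hartree and the variable names are choices made for this example.

```python
HARTREE_TO_EV = 27.2114

# (Hartree-Fock, "exact") energies in a.u., copied from Table 11-2
atom_energies = {
    "He": (-2.8616799, -2.9037),
    "Li": (-7.4327256, -7.4774),
    "Be": (-14.573021, -14.6663),
    "B": (-24.529057, -24.6519),
    "C": (-37.688612, -37.8420),
    "N": (-54.400924, -54.5849),
    "O": (-74.809370, -75.0607),
    "F": (-99.409300, -99.7224),
    "Ne": (-128.54705, -128.925),
    "Ar": (-526.81739, -527.542),
}

correlation = {}
for atom, (e_hf, e_exact) in atom_energies.items():
    correlation[atom] = e_exact - e_hf  # correlation energy in a.u.
    print(f"{atom:>2}  {correlation[atom]:8.4f} a.u."
          f"  {correlation[atom] * HARTREE_TO_EV:8.2f} eV")
```

The magnitudes grow steadily from helium to argon, and the neon value reproduces the roughly 0.4 a.u. (10 eV) quoted above.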


One might think that the magnitude of the correlation energy in these examples 
would make SCF calculations on heavy atoms useless for quantitative purposes, but this 


TABLE 11-2 » Ground-State Energies (in atomic units) for Atoms, as Computed by the SCF
Method and from Experiment


                     STO
Atom   Single ζ^a     Double ζ^a     Hartree-Fock^a   Exact^b      Correlation energy^c
He      -2.8476563     -2.8616726     -2.8616799       -2.9037     -0.0420
Li      -7.4184820     -7.4327213     -7.4327256       -7.4774     -0.0447
Be     -14.556740     -14.572369     -14.573021       -14.6663     -0.0933
B      -24.498369     -24.527920     -24.529057       -24.6519     -0.1228
C      -37.622389     -37.686749     -37.688612       -37.8420     -0.1534
N      -54.268900     -54.397951     -54.400924       -54.5849     -0.1840
O      -74.540363     -74.804323     -74.809370       -75.0607     -0.2513
F      -98.942113     -99.401309     -99.409300       -99.7224     -0.3131
Ne    -127.81218     -128.53511     -128.54705       -128.925      -0.378
Ar    -525.76525     -526.81511     -526.81739       -527.542      -0.725


^a From Roetti and Clementi [16].

^b "Exact" equals experimental with relativistic correction but without correction for Lamb shift. See Veillard
and Clementi [17].

^c Correlation energy is "exact" minus HF energy.

is not the case. Most frequently we are not concerned with the value of the total energy
of a system so much as with energy changes (e.g., in spectroscopy or in reactions) or 
else with other properties such as transition moments (for spectroscopic intensities) or, 
in molecular systems, dipole moments. 

Let us, therefore, see how well SCF calculations can predict atomic ionization ener- 
gies. We have already indicated (Section 11-11) that there are two ways we can get 
ionization energies from SCF calculations. The first, and simplest, is to take the various
$-\epsilon_i$, as suggested by Koopmans' theorem. Table 11-3 shows that this gives only
rough agreement with experimental values for neon. Another way is to do separate SCF
calculations for each excited state produced by removal of an electron from an orbital
(i.e., for each "hole state") and equate the ionization energies to the energy differences
between these and the neutral ground state ($\Delta$SCF). This second method requires much
more effort. As Table 11-3 indicates, however, the extra effort leads to great improve- 
ment in agreement between theoretical and experimental values. We conclude that SCF 
calculations on atoms and ions give quantitatively useful data on ionization energies, 
even for ionization out of deep-lying levels. The Koopmans’ theorem approach is less 
accurate, although still qualitatively useful. 
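
The comparison in Table 11-3 can be made quantitative by tabulating the error of each estimate against experiment. The sketch below uses the three rows of the table as given; the hole labels, the conversion factor 27.2114 eV per hartree, and the variable names are choices made for this example.

```python
HARTREE_TO_EV = 27.2114

# (Koopmans, delta-SCF, experiment) ionization energies in a.u., Table 11-3
neon_ip = {
    "1s": (32.7723, 31.9214, 31.98),
    "2s": (1.9303, 1.8123, 1.7815),
    "2p": (0.8503, 0.7293, 0.7937),
}

errors_ev = {}
for hole, (koopmans, delta_scf, expt) in neon_ip.items():
    errors_ev[hole] = (
        (koopmans - expt) * HARTREE_TO_EV,   # Koopmans error in eV
        (delta_scf - expt) * HARTREE_TO_EV,  # delta-SCF error in eV
    )
    print(f"{hole}: Koopmans {errors_ev[hole][0]:+7.2f} eV,"
          f" delta-SCF {errors_ev[hole][1]:+6.2f} eV")
```

For the 1s and 2s holes the $\Delta$SCF error is several times smaller than the Koopmans error. For the 2p hole the two estimates miss by a comparable amount (about 0.06 a.u.) in opposite directions, so the improvement is most striking for the deeper levels.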

A related problem is the calculation of energies of excited states of atoms. Weiss 
[19] has reported calculations on some of the excited states of carbon, and his results are 
summarized in Fig. 11-3. Inspection of this figure reveals that near-HF calculations only 
roughly reproduce the energy spectrum, but CI (with four or five configurations) brings 
about marked improvement. Weiss has omitted configurations involving excitations of 
1s electrons, and so these results ignore correlation energy for the inner-shell electrons. 
The agreement suggests that these electrons experience almost no change in correlation 
for transitions among these states. Weiss has also calculated oscillator strengths14
associated with atomic transitions and he finds that CI is necessary before reasonable 
agreement with experiment is achieved. 


TABLE 11-3 » Ionization Energies of Neon^a


                          Ionization potential (a.u.)
Ion configuration      Koopmans     ΔSCF       Experiment
1s 2s² 2p⁶             32.7723      31.9214    31.98
1s² 2s 2p⁶              1.9303       1.8123     1.7815
1s² 2s² 2p⁵             0.8503       0.7293     0.7937


^a From Bagus [18]. The basis set includes 5 s-type and 12 p-type STOs
(4 of each m quantum number). ζ's were varied as well as linear
coefficients. The neutral ground state gives E = -128.547 a.u. (compare
Table 11-2).


14The oscillator strength is a measure of the probability (i.e., intensity) of a transition. For a transition between
states a and b in a 2n-electron system it is commonly given by the formula

Figure 11-3 » Transition energies in the C$^+$ ion as calculated by HF, CI, and as measured. (From
Weiss [19].) Ionization from the ground state of C$^+$ occurs at 0.8958 a.u.


In brief, then, the evidence indicates that reasonably accurate atomic ionization 
energies can be obtained by high-quality SCF calculations on the neutral and ionized 
species ($\Delta$SCF, not $-\epsilon$), but that transition energies and intensities require CI sufficient
to account for much of the valence electron correlation. 

Before we leave the subject of atoms, it should be pointed out that, for any atom, the
expectation value T of the kinetic energy operator is equal to $-E$ if the wavefunction
has been optimized with respect to a scale factor in the coordinates $r_1$, $r_2$, etc. This
relation, called the virial relation, is proved in Appendix 8. It is necessarily satisfied
for any level of calculation that cannot be improved by replacing every $r_i$ in $\psi$ by $\eta r_i$
and allowing $\eta$ to vary. Since the single- and double-$\zeta$ STO solutions have already
been optimized with respect to such scale parameters, they satisfy the virial relation.
Thus, for the beryllium atom, the single-$\zeta$ STO value for E is (Table 11-2) -14.556740
a.u., and so we know that T = +14.556740 a.u. and V = -29.113480 a.u. for this
wavefunction (since E = T + V). For the double-$\zeta$ wavefunction T = +14.572369
a.u., etc. The Hartree-Fock wavefunction is, by definition, the lowest-energy solution
achievable within a restricted (single determinantal15) wavefunction form. Use of a
scale factor does not affect the wavefunction form. Hence, no further lowering of E
below the HF level is possible in this way, and the HF energies E, T, and V must
satisfy the virial relation also. Finally, the exact energies are the lowest achievable for


15For open-shell systems, more than one determinant may be needed to satisfy symmetry requirements. This
is still considered a HF wavefunction.

any wavefunction. Again, scaling cannot lower the energy further, so these energies
also satisfy the virial relation.
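
The bookkeeping implied by the atomic virial relation is simple enough to state as a two-line function: if T = -E and E = T + V, then V = 2E. The sketch below applies it to the beryllium entries of Table 11-2; the function name `virial_partition` is hypothetical, and the partition is valid only for wavefunctions that are already optimal with respect to coordinate scaling.

```python
def virial_partition(total_energy):
    """Split an atomic total energy into (T, V) assuming the virial relation."""
    kinetic = -total_energy          # T = -E
    potential = 2.0 * total_energy   # V = E - T = 2E
    return kinetic, potential

# Beryllium total energies in a.u. from Table 11-2
be_single_zeta = -14.556740
be_double_zeta = -14.572369

t_single, v_single = virial_partition(be_single_zeta)
t_double, v_double = virial_partition(be_double_zeta)
print(f"single-zeta: T = {t_single:+.6f}, V = {v_single:+.6f}")
print(f"double-zeta: T = {t_double:+.6f}, V = {v_double:+.6f}")
```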

We turn next to ab initio calculations on molecules. First, let us compare HF and 
exact energies for molecules as we did for atoms and see how large the errors due 
to correlation are. The results are not too different from those for atoms having the 
same number of electrons, as shown in Table 11-4; that is, the correlation energies 
for molecules having ten electrons (CH$_4$, NH$_3$, H$_2$O, HF) are about the same as that
for neon, whereas that for the 18-electron molecule H$_2$O$_2$ is more like the correlation
energy for argon. But this is only a very rough rule of thumb. We have already
indicated that the correlation energy in a molecule varies with bond length, a factor 
not present in atomic problems. In order to get a more meaningful idea of the capa- 
bilities of ab initio calculations on molecules, we must look more closely at specific 
examples. 

A calculation on the OH radical, reported by Cade and Huo [21], provides a good 
example of the capabilities of the extended basis set LCAO-MO-SCF technique on a 
small molecule. Their final wavefunction for the ground state at an internuclear sep- 
aration R = 1.8342 a.u. is presented in Table 11-5. A minimal basis set of STOs for 
OH would include 1s, 2s, 2px, 2py, and 2p, STOs on oxygen and a single 1s AO on 
hydrogen. Cade and Huo chose a much more extensive basis. Oxygen is the site 
for two 1s, two 2s, one 3s, four 2p, one 4f, eight 2px, two 3d,,, and four 4f, STOs. 
On hydrogen, there are two 1s, one 2s, one 2p,, two 2px, and two 3d, STOs. (The 
x-type basis functions are indicated in Table 11-5 for only one of the two directions 
perpendicular to the O-H axis.) The orbital exponents for all of these STOs have 
been optimized, and the resulting wavefunction is of “near-Hartree—Fock” quality. The 
optimized ¢ values appear in Table 11-5. The STO labeled c2p, is located on oxygen 
and has the formula 


o 2p), = (2.13528)? ; - V?» exp(—2.13528r) cos 0 (11-49) 
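
A quick numerical check confirms that an STO of this kind is normalized. The sketch below assumes the usual normalized $2p_\sigma$ Slater form, $N r \exp(-\zeta r) \cos\theta$ with $N = (\zeta^5/\pi)^{1/2}$, and uses the exponent 2.13528 from Table 11-5; the grid and the helper name `trapezoid` are choices made for this example.

```python
import numpy as np

def trapezoid(y, x):
    """Composite trapezoid rule for samples y on grid x."""
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))

zeta = 2.13528                               # exponent from Table 11-5
norm_const = np.sqrt(zeta**5 / np.pi)        # assumed normalization constant

# Radial part: integral of (r e^{-zeta r})^2 r^2 dr
r = np.linspace(0.0, 30.0, 30001)
radial = trapezoid(r**4 * np.exp(-2.0 * zeta * r), r)

# Angular part: integral of cos^2(theta) over the sphere is 4*pi/3
angular = 4.0 * np.pi / 3.0

norm = norm_const**2 * radial * angular
print(f"<2p_sigma | 2p_sigma> = {norm:.6f}")
```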


TABLE 11-4 » Estimated Hartree-Fock and Correlation Energies for Selected Molecules^a and
Atoms^b


Molecule     E(HF)      E(correlation)    Molecule     E(HF)      E(correlation)
or atom      (a.u.)     (a.u.)            or atom      (a.u.)     (a.u.)

H2           -1.132     -0.043            Ne          -128.547    -0.378
He           -2.862     -0.042            CO          -112.796    -0.520
BH3         -26.403     -0.195            N2          -108.994    -0.540
O(^1D)      -74.729     -0.262            Si(^1D)     -288.815    -0.505
CH4         -40.219     -0.291            B2H6         -52.835    -0.429
NH3         -56.225     -0.334            S(^1D)      -397.452    -0.606
H2O         -76.067     -0.364            H2O2        -150.861    -0.688
HF         -100.074     -0.373            Ar          -526.817    -0.725


^a From Ermler and Kern [20].
^b See Table 11-2.

TABLE 11-5 » Near Hartree-Fock Wavefunction for the OH Molecule in Its Ground-State
Configuration ($1\sigma^2 2\sigma^2 3\sigma^2 1\pi^3$) at R = 1.8342 a.u.^a


χ_σ                       C_1σ       C_2σ       C_3σ       χ_π                       C_1π

σ1s_O (ζ = 7.01681)       0.94291   -0.25489    0.07625    π2p_O (ζ = 1.26589)       0.37429
σ1s_O' (12.38502)         0.09313    0.00358   -0.00153    π2p_O' (2.11537)          0.46339
σ2s_O (1.71794)          -0.00162    0.46526   -0.20040    π2p_O'' (3.75295)         0.23526
σ2s_O' (2.86331)          0.00418    0.55854   -0.18328    π2p_O''' (8.41140)        0.01023
σ3s_O (8.64649)          -0.03826   -0.02643    0.00550    π3d_O (1.91317)           0.02871
σ2p_O (1.28508)          -0.00055    0.05179    0.30153    π4f_O (2.19941)           0.00506
σ2p_O' (2.13528)         -0.00056   -0.07538    0.37791    π2p_H (1.76991)           0.02442
σ2p_O'' (3.75959)         0.00115    0.01874    0.18390    π3d_H (3.32513)           0.00282
σ2p_O''' (8.22819)        0.00059    0.00229    0.00952
σ3d_O (1.63646)          -0.00047    0.02437    0.04676
σ3d_O' (2.82405)          0.00016    0.00845    0.01595
σ4f_O (2.26641)          -0.00013    0.00882    0.01232
σ1s_H (1.31368)           0.00150   -0.04651    0.21061
σ1s_H' (2.43850)         -0.00034    0.09413    0.05113
σ2s_H (2.30030)           0.00000    0.07654    0.04539
σ2p_H (2.8052)            0.00018    0.01182    0.00999


^a From Cade and Huo [21].


There are three $\sigma$-type MOs and two $\pi$-type MOs to accommodate the nine electrons
of this radical. One $\pi$-type MO is


$$1\pi_y = 0.37429\, \pi 2p_{Oy} + 0.46339\, \pi 2p_{Oy}' + 0.23526\, \pi 2p_{Oy}'' + 0.01023\, \pi 2p_{Oy}''' + 0.02871\, \pi 3d_{Oy} + 0.00506\, \pi 4f_{Oy} + 0.02442\, \pi 2p_{Hy} + 0.00282\, \pi 3d_{Hy}$$  (11-50)


and the other occupied $\pi$-type MO would be the same except with x instead of y. (The
z axis is coincident with the internuclear axis.) It is evident that writing out the complete
wavefunction given in Table 11-5 would result in a very cumbersome expression. It is
a nontrivial problem to relate an accurate but bulky wavefunction such as this to the
kinds of simple conceptual schemes chemists like to use. One solution is to have a
computer produce contour diagrams of the MOs. Such plots for the valence MOs $2\sigma$,
$3\sigma$, and $1\pi$ of Table 11-5 are presented in Fig. 11-4.

Cade and Huo [21] carried out similar calculations for OH at 13 other internuclear 
distances and also for the united atom (fluorine) and the separated atoms in the states 
with which the Hartree-Fock wavefunction correlates. Some of their data are repro- 
duced in Table 11-6. A plot of the electronic-plus-nuclear repulsion energies is given 
in Fig. 11-5 along with the experimentally derived curve. It is evident that the near HF 
curve climbs too steeply on the right, leading to too "tight" a potential well for nuclear 
motion and too small an equilibrium internuclear separation. This comes about because, 
as mentioned earlier, the HF solution dissociates to an incorrect mixture of states, some

Figure 11-4 » Contour plots of HF valence orbitals for OH as given in Table 11-5. (From
Stevens et al. [22].)


TABLE 11-6 » Spectroscopic Parameters and Dipole Moment for OH from Theoretical Curves
and from Experiment^a


                Dipole moment   R_e      D_e     ω_e        ω_e x_e    α_e
Wavefunction    (debyes)        (a.u.)   (eV)    (cm^-1)    (cm^-1)    (cm^-1)

SCF             1.780           1.795    8.831   4062.6     165.09     0.661
CI              1.655           1.838    4.702   3723.6      83.15     0.628
Experimental    1.66 ± 0.01     1.834    4.63    3735.2      82.81     0.714


^a From Stevens et al. [22].


of which are ionic. It is possible to use the HF curve of Fig. 11-5 to derive theoretical 
values for molecular constants that can be compared to spectroscopic data. The results 
are displayed in Table 11-6, and they reflect the inaccuracy in the HF energy curve. 
Included there are the SCF and experimental values for the molecular dipole moment. 

We turn now to the behavior of V/T for the HF wavefunctions of Cade and Huo
at various internuclear separations. The data appear in Table 11-7. Observe that the

Figure 11-5 » Theoretical and experimental energy curves for OH. (From Stevens et al. [22].)


value of $-2.00000$ for V/T occurs at three values of R: 0, $\infty$, and the point where E
is a minimum. At R = 0 and $\infty$, we are dealing with one or two atoms, for which we
have already seen the HF solution should give V/T = $-2$. At intermediate R we have
a diatomic molecule, for which the virial relation is (see Appendix 8)


$$2T + V + R \frac{\partial E}{\partial R} = 0$$  (11-51)


There are three cases to consider. If $\partial E/\partial R = 0$, then V/T = $-2$. This will occur
at the minimum of the potential energy curve (and also at any subsidiary maxima or
minima). If $\partial E/\partial R$ is negative, then, since $V/T = -2 - (R/T)(\partial E/\partial R)$ and T is


TABLE 11-7 » HF Total Energies and V/T for OH as a Function of Internuclear Distance^a 

R (a.u.)   E (a.u.)     V/T          R (a.u.)   E (a.u.)     V/T

0          −99.40933    −2.00000     1.90       −75.41837    −2.00129
1.40       −75.34382    −1.99076     2.00       −75.41140    −2.00225
1.50       −75.38378    −1.99398     2.10       −75.40163    −2.00300
1.60       −75.40696    −1.99651     2.25       −75.38372    −2.00380
1.70       −75.41829    −1.99850     2.40       −75.36367    −2.00433
1.75       −75.42065    −1.99933     2.60       −75.33582    −2.00474
1.795      −75.42127    −2.00000     2.80       −75.30822    −2.00492
1.8342     −75.42083    −2.00052     ∞          −75.30939    −2.00000

^a From Cade and Huo [21]. 
positive, V/T will be algebraically higher than −2 (e.g., −1.98). If $\partial E/\partial R$ is positive, 
V/T will be lower than −2. Thus, the values of V/T in Table 11-7 reflect the slope of 
a line tangent to the potential energy curve at each R value. 
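
The link between V/T and the slope of the energy curve can be checked numerically. The following is a minimal sketch, not part of the original treatment: it uses the finite-R entries of Table 11-7, obtains T from E = T + V (so that T = E/(1 + V/T)), and compares the slope implied by the virial relation with a crude central-difference slope taken from neighboring table entries. The function names are illustrative choices.

```python
# Finite-R entries of Table 11-7: (R in a.u., E in a.u., V/T).
TABLE_11_7 = [
    (1.40, -75.34382, -1.99076),
    (1.50, -75.38378, -1.99398),
    (1.60, -75.40696, -1.99651),
    (1.70, -75.41829, -1.99850),
    (1.75, -75.42065, -1.99933),
    (1.795, -75.42127, -2.00000),
    (1.8342, -75.42083, -2.00052),
    (1.90, -75.41837, -2.00129),
    (2.00, -75.41140, -2.00225),
    (2.10, -75.40163, -2.00300),
    (2.25, -75.38372, -2.00380),
    (2.40, -75.36367, -2.00433),
    (2.60, -75.33582, -2.00474),
    (2.80, -75.30822, -2.00492),
]


def virial_slope(R, E, ratio):
    """Return dE/dR implied by 2T + V + R dE/dR = 0.

    With E = T + V and ratio = V/T, the kinetic energy is
    T = E / (1 + ratio), and the virial relation rearranges to
    dE/dR = -(ratio + 2) * T / R.
    """
    T = E / (1.0 + ratio)
    return -(ratio + 2.0) * T / R


def finite_difference_slopes(table):
    """Central-difference estimate of dE/dR at interior points (uneven grid)."""
    slopes = {}
    for (R0, E0, _), (R1, _, _), (R2, E2, _) in zip(table, table[1:], table[2:]):
        slopes[R1] = (E2 - E0) / (R2 - R0)
    return slopes


if __name__ == "__main__":
    fd = finite_difference_slopes(TABLE_11_7)
    print("  R       virial dE/dR   finite-difference dE/dR")
    for R, E, ratio in TABLE_11_7:
        numeric = fd.get(R)
        numeric_text = "n/a" if numeric is None else f"{numeric:9.4f}"
        print(f"{R:7.4f}   {virial_slope(R, E, ratio):9.4f}      {numeric_text}")
```

The two slope estimates should agree in sign everywhere and roughly in magnitude; the finite-difference values are only approximate because the table spacing is coarse.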

As mentioned earlier, it is possible to at least partly include the effects of electron 
correlation by allowing determinants corresponding to other configurations to mix into 
the wavefunction. Such calculations have been performed for the OH radical by several 
groups, and the results of Stevens et al. [22] are included in Table 11-6 and Fig. 11-5. 
These data come from intermixing 14 configurations. It is evident that the inclusion of 
correlation through CI has markedly improved the agreement with experiment. 

Many diatomic molecules have been treated at a comparable and higher level, and it 
is clear that ab initio calculations including electron correlation are capable of giving 
quite accurate molecular data. In cases in which the diatomic system is experimentally 
elusive, such calculations may be the best source of data available. A further example of 
this is provided in Table 11-8, in which are listed dipole moments for ground and some 
excited states of diatomic molecules. The dipole moments computed from near-HF 
wavefunctions contain substantial errors. It can be seen that CI greatly improves dipole 
moments. It has been observed that inclusion of singly excited configurations is very 
important in obtaining an accurate dipole moment. 

As a general rule, CID correlates electron motion and therefore has a significant 
energy-lowering effect but has little effect on the one-electron distribution or related 
properties, like dipole moment. Inclusion of singly excited configurations (CISD) 
allows the one-electron distribution to shift in response to the change in calculated 
interelectronic repulsion. For example, the value of the ground-state dipole moment 
of CO (entry 4 of Table 11-8) is calculated at the CID level to be −0.20 D and at the 
CISD level to be +0.12 D. Thus CID may be a suitable level of computational effort if 
the interest is in energy, but CISD is better if the interest is in one-electron properties. 


TABLE 11-8 » Calculated and Experimental Dipole Moments of Diatomic Molecules 
(in Debyes) 

Molecule and polarity   State    HF at R_e^a   CI at R_e^a   Experiment       Reference

Li⁺H⁻                   X ¹Σ⁺     6.002         5.86          5.83             [23]
C⁺N⁻                    X ²Σ⁺     2.301         1.465         1.45 ± 0.08      [24]
C⁻N⁺                    B ²Σ⁺     -             0.958         1.15 ± 0.08      [24]
C⁻O⁺                    X ¹Σ⁺    −0.274         0.12          0.112 ± 0.005    [23]
C⁺O⁻                    a ³Π      2.34          1.43          1.37             [25]
C⁻S⁺                    X ¹Σ⁺     1.56          2.03          1.97             [23]
C⁻S⁺                    A ¹Π     −0.09          0.63          0.63 ± 0.04      [26]
C⁻H⁺                    X ²Π      1.570         1.53          1.46 ± 0.06      [27]
O⁻H⁺                    X ²Π      1.780         1.655         1.66 ± 0.01      [27]
F⁻H⁺                    X ¹Σ⁺     1.942         1.805         1.797            [27]
N⁻H⁺                    X ³Σ⁻     1.627         1.537         Unknown          [27]

^a Experimental R_e value used. 
Highly accurate properties can be obtained with more sophisticated electron correlation 
methods, such as CCSD(T) or MRCISD. 

Other examples of the use of ab initio methods on small molecules are shown in 
Table 11-9, which displays some calculated properties for the electronic ground states of 
H2 and N2 as a function of method with the cc-pVTZ basis set. In the case of H2, the SCF 
bond length is slightly too short and the harmonic frequency too large compared to experiment, 
and the dissociation energy is underestimated by about 26 kcal/mole due to the lack 
of electron correlation. For this two-electron system, the CISD and CCSD methods are 
equivalent to a FCI and exhibit marked improvement compared to SCF. The remaining 
deviations from experiment at this level of theory can be attributed to the use of the 
finite cc-pVTZ basis set. The MP methods show systematic improvement with each 
order of perturbation theory, but even a fourth-order treatment of single and double 
excitations results in non-negligible errors compared to FCI for this simple system. The 
B3LYP hybrid density functional method is observed to yield very reliable properties 
in this case. 

As might be expected due to its triple bond, the N2 molecule is considerably more 
challenging for ab initio methods. With the cc-pVTZ basis set, the SCF dissociation 
energy is smaller than experiment by nearly a factor of 2. Appreciable differences are 
now observed between the CISD and CCSD results, with the latter being somewhat 
closer to experiment. In addition, triple excitations, as measured by the difference 
between CCSD and CCSD(T), are relatively important for N2, raising the dissociation 
energy by nearly 9 kcal/mole. In contrast to the H2 case, the results for N2 using 
perturbation theory (MP2, MP3, MP4) display a disturbing oscillatory behavior. This 
type of result with MP methods has been the subject of several previous studies.¹⁶ 


TABLE 11-9 » Calculated Equilibrium Bond Lengths, Harmonic Vibrational Frequencies, and 
Equilibrium Dissociation Energies for the Ground States of H2 and N2 with the cc-pVTZ Basis Set 
Compared to Experiment 

                          H2                                      N2
             r_e (Å)   ω_e (cm^-1)   D_e (kcal/mole)   r_e (Å)   ω_e (cm^-1)   D_e (kcal/mole)

SCF          0.734     4587           83.7             1.067     2732          120.4
CISD         0.743     4409          108.4             1.089     2509          193.1
MP2          0.737     4526          103.6             1.114     2195          228.7
MP3          0.739     4476          107.1             1.090     2532          206.0
MP4          0.741     4441          108.0             1.113     2192          221.2
CCSD         0.743     4409          108.4             1.097     2424          207.7
CCSD(T)                                                1.104     2346          216.5
B3LYP        0.743     4419          110.3             1.092     2449          229.6
Expt. [28]   0.741     4403          109.5             1.098     2359          228.4


16 See Dunning and Peterson [29] and references therein. 
Lastly, it is again the case that the B3LYP method yields relatively accurate results for 
this molecule and appears to be comparable in quality to MP2. 

The results shown in Table 11-10 explore the choice of basis set with the CCSD and 
CCSD(T) methods for the H2 and N2 molecules, respectively. A large dependence on 
basis set is observed in each case. The use of a minimal basis set, STO-3G, leads to large 
errors since it provides very few virtual orbitals for electron correlation. Just a double-ζ 
basis set, either 6-31G** or cc-pVDZ, is observed to be a great improvement. The 
systematic convergence of the correlation consistent basis sets is readily observed in 
these results. One should note that increasing the size of the basis set from cc-pVTZ to 
cc-pV5Z in N2 results in an increase in De by nearly 9 kcal/mole. This implies that the 
highly accurate result for De shown in Table 11-9 for the MP2 level of theory with the 
cc-pVTZ basis set was clearly fortuitous. From these results it should be obvious that 
errors due to basis set incompleteness can often rival those due to inadequate electron 
correlation. 
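
The convergence pattern described above can be made explicit with a few lines of arithmetic. The sketch below is only an illustration: it takes the N2/CCSD(T) dissociation energies for the correlation consistent basis sets from Table 11-10 and reports, for each basis set, the remaining error relative to experiment and the gain over the previous member of the series. The function name `convergence_report` is a hypothetical choice.

```python
# N2 CCSD(T) dissociation energies (kcal/mole) from Table 11-10,
# correlation consistent basis sets only, in order of increasing size.
DE_N2_CCSD_T = {
    "cc-pVDZ": 200.6,
    "cc-pVTZ": 216.5,
    "cc-pVQZ": 222.9,
    "cc-pV5Z": 225.1,
}
DE_N2_EXPERIMENT = 228.4  # kcal/mole, Table 11-10


def convergence_report(values, reference):
    """Return rows of (basis, D_e, error vs reference, gain over previous basis)."""
    rows = []
    previous = None
    for name, de in values.items():
        gain = None if previous is None else round(de - previous, 1)
        rows.append((name, de, round(reference - de, 1), gain))
        previous = de
    return rows


if __name__ == "__main__":
    for name, de, error, gain in convergence_report(DE_N2_CCSD_T, DE_N2_EXPERIMENT):
        gain_text = "-" if gain is None else f"{gain:+.1f}"
        print(f"{name:8s}  D_e = {de:6.1f}  error = {error:5.1f}  gain = {gain_text}")
```

Each step up in the series recovers less than the one before it, which is the systematic convergence referred to in the text.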

As shown above, the hybrid DFT method B3LYP can be competitive in accuracy with 
more computationally expensive methods, such as CCSD(T). In fact, recent benchmark 
calculations [30] have shown that for the calculation of thermochemical quantities like 
enthalpies of formation, B3LYP exhibits average errors of only 1–5 kcal/mol. While 
these are still more than a factor of two larger than the errors obtained with coupled 
cluster methods, the much lower computational cost of B3LYP makes it a very attractive 
alternative. The accuracy of equilibrium bond lengths and harmonic vibrational 
frequencies calculated by B3LYP has also been shown to be very satisfactory. The 
accurate calculation of some molecular properties, however, is still a great challenge to 
hybrid DFT methods. In particular, reaction activation energies are often too small and 
van der Waals interactions can be qualitatively incorrect. Correcting these deficiencies 
is the goal of many second generation hybrid DFT functionals.¹⁷ 

We have seen that inclusion of electron correlation often improves the E versus 
R curve because it allows for variable ionic-covalent character in the wavefunction. 


TABLE 11-10 » Dependence on Basis Set Choice for the CCSD and CCSD(T) Properties of H2 
and N2, respectively 

                        H2/CCSD                                N2/CCSD(T)
             r_e (Å)   ω_e (cm^-1)   D_e (kcal/mole)   r_e (Å)   ω_e (cm^-1)   D_e (kcal/mole)

STO-3G       0.735     5002          128.1             1.190     2145          147.8
6-31G**      0.738     4504          105.9             1.120     2342          201.6
cc-pVDZ      0.761     4383          103.6             1.119     2339          200.6
cc-pVTZ      0.743     4409          108.4             1.104     2346          216.5
cc-pVQZ      0.742     4403          109.1             1.100     2356          222.9
cc-pV5Z      0.742     4405          109.3             1.099     2360          225.1
Expt. [28]   0.741     4403          109.5             1.098     2359          228.4


17 See, for instance, Zhao et al. [31] and Xu et al. [32]. 
However, there are some diatomic molecules that maintain a high degree of ionic 
character even when the nuclei are quite widely separated. NaCl is an example. For 
such systems, the Hartree-Fock energy curve is quite nearly parallel to the exact energy 
curve throughout the minimum energy region (i.e., the correlation energy is almost 
constant) and the theoretical values of spectroscopic constants agree quite well with 
experimental values. (In vacuo, an electron ultimately transfers from Cl⁻ to Na⁺, and 
the experimental curve leads to neutral dissociation products, whereas the HF curve 
does not. This theoretical error affects the curve only at large R, however, and so has 
little effect on spectroscopic constants.) Schaefer [13] has reviewed this situation. 

Of course, ab initio calculations have been performed on molecular systems much 
larger than the molecules referred to above. However, as one moves to molecules 
having four or more nuclei, one encounters a new difficulty: Integrals now appear that 
have the form 


$$(ab\,|\,cd) = \langle \chi_a(1)\,\chi_b(2)\,|\,1/r_{12}\,|\,\chi_c(1)\,\chi_d(2) \rangle \qquad \text{(11-52)}$$


where $\chi_a$ is a basis function located on nucleus a, etc. Such integrals have basis 
functions on four different nuclei and are referred to as four-center integrals. If the basis 
functions $\chi$ are STOs, such integrals are relatively slow to evaluate on a computer. If 
they are gaussian functions, the computation is much faster, and this is the main reason 
for using gaussian basis functions. But the number of such integrals becomes enormous 
for a reasonable basis set and a medium sized molecule. In fact, the number of such 
integrals grows as the fourth power of the number of basis functions. Thus, replacing each 
STO by, say, three gaussian functions, will lead to $3^4 = 81$ times as many integrals to 
evaluate. Even though such integrals can be evaluated very rapidly, we eventually come to 
molecules of such a size that the sheer number of integrals makes for a substantial 
computing effort. The efficient calculation of molecular integrals continues to be an active 
research area, however, and new techniques have now diminished the importance of this 
bottleneck with reasonably sized gaussian basis sets on systems up to hundreds of atoms. 
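
The fourth-power growth is easy to tabulate. The following sketch is an illustration only; the function name and the `use_symmetry` flag are hypothetical. It counts two-electron integrals over N real basis functions, either naively (N^4) or after removing duplicates related by the standard eightfold permutational symmetry, for which the count is M(M + 1)/2 with M = N(N + 1)/2.

```python
def n_two_electron_integrals(n_basis, use_symmetry=True):
    """Number of two-electron integrals over n_basis real basis functions.

    Without symmetry every index quadruple counts, giving n_basis**4.
    With the eightfold permutational symmetry of real functions, only
    M(M + 1)/2 integrals are distinct, where M = n_basis(n_basis + 1)/2
    is the number of distinct function pairs.
    """
    if not use_symmetry:
        return n_basis ** 4
    n_pairs = n_basis * (n_basis + 1) // 2
    return n_pairs * (n_pairs + 1) // 2


if __name__ == "__main__":
    for n in (10, 30, 100, 264):
        naive = n_two_electron_integrals(n, use_symmetry=False)
        unique = n_two_electron_integrals(n)
        print(f"N = {n:4d}   N^4 = {naive:14d}   distinct = {unique:13d}")
    # Tripling the number of functions multiplies the naive count by 3**4.
    ratio = n_two_electron_integrals(30, False) // n_two_electron_integrals(10, False)
    print("ratio for tripled basis:", ratio)
```

Symmetry reduces the count by a factor approaching eight for large N, but the fourth-power scaling itself remains.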

Modern quantum chemical programs have made high-quality calculations on rea- 
sonably large molecules tractable, but one is always balancing the level of accuracy 
against the computer time needed to achieve it. While a Hartree-Fock calculation on 
benzene with a cc-pVTZ basis set (264 contracted gaussian functions) might require 
just 4 minutes to complete on a given computer, inclusion of electron correlation at 
the MP2, CCSD, and CCSD(T) levels would require an additional 0.1, 4.3, and 11 times 
4 minutes, respectively. 
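
The timing figures quoted above translate into total wall times as follows. This is a small arithmetic sketch under the assumption that each correlated calculation is run after, and in addition to, the 4-minute Hartree-Fock step; the names used are illustrative.

```python
# Hartree-Fock time for benzene/cc-pVTZ quoted in the text (minutes).
HF_MINUTES = 4.0

# Additional cost of each correlated method, as a multiple of the HF time.
EXTRA_FACTOR = {"MP2": 0.1, "CCSD": 4.3, "CCSD(T)": 11.0}


def total_minutes(method):
    """Total time = HF time + additional correlation time (assumed additive)."""
    if method == "HF":
        return HF_MINUTES
    return HF_MINUTES * (1.0 + EXTRA_FACTOR[method])


if __name__ == "__main__":
    for method in ("HF", "MP2", "CCSD", "CCSD(T)"):
        print(f"{method:8s} {total_minutes(method):5.1f} min")
```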

Numerous calculations have been reported for barriers to internal rotation in various 
molecules. The theoretical barriers agree best with experiment for molecules having 
threefold symmetry in the rotor. Self-consistent-field values are compared with experi- 
mental barrier values in Table 11-11. In every case, the theoretical energy curve predicts 
the correct stable conformation and even does reasonably well at predicting barrier 
height. The disagreement between different computed values of the barrier for the 
same molecule reflects differences in basis sets and, sometimes, differences in choices 
for bond length and angle made by different workers. The evidence to date suggests 
that ab initio calculations approaching the HF limit will ordinarily be within 20% of 
the experimental barrier. Even this level of accuracy is useful because experimental 
measurements of barriers in transient molecules or for excited molecules are often very 
rough, ambiguous, or nonexistent. Given the favorable cost and relative accuracy of 
TABLE 11-11 » Internal Rotation Barriers from Experiment and as 
Calculated by the LCAO-MO-SCF Method 

                          Barrier (kcal/mole)
Molecule                  SCF      Experiment    Reference

CH3-CH3                   2.58     2.88          [33]
                          2.88     -             [34]
CH3-NH2                   1.12     1.98          [35]
                          2.02     -             [34]
CH3-OH                    1.59     1.07          [34]
CH3-CH2F                  2.59     3.33          [33]
CH3-N=O                   1.05     1.10          [36]
CH3-CH=CH2                1.25     1.99          [37]
cis-CH3-CH=CFH            1.07     1.06          [37]
trans-CH3-CH=CFH          1.34     2.20          [37]
CH3-CH=O                  1.09     1.16          [38]


DFT approaches compared to HF, even higher quality results might be expected with 
the use of methods such as B3LYP; hence, DFT is often now the method of choice for 
calculations on medium to large organic molecules. 

A large number of ab initio calculations have been made on clusters of molecules. 
Many of these have sought to delineate the distance and angle dependence of the 
hydrogen bond strength between molecules like water or hydrogen fluoride. Xantheas et al. 
[39] have reported large basis set MP2 calculations on small water clusters, (H2O)n, 
where n ranged from 2 to 6. These calculations predict that there are four distinct isomers 
of the water hexamer (n = 6) whose relative energies lie within ~1 kcal/mole of each 
other. These kinds of results are very useful in defining new effective interaction 
potentials involving water that can be used in large-scale molecular simulations of 
solvation phenomena. Re et al. [40] have calculated the structures and relative energies 
of sulfuric acid solvated by 1 to 5 water molecules using the B3LYP method to provide a 
fundamental understanding of acid ionization. In addition to investigating the interaction 
of water with both the cis and trans conformers of H2SO4, they found that just five 
water molecules were sufficient to make dissociation into HSO4⁻ and H3O⁺ energetically 
favorable. The field of materials science is also benefitting from ab initio calculations, 
and studies of metal clusters and their adsorbates are currently areas of high interest. 

A great deal of attention has been given to the calculation by ab initio methods of 
energy surfaces for chemical reactions. For many years, such efforts were limited to 
reactions, such as D + H2 → HD + H, which involve only a small number of electrons 
and nuclei. Much more complicated systems are now being explored. 

In setting out to perform such a calculation, one likes to have some idea of whether the 
correlation energy of the system will change significantly with nuclear configuration. 
If it does not, then a Hartree-Fock or MCSCF calculation will parallel the true energy 
surface. If the correlation energy does change, it is necessary to include some treatment 
of electron correlation in the calculation. 




As a rough rule of thumb, one expects the correlation energy to change least when the 
reactants, the intermediate or transition state complex, and the products are all 
closed-shell systems, hence all approximately equally well described by a single-determinantal 
wavefunction. Some calculations on S_N2 and radical reactions are summarized in 
Table 11-12. It can be seen that the S_N2 reactions, which do involve closed-shell 
systems in the three stages mentioned above, are fairly insensitive to the inclusion of CI, 
whereas the radical reactions undergo extensive change of correlation energy. 

The determination of the potential energy surface for the unimolecular rearrangement 
HOCl → HClO by Peterson et al. [43] provides an example of a very accurate 
and exhaustive calculation on a fairly small molecule. Because there are only three 
nuclei, there are only three structural variables to explore, so the number of calculations 
needed to map out the surface is not too large. (Note that, with three geometric variables, 
the energy "surface" is really a four-dimensional hypersurface.) These authors 
were also interested in the reactions occurring on this surface, i.e., Cl + OH → HCl + O 
and Cl + OH → ClO + H, which required a global representation of the surface that 
was constructed from over 1500 individual energies. Since the full energy surface 
involves bond breaking processes, MCSCF and MRCISD methods were utilized. Accurate 
relative energetics between HOCl, HClO, and the various dissociation asymptotes 
were obtained by carrying out calculations with a series of three correlation consistent 
basis sets at each geometry. This produced an approximate complete basis set (CBS) 
MRCISD energy surface. At the MRCISD CBS limit, HOCl was found to be more 
stable than HClO by 53.7 kcal/mole and the barrier for HOCl → HClO was predicted 
to be 73.5 kcal/mole above the HOCl minimum. 

After determining an analytical representation of this surface from the individual 
energies, these authors carried out calculations of the full anharmonic vibrational 
spectrum of HOCl and HClO by solving the Schrödinger equation for nuclear motion. 
The HClO molecule has not yet been experimentally observed, but these calculations 
predict that the lowest three vibrational levels of this species lie below its dissociation 
threshold, so it should be detectable. 


TABLE 11-12 » Reaction Barrier Energies for Reactions as Calculated by ab Initio Methods 

                                                                       Reaction barrier (kcal/mole)
Reactant     Transition   Product      Reaction type                   SCF     CI (no. config.)   Reference

H⁻ + CH4     (CH5)⁻       CH4 + H⁻     S_N2                            59.3    55.2 (6271)        [41]
F⁻ + CH3F    (FCH3F)⁻     CH3F + F⁻    S_N2                             5.9     5.9 (26010)       [41]
H· + CH4     CH5·         CH3· + H2    Radical abstraction (axial)     35.2    18 (692)           [42]
H· + CH4     CH5·         CH4 + H·     Radical exchange (inversion)    63.7    41.7 (692)         [42]
The decisions regarding basis set and level of correlation can be daunting and in 
the past this sometimes discouraged nonspecialists from taking advantage of ab initio 
methods. However, there is now a wide range of programs available, which 
have made ab initio calculations accessible to theoreticians and experimentalists alike. 
The best known of these is undoubtedly the GAUSSIAN series of programs originally 
developed by the group of J.A. Pople. In this and other programs, one can conveniently 
choose from a large variety of available basis sets and methods to carry out energy 
evaluations or geometry optimizations and harmonic frequency calculations. These 
programs have brought about a revolution in the way that chemical research is done. 

For small molecules (~1–5 nonhydrogen atoms) ab initio methods are sometimes 
more precise and reliable than experiment, especially for unstable systems. The saga 
of the energy difference between ground and excited CH2 is one of the best known of 
these experimental-theoretical confrontations.¹⁸ 

In summary, ab initio calculations provide useful data on bond lengths and angles, 
molecular conformation and internal rotation barriers, for ground and excited states 
of molecules. They are also very useful for calculating accurate thermochemistry, 
ionization energies, oscillator strengths, dipole moments (as well as other one-electron 
properties) and excitation energies. If one has access to large blocks of computer time, 
ab initio calculations can reveal the nature of energy surfaces pertaining to chemical 
reactions or molecular associations, as in fluids. The accuracy of the calculation and 
the magnitude of the system are limited ultimately by computer speed and capacity. 
At the beginning of this chapter it was stated that ab initio calculations require exact 
calculation of all integrals contributing to the elements of the Fock matrix, but we have 
seen that, as we encounter systems with more and more electrons and nuclei, the number 
of three- and four-center two-electron integrals becomes enormous, driving the cost of 
the calculation out of the reach of most researchers. This has led to efforts to find sen- 
sible and systematic simplifications to the LCAO-MO-SCF method—simplifications 
that remain within the general theoretical SCF framework but shorten computation of 
the Fock matrix. 

Since many of the multicenter two-electron integrals in a typical molecule have very 
small values, the obvious solution to the difficulty is to ignore such integrals. But we 
wish to ignore them without having to calculate them to see which ones are small since, 
after all, the reason for ignoring them is to avoid having to calculate them. Furthermore, 
we want the selection process to be linked in a simple way to considerations of basis set. 
That is, when we neglect certain integrals, we are in effect omitting certain interactions 
between basis set functions, which is equivalent to omitting some of our basis functions 
part of the time. It is essential that we know exactly what is involved here, or we may 
obtain strange results such as, for example, different energies for the same molecule 
when oriented in different ways with respect to Cartesian coordinates. 

A number of variants of a systematic approach meeting the above criteria have been 
developed by Pople and co-workers, and these are now widely used. The approximations 


18 See Goddard [44] and Schaefer [45]. 
are based on the idea of neglect of differential overlap between atomic orbitals in 
molecules. 

Differential overlap dS between two AOs, $\chi_a$ and $\chi_b$, is the product of these functions 
in the differential volume element dv: 


$$dS = \chi_a(1)\,\chi_b(1)\,dv \qquad \text{(11-53)}$$


The only way for the differential overlap to be zero in dv is for $\chi_a$ or $\chi_b$, or both, to 
be identically zero in dv. Zero differential overlap (ZDO) between $\chi_a$ and $\chi_b$ in all 
volume elements requires that $\chi_a$ and $\chi_b$ can never be nonzero in the same region, that is, 
the functions do not "touch." It is easy to see that, if there is ZDO between $\chi_a$ and $\chi_b$ 
(understood to apply in all dv), then the familiar overlap integral S must vanish too. 
The converse is not true, however. S is zero for any two orthogonal functions even if 
they touch. An example is provided by an s and a p function on the same center. 
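
The distinction between vanishing overlap and vanishing differential overlap can be seen in a simple numerical sketch. The model below is an assumption made for illustration, not taken from the text: a one-dimensional cut along the z axis, with an even gaussian standing in for the s function and z times the same gaussian standing in for the p function on the same center. The integrated overlap cancels, yet the pointwise product is far from zero.

```python
import math


def s_like(z):
    """Even function standing in for an s orbital along the z axis."""
    return math.exp(-z * z)


def p_like(z):
    """Odd function standing in for a p_z orbital on the same center."""
    return z * math.exp(-z * z)


def overlap_and_max_differential(f, g, half_width=6.0, n_points=2001):
    """Return (integrated overlap, largest |f*g|) on a symmetric grid.

    The overlap is a simple rectangle-rule sum of f(z) g(z) dz, and the
    second value is the largest magnitude of the pointwise product,
    i.e., of the differential overlap per unit dz.
    """
    dz = 2.0 * half_width / (n_points - 1)
    total = 0.0
    largest = 0.0
    for i in range(n_points):
        z = -half_width + i * dz
        product = f(z) * g(z)
        total += product * dz
        largest = max(largest, abs(product))
    return total, largest


if __name__ == "__main__":
    overlap, largest = overlap_and_max_differential(s_like, p_like)
    print(f"overlap integral S      = {overlap:.2e}")
    print(f"largest pointwise value = {largest:.4f}")
```

Orthogonality here is a cancellation between the positive and negative lobes of the product, not an absence of interpenetration.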

It is a much stronger statement to say that $\chi_a$ and $\chi_b$ have ZDO than it is to say 
they are orthogonal. Indeed, it is easy to think of examples of orthogonal AOs but 
impossible to think of any pair of AOs separated by a finite or zero distance and having 
ZDO. Because AOs decay exponentially, there is always some interpenetration. 

The attractive feature of the ZDO approximation is that it causes all three- and 
four-center integrals to vanish. Thus, in a basis set of AOs $\chi$ having ZDO, the integral 
$\langle \chi_a(1)\chi_b(2)|1/r_{12}|\chi_c(1)\chi_d(2)\rangle$ will vanish unless a = c and b = d. This arises from 
the fact that, if $a \neq c$, $\chi_a(1)\chi_c(1)$ is identically zero, and this forces the integrand to 
vanish everywhere, regardless of the value of $(1/r_{12})\chi_b(2)\chi_d(2)$. 
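
The bookkeeping consequence of ZDO can be illustrated by brute-force enumeration. In this sketch (a hypothetical helper, assuming ZDO between every pair of distinct basis functions), each index quadruple (a, b, c, d) is kept only if a = c and b = d. The survivors number N^2 rather than N^4, and none of them involves more than two centers.

```python
from itertools import product


def zdo_survivors(centers):
    """List index quadruples (a, b, c, d) that survive under ZDO.

    centers[i] is the atom on which basis function i sits. Under ZDO the
    product chi_a(1) chi_c(1) vanishes unless a == c, and
    chi_b(2) chi_d(2) vanishes unless b == d.
    """
    n = len(centers)
    kept = []
    for a, b, c, d in product(range(n), repeat=4):
        if a == c and b == d:
            kept.append((a, b, c, d))
    return kept


def n_centers(quadruple, centers):
    """Number of distinct atoms involved in one integral."""
    return len({centers[i] for i in quadruple})


if __name__ == "__main__":
    # Four atoms (0..3), each carrying two basis functions.
    centers = [0, 0, 1, 1, 2, 2, 3, 3]
    kept = zdo_survivors(centers)
    print("all quadruples:     ", len(centers) ** 4)
    print("surviving under ZDO:", len(kept))
    print("max centers involved:", max(n_centers(q, centers) for q in kept))
```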

It is not within the scope of this book to give a detailed description or critique of 
the numerous computational methods based on ZDO assumptions. An excellent mono- 
graph [46] on this subject including program listings is available. Some of the acronyms 
for these methods are listed in Table 11-13. In general, these methods have been popu- 
lar because they are relatively cheap to use and because they predict certain properties 
(bond length, bond angle, energy surfaces, electron spin resonance hyperfine splittings, 
molecular charge distributions, dipole moments, heats of formation) reasonably well. 
However, they generally do make use of some parameters evaluated from experimental 
data, and some methods are biased toward good predictions of some properties, while 
other methods are better for other properties. For a given type of problem, one must 
exercise judgment in choosing a method. 

As an example of the sort of chemical system that becomes accessible to study using 
such methods, we cite the valence-electron CNDO/2 calculations of Maggiora [56] 
on free base, magnesium, and aquomagnesium porphines. Such calculations enable 
us to examine the geometry of the complex (i.e., is the metal ion in or out of the 
molecular plane, and how is the water molecule oriented?), the effects of the metal 
ion on ionization energies, spectra, and orbital energy level spacings, and the detailed 
nature of charge distribution in the system. 

Use of a combination of methods is often convenient. Novoa and Whangbo [57] 
studied theoretically the relative stabilities of di- and triamides in various hydrogen-bonded 
and nonhydrogen-bonded conformations, in both the absence and presence of solvent 
(CH2Cl2) molecules. There are many structural parameters to optimize in each of 
the conformations, and so high-level ab initio calculations for energy minimization of 
each class of structure would be prohibitively expensive. Instead, AM1 was used to 
determine the optimum geometry for each configuration, and then ab initio calculations 
TABLE 11-13 » Acronyms for Common Approximate SCF Methods 

Acronym      Description

CNDO/1       Complete neglect of differential overlap. Parametrization scheme no. 1 
             (Pople and Segal [47]). 

CNDO/2       Parametrization scheme no. 2. Considered superior to CNDO/1 (Pople 
             and Segal [48]). 

CNDO/BW      Similar to above with parameters selected to give improved molecular 
             structures and force constants. (See Pulfer and Whitehead [49] and 
             references therein.) 

INDO         Intermediate neglect of differential overlap. Differs from CNDO in that 
             ZDO is not assumed between AOs on the same center in evaluating 
             one-center integrals. This method is superior to CNDO methods for 
             properties, such as hyperfine splitting, or singlet-triplet splittings, 
             which are sensitive to electron exchange (Pople et al. [50]). 

MINDO/3      Modified INDO, parameter scheme no. 3. Designed to give accurate 
             heats of formation (Bingham et al. [51] and also Dewar [52]). 

NDDO         Neglect of diatomic differential overlap. Assumes ZDO only between 
             AOs on different atoms (Pople et al. [53]). 

MNDO         Modified neglect of diatomic overlap. A semiempirically parametrized 
             version of NDDO. Yields accurate heats of formation and many 
             other molecular properties, but fails to successfully account for 
             hydrogen bonding (Dewar and Thiel [54]). 

AM1          "Austin Model 1." A more recent parametrization of NDDO that 
             overcomes the weakness of MNDO in that it successfully treats 
             hydrogen bonding. (Dewar et al. [55].) 


(e.g., 6-31G** with MP2) were done for a few near-optimum geometries for each 
conformation to check the AM1 results. 

Additional helpful information on standard programs available at ab initio and 
semiempirical levels—where to get them, how to use them, what they have been used 
for—is available in the very well-written reference handbook by Clark [58]. 


11-17.A Problems 


11-1. Use the data in Table 11-3 to calculate the theoretical transition energies for Ne⁺ 
when 1s and 2s electrons are excited into the 2p level. The experimental values 
are 2p ← 2s, 0.989 a.u.; 2p ← 1s, 31.19 a.u. 


11-2. Use the data in Table 11-1 to estimate separately the errors in ionization energies 
for the three states due to 


a) omission of electron correlation. 
b) failure to allow electronic relaxation. 


11-3. In Section 11-11, it is argued that neglect of electron correlation and electronic 
relaxation in setting $IP = -\epsilon_i$ causes errors of opposite sign that partly cancel. 
Would this also occur when Koopmans' theorem is used to predict electron 
affinities? Why? 

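11-4. Demonstrate that, if 

$$D_1 = \begin{vmatrix} a & c \\ b & d \end{vmatrix} \quad \text{and} \quad D_2 = \begin{vmatrix} a & e \\ b & f \end{vmatrix}, \quad \text{then} \quad D_1 + \lambda D_2 = \begin{vmatrix} a & c + \lambda e \\ b & d + \lambda f \end{vmatrix}$$


11-5. A singly excited configuration $\psi_1$ differs by one orbital from the ground state 
$\psi_0$ and also by one orbital from certain doubly excited configurations $\psi_2$. 
Brillouin's theorem gives $\langle \psi_0 | H | \psi_1 \rangle = 0$, but not $\langle \psi_1 | H | \psi_2 \rangle = 0$. Where does 
the attempted proof to show that $\langle \psi_1 | H | \psi_2 \rangle = 0$ break down? 


11-6. Show that, if $\psi = c_0\psi_0 + c_1\psi_1 + c_2\psi_2 + \cdots + c_n\psi_n$, and if $\psi$ is to be an 
eigenfunction of A with eigenvalue a, then it is necessary that all the $\psi_i$ (i = 
0, ..., n) also be eigenfunctions of A with eigenvalues a. 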


11-7. How many distinct four-center coulomb and exchange integrals result when one 
has four nuclei, each being the site of five basis functions? Make no assumptions 
about symmetry or basis function equivalence or electron spin. 


11-8. For a homonuclear diatomic molecule, which of the following singly excited 
configurations would be prevented for reasons of symmetry from contribut- 
ing to a CI wavefunction for which the main "starting configuration" is 
la? lor2os lur? 


a) 102107202 It; lrg (i.e., Lu > Lg) 


b) 2o, > 30g 
c) 20, > Img 
d) log — 30g 


11-9. Write down the hamiltonian operator for electrons in the water molecule. Use 
summation signs with explicit index ranges. Use atomic units. 


11-10. An SCF calculation on ground state H2 at R = 1.40 a.u. using a minimal basis 
set gives a $\sigma_g$ and a $\sigma_u$ MO having energies 

$$\epsilon_{\sigma_g} = -0.619\ \text{a.u.} \qquad \epsilon_{\sigma_u} = +0.401\ \text{a.u.}$$

The nonvanishing two-electron integrals over these MOs are 

$$\iint \sigma_g(1)\sigma_g(2)(1/r_{12})\sigma_g(1)\sigma_g(2)\,dv(1)\,dv(2) = 0.566\ \text{a.u.}$$
$$\iint \sigma_g(1)\sigma_u(2)(1/r_{12})\sigma_g(1)\sigma_u(2)\,dv(1)\,dv(2) = 0.558\ \text{a.u.}$$
$$\iint \sigma_g(1)\sigma_u(2)(1/r_{12})\sigma_g(2)\sigma_u(1)\,dv(1)\,dv(2) = 0.140\ \text{a.u.}$$
$$\iint \sigma_u(1)\sigma_u(2)(1/r_{12})\sigma_u(1)\sigma_u(2)\,dv(1)\,dv(2) = 0.582\ \text{a.u.}$$

a) Write down the Slater determinant for the ground state of H2. 

b) Calculate the SCF electronic energy for H2 at R = 1.40 a.u. 

c) Calculate the total (electronic plus nuclear repulsion) energy for H2. 

d) What is the bond energy for H2 predicted by this calculation, assuming that 
the minimum total energy occurs at R = 1.40 a.u.? 

e) Estimate the (vertical) ionization energy for H2. 

f) What is the value of the kinetic-plus-nuclear-attraction energy for one 
electron in ground-state H2 according to this calculation? 
Chapter 12 


Time-Independent 
Rayleigh-Schrödinger 
Perturbation Theory 


12-1 An Introductory Example 


Imagine a city having one million resident wage earners. The city government plans 
to raise additional revenue by assessing each such resident a wage tax. This tax will 
not apply to wage earners residing in suburbs. The government estimates that a $10 
assessment will bring in new revenues of $10 million. This estimate assumes that the 
city population before imposition of the tax will hold after the tax is imposed as well. 
But the tax will produce a slight change, or perturbation, in the economic climate of 
the city. It is true that the tax is small, and so few people are likely to move to the 
suburbs as a result of it. Therefore, it is probably fairly accurate to use the population 
of the city before the perturbation to calculate the change in revenue brought about 
by the perturbation. But, if the perturbation were large, say $1000 a head, we would 
expect a substantial migration of wage earners to the suburbs, and so the estimate 
produced by using the original unperturbed population would contain substantial error. 
Corrections should be made, therefore, to account for population changes produced by 
the perturbation. 

This use of the unperturbed population to calculate the change in revenue is a crude 
example of a certain level of estimation (called "first order") in Rayleigh-Schrödinger 
perturbation theory.¹ We will now proceed to develop the theory more formally in 
the context of wavefunctions and energies. The above example has been presented to 
encourage the reader to anticipate that there is a lot of simple good sense in the results of 
perturbation theory even though the mathematical development is rather cumbersome 
and unintuitive. 

¹Other perturbation methods exist, but the Rayleigh-Schrödinger (R-S) theory is the oldest and the most widely 
used in quantum chemistry. 


12-2 Formal Development of the Theory for Nondegenerate States 


Perturbation theory involves starting with a system with known hamiltonian, eigenvalues, 
and eigenfunctions, and calculating the changes in these eigenvalues and eigenfunctions 
that result from a small change, or perturbation, in the hamiltonian for the 
system. We restrict the discussion to stationary states of systems having hamiltonians 
that are not time dependent. Let the known, unperturbed system have $H_0$ as hamiltonian, 
and let the eigenfunctions $\psi_i$ be orthonormal. Then 


$$H_0\psi_i = E_i\psi_i \tag{12-1}$$
$$\int \psi_i^*\psi_j\,d\tau = \delta_{ij} \tag{12-2}$$


and the functions $\psi$ form a complete set as discussed in Chapter 6. We are interested 
in the system with hamiltonian 


$$H = H_0 + \lambda H' \tag{12-3}$$


where $\lambda H'$ is the perturbation.² (The parameter $\lambda$ is a scalar quantity that will be 
convenient in the mathematical development of the theory. When the derivation is 
complete, we will set $\lambda$ equal to unity so that it no longer appears explicitly in any 
formula, and then $H'$ must account entirely for the perturbation.) The eigenvalues and 
eigenfunctions of the perturbed hamiltonian $H$ are unknown. Let us symbolize them 
as $W_i$ and $\phi_i$, respectively. Then 


$$H\phi_i = W_i\phi_i \tag{12-4}$$


It is clear that, when $\lambda = 0$, then $H = H_0$, $\phi_i = \psi_i$, and $W_i = E_i$. As $\lambda$ is increased 
from zero, $W_i$ and $\phi_i$ change in (we expect) a continuous way. In other words, $W_i$ 
and $\phi_i$ are continuous functions of the variable parameter $\lambda$, and they are known at the 
particular value $\lambda = 0$. Therefore, we can expand them³ as series in powers of $\lambda$ about 
the point $\lambda = 0$. Thus, for a given state $i$,⁴ 


$$W_i = W_i^{(0)} + \lambda W_i^{(1)} + \lambda^2 W_i^{(2)} + \lambda^3 W_i^{(3)} + \cdots \tag{12-5}$$


Here we must remember that $\lambda$ is a variable and $W_i^{(0)}$, $W_i^{(1)}$, etc. are constants. The 
superscript in parentheses is simply a label to tell us for which power of $\lambda$ this constant 
is the coefficient. Since we know that $W_i = E_i$ when $\lambda = 0$, we see at once that $W_i^{(0)}$ in 
Eq. (12-5) is equal to $E_i$. Our problem is to evaluate $W_i^{(1)}$, $W_i^{(2)}$, etc. It is traditional 
to call $W_i^{(0)}$ (or $E_i$) the unperturbed energy or, sometimes, the energy to zeroth order. 
$\lambda W_i^{(1)}$ (which is just $W_i^{(1)}$ after $\lambda$ is set equal to unity) is the first-order correction to 
the energy, and $W_i^{(0)} + \lambda W_i^{(1)}$ is the energy to first order. $\lambda^2 W_i^{(2)}$, $\lambda^3 W_i^{(3)}$, etc. 
are the second-, third-, etc., order corrections to the energy. Normally, for expansion 
in a series to be useful, it is necessary for the series to converge at a reasonable rate. 
In most simple applications of perturbation theory, only a few orders of correction are 
made. Thus, one very commonly reads of energies calculated “to first order,” or “to 
second order.” Calculations to much higher orders are also made, but these are not as 
common. 


²Some treatments expand $H$ as $H_0 + \lambda H' + \lambda^2 H'' + \cdots$ and ultimately achieve working formulas that appear 
different from those we will achieve. In fact, they are equivalent. For an example of this alternative formulation, 
see Pauling and Wilson [1, Chapter 6]. 

³In effect, we assume $\phi_i$ and $W_i$ to be analytic functions of $\lambda$ in the range $0 \le \lambda \le 1$. 

⁴It is important to recognize that henceforth the subscript $i$ will refer to the state that we are studying as a 
function of $\lambda$. 

In precisely the same manner, we can expand the unknown eigenfunction $\phi_i$ as a 
power series in $\lambda$: 


$$\phi_i = \phi_i^{(0)} + \lambda\phi_i^{(1)} + \lambda^2\phi_i^{(2)} + \lambda^3\phi_i^{(3)} + \cdots \tag{12-6}$$


Since $\phi_i$ is a function of particle coordinates, $\phi_i^{(0)}$, $\phi_i^{(1)}$, etc. are also functions, but 
they are invariant to changes in $\lambda$. Again, it is clear that $\phi_i^{(0)} = \psi_i$. $\phi_i^{(0)}$ (or $\psi_i$) is 
the unperturbed or zeroth-order wavefunction, $\lambda\phi_i^{(1)}$ is the first-order correction to the 
wavefunction, etc. 


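Now we substitute Eqs. (12-3), (12-5), and (12-6) into (12-4) and obtain 


$$(H_0 + \lambda H')\left(\phi_i^{(0)} + \lambda\phi_i^{(1)} + \lambda^2\phi_i^{(2)} + \cdots\right) = \left(W_i^{(0)} + \lambda W_i^{(1)} + \lambda^2 W_i^{(2)} + \cdots\right)\left(\phi_i^{(0)} + \lambda\phi_i^{(1)} + \lambda^2\phi_i^{(2)} + \cdots\right) \tag{12-7}$$


The variable in Eq. (12-7) is $\lambda$, and each power of $\lambda$ is linearly independent of all other 
powers of $\lambda$. As indicated in Section 3-4D, this means that Eq. (12-7) can be satisfied 
for all values of $\lambda$ only if it is satisfied for each power of $\lambda$ separately. Collecting terms 
having the zeroth power of $\lambda$ gives 


$$H_0\phi_i^{(0)} = W_i^{(0)}\phi_i^{(0)} \tag{12-8}$$

However, we have already recognized that 

$$\phi_i^{(0)} = \psi_i, \qquad W_i^{(0)} = E_i \tag{12-9}$$


and Eq. (12-8) is simply a restatement of Eq. (12-1). Collecting terms from Eq. (12-7) 
containing $\lambda$ to the first power we obtain 


$$\lambda\left(H'\phi_i^{(0)} + H_0\phi_i^{(1)} - W_i^{(0)}\phi_i^{(1)} - W_i^{(1)}\phi_i^{(0)}\right) = 0 \tag{12-10}$$


This equality must hold for any value of $\lambda$, so the term in parentheses is zero. Hence, 
rearranging and making use of Eqs. (12-9), we have the first-order equation 


$$(H' - W_i^{(1)})\psi_i + (H_0 - E_i)\phi_i^{(1)} = 0 \tag{12-11}$$

Let us multiply this from the left by $\psi_i^*$ and integrate: 

$$\int\psi_i^* H'\psi_i\,d\tau - W_i^{(1)}\int\psi_i^*\psi_i\,d\tau + \int\psi_i^* H_0\phi_i^{(1)}\,d\tau - E_i\int\psi_i^*\phi_i^{(1)}\,d\tau = 0 \tag{12-12}$$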


Using the hermitian property of $H_0$, it is easy to show that the third and fourth terms 
cancel, leaving 


$$W_i^{(1)} = \int\psi_i^* H'\psi_i\,d\tau \tag{12-13a}$$

Thus we have arrived at an expression for the first-order correction to the energy in 
terms of known quantities. It is the expectation value for the perturbation operator 
calculated using the wavefunction of the unperturbed system. The analogy between 
this formula and the use of the population of the unperturbed city to calculate additional 
revenues from a new tax should be apparent. $H'$ corresponds to the tax per wage earner, 
and $\int\psi_i^*\psi_i\,d\tau$ corresponds to the sum of wage earners in the city before the tax was 
imposed. 
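
The cancellation used in passing from Eq. (12-12) to Eq. (12-13a) can be spelled out in a few lines. The following is a sketch of that step, assuming only that $H_0$ is hermitian, that its eigenvalue $E_i$ is real, and that $\psi_i$ is normalized as in Eq. (12-2):

```latex
\begin{align*}
\int \psi_i^* H_0 \phi_i^{(1)}\,d\tau
  &= \int \left(H_0\psi_i\right)^* \phi_i^{(1)}\,d\tau
     && \text{(hermitian property of } H_0\text{)} \\
  &= E_i \int \psi_i^* \phi_i^{(1)}\,d\tau
     && \text{(Eq. 12-1, with } E_i \text{ real)} \\[4pt]
\text{so Eq. (12-12) becomes}\quad
\int \psi_i^* H' \psi_i\,d\tau
  &- W_i^{(1)} \underbrace{\int \psi_i^*\psi_i\,d\tau}_{=\,1} = 0 .
\end{align*}
```

The third and fourth terms of Eq. (12-12) are therefore equal and opposite, and the normalization of $\psi_i$ removes the integral multiplying $W_i^{(1)}$.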


EXAMPLE 12-1 A one-dimensional box potential is perturbed so that it is raised by 
a constant amount, $\delta$, in the left half of the box, and lowered by $\delta$ in the right half. 
What is the first-order change in energy for the lowest-energy state $\psi_1$? For $\psi_2$? 


SOLUTION ► $H'$ is antisymmetric for reflection through the box center, and $\psi_1^2$ is symmetric, 
so the integral giving $W_1^{(1)}$ equals zero by symmetry. The same argument applies to $\psi_2$. Another 
way to argue is to recognize that $\psi_n^2$ is symmetric in the box (for all $n$), which means that half of 
the distribution responds to the energy increase and half to the equal energy decrease, so the net 
energy change to first order is zero for all states. ◄ 
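
The symmetry argument of Example 12-1 can be checked numerically. The sketch below is only an illustration: the box length `L`, the step height `DELTA`, and the helper names are assumptions made for the sketch, and a simple midpoint rule stands in for the integral of Eq. (12-13a).

```python
import math

L = 1.0       # box length in a.u. (assumed value for illustration)
DELTA = 0.05  # step height of the perturbation in a.u. (assumed value)

def psi(n, x):
    """Normalized particle-in-a-box eigenfunction psi_n on 0 <= x <= L."""
    return math.sqrt(2.0 / L) * math.sin(n * math.pi * x / L)

def h_prime(x):
    """Perturbation of Example 12-1: +DELTA in the left half, -DELTA in the right half."""
    return DELTA if x < L / 2.0 else -DELTA

def first_order_energy(n, points=2000):
    """Midpoint-rule estimate of W_n^(1) = <psi_n|H'|psi_n>, Eq. (12-13a)."""
    dx = L / points
    total = 0.0
    for k in range(points):
        x = (k + 0.5) * dx
        total += psi(n, x) ** 2 * h_prime(x) * dx
    return total

shifts = {n: first_order_energy(n) for n in (1, 2, 3)}
for n, w1 in shifts.items():
    print(f"n = {n}: W(1) = {w1:.2e} a.u.")
```

Each printed value should be zero to within rounding error, whatever values are chosen for `L` and `DELTA`, because the contributions from the two halves of the box cancel.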


Equation (12-13a) can be written in bra-ket notation (see Appendix 9): 


$$W_i^{(1)} = \langle\psi_i|H'|\psi_i\rangle \tag{12-13b}$$


or in the notation wherein an integral is indicated by affixing subscripts to the operator 
(i.e., as a matrix element) 


$$W_i^{(1)} = H'_{ii} \tag{12-13c}$$


In most discussions of perturbation theory one of these alternative notations is used. 
We will continue our formal development from this point using bra-ket notation for 
integrals. 

To find an expression for $\phi_i^{(1)}$, the first-order correction to the wavefunction for the 
$i$th state, we first recognize that we can expand $\phi_i^{(1)}$ in terms of the complete set of 
eigenfunctions $\psi$: 


$$\phi_i^{(1)} = \sum_j c_{ji}^{(1)}\psi_j \tag{12-14}$$


The summation symbol suggests that $\psi$ is a discrete set of functions. This need not be 
true. Contributions from functions whose eigenvalues are in a continuum would require 
integration rather than summation. However, we will use the sum symbol since most 
actual applications of perturbation theory in quantum chemistry invoke only discrete 
functions. 

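We now insert Eq. (12-14) into (12-11) to obtain 


$$(H' - W_i^{(1)})\psi_i + (H_0 - E_i)\sum_j c_{ji}^{(1)}\psi_j = 0 \tag{12-15}$$

Multiplying from the left by $\psi_k^*$ and integrating yields 


$$\langle\psi_k|H'|\psi_i\rangle - W_i^{(1)}\langle\psi_k|\psi_i\rangle + \sum_j c_{ji}^{(1)}\left(\langle\psi_k|H_0|\psi_j\rangle - E_i\langle\psi_k|\psi_j\rangle\right) = 0 \tag{12-16}$$


If $k = i$, this reduces to Eq. (12-13), as was shown above. If $k \ne i$, the second term 
vanishes by orthogonality and the terms in the sum all vanish except when $j = k$. Thus, 


$$\langle\psi_k|H'|\psi_i\rangle + c_{ki}^{(1)}\left(\langle\psi_k|H_0|\psi_k\rangle - E_i\langle\psi_k|\psi_k\rangle\right) = 0, \quad (k \ne i) \tag{12-17}$$


or 


$$c_{ki}^{(1)} = \frac{\langle\psi_k|H'|\psi_i\rangle}{E_i - E_k}, \quad k \ne i \tag{12-18}$$


Inserting this into Eq. (12-14) gives an expression for $\phi_i^{(1)}$: 


$$\phi_i^{(1)} = \sum_{j\,(\ne i)} \frac{\langle\psi_j|H'|\psi_i\rangle}{E_i - E_j}\,\psi_j \tag{12-19}$$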


This formula prescribes the way the first-order correction to the wavefunction is to be 
built up from eigenfunctions of the unperturbed system. We discuss this formula in 
detail later when considering an example. 

Note that, if the perturbed state of interest (the $i$th) is degenerate with another state 
(the $l$th), Eqs. (12-18) and (12-19) blow up for $k$ or $j$ equal to $l$ unless $\langle\psi_l|H'|\psi_i\rangle$ 
vanishes. Therefore, we restrict the theoretical discussion of this section to states of 
interest that are nondegenerate and discrete. (States of the system other than the ith 
may be degenerate or continuum states, however.) 

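If we extract the terms containing $\lambda^2$ from Eq. (12-7) and proceed, in the same way 
as above, to expand $\phi_i^{(2)}$ as a linear combination of unperturbed eigenfunctions, 


$$\phi_i^{(2)} = \sum_j c_{ji}^{(2)}\psi_j \tag{12-20}$$


we arrive, after some manipulation, at the following formula for $W_i^{(2)}$: 


$$W_i^{(2)} = \sum_{j\,(\ne i)} \frac{\langle\psi_i|H'|\psi_j\rangle\langle\psi_j|H'|\psi_i\rangle}{E_i - E_j} \tag{12-21}$$

Comparing this with Eq. (12-18) allows us to write 

$$W_i^{(2)} = \sum_{j\,(\ne i)} \left|c_{ji}^{(1)}\right|^2 (E_i - E_j) \tag{12-22}$$

or comparing Eqs. (12-19) and (12-21) gives 


$$W_i^{(2)} = \langle\psi_i|H'|\phi_i^{(1)}\rangle \tag{12-23}$$

The formula that emerges for $c_{ji}^{(2)}$ of Eq. (12-20) is 


$$c_{ji}^{(2)} = \sum_{k\,(\ne i)} \frac{\langle\psi_j|H'|\psi_k\rangle\langle\psi_k|H'|\psi_i\rangle}{(E_i - E_k)(E_i - E_j)} - \frac{\langle\psi_i|H'|\psi_i\rangle\langle\psi_j|H'|\psi_i\rangle}{(E_i - E_j)^2}, \quad j \ne i \tag{12-24}$$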


The boxed equations are "working equations" since they enable us to calculate the 
correction terms from the known eigenvalues and eigenfunctions of the unperturbed 
system. Equation (12-23) is interesting because it indicates that the second-order 
correction to the energy is calculable if we know the zeroth and first-order functions.⁵ 
Comparing Eqs. (12-13) and (12-23) shows that, whereas the first-order correction for 
the energy is the average value for the perturbation operator with the unperturbed 
wavefunction, the second-order correction is an interaction element between two functions, 
and not an average value in the usual sense. 

Higher-order correction terms may be found by proceeding in a similar way with 
$\lambda^3$, $\lambda^4$, etc. terms from Eq. (12-7). The equations become progressively more cumbersome 
and will be of no interest to us for applications to be considered in this book. 

Having made use of the parameter $\lambda$ to keep terms properly sorted, we can now 
dispense with it by setting it equal to unity. Then 


$$H = H_0 + H' \tag{12-25}$$
$$W_i = E_i + W_i^{(1)} + W_i^{(2)} + \cdots \tag{12-26}$$
$$\phi_i = \psi_i + \phi_i^{(1)} + \phi_i^{(2)} + \cdots \tag{12-27}$$


Notice that $\phi_i$ is not normalized. The normalization coefficient needed will depend on 
the order to which $\phi_i$ has been calculated.⁶ 
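
The working equations can be tried out on a model whose exact answer is available in closed form. The following sketch assumes a hypothetical two-state system: the unperturbed energies `E1`, `E2` and the matrix elements `H11`, `H12`, `H22` of $H'$ are invented numbers, not values from the text. It compares the energy of the lower state to first and second order with the exact lower eigenvalue of the 2 x 2 matrix of $H_0 + H'$.

```python
import math

# Hypothetical two-state model (all numbers are assumptions for illustration).
E1, E2 = 0.0, 1.0                  # unperturbed energies, E1 < E2
H11, H12, H22 = 0.05, 0.10, -0.02  # matrix elements of H' (H21 = H12, real)

def exact_lower_energy(lam=1.0):
    """Lower eigenvalue of the 2x2 matrix of H0 + lam*H' in the psi_1, psi_2 basis."""
    a = E1 + lam * H11
    d = E2 + lam * H22
    b = lam * H12
    return 0.5 * (a + d) - math.sqrt(0.25 * (a - d) ** 2 + b ** 2)

w1 = H11                   # Eq. (12-13): first-order energy correction
c21 = H12 / (E1 - E2)      # Eq. (12-18): first-order mixing coefficient of psi_2
w2 = c21 ** 2 * (E1 - E2)  # Eq. (12-22): second-order energy correction

energy_first = E1 + w1
energy_second = E1 + w1 + w2
energy_exact = exact_lower_energy()

print(f"to first order : {energy_first:.6f}")
print(f"to second order: {energy_second:.6f}")
print(f"exact          : {energy_exact:.6f}")
```

For these assumed numbers the second-order energy lies much closer to the exact eigenvalue than the first-order energy does. Making `H12` larger relative to the gap `E2 - E1` shows how the truncated series degrades as the perturbation stops being small.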


⁵Löwdin [2] has shown that, if we know all the $\phi_i$'s up to $\phi_i^{(n)}$, we can calculate all the $W_i$'s up to and including 
$W_i^{(2n+1)}$. 

⁶One can guarantee normality up to second order in $\phi_i$ by setting $c_{ii}^{(1)} = 0$, $c_{ii}^{(2)} = -\frac{1}{2}\sum_k |c_{ki}^{(1)}|^2$ (see Schiff [3]). 


12-3 A Uniform Electrostatic Perturbation of an Electron in a “Wire” 

12-3.A Description of the System 


Suppose that a small uniform electric field is applied to an electron somehow constrained 
to move on a line segment of length $L$. In the absence of this field, we assume the 
electron states to be described by the one-dimensional “box” wavefunctions discussed 
in Chapter 2. The electric field will be treated as a perturbation. The potential energy 
of the electron is sketched in Fig. 12-1. A uniform field produces a constant gradient 
(in potential energy) along the line segment. For purposes of discussion, we let the 
perturbation rise from zero at $x = 0$ to $u$ at $x = L$, but we shall see later that there is a 
degree of arbitrariness here. The perturbation, then, is given by 


$$H' = ux/L, \quad 0 \le x \le L \tag{12-28}$$


where $u$, $x$, and $L$ are measured in atomic units. 

Figure 12-1 ► Potential of an electron in a line segment of length $L$ in the presence of a uniform 
electric field. The perturbation is “small” if the potential change $u$ across $L$ is small compared to 
$\pi^2/2L^2$, the energy in atomic units of the lowest unperturbed state. 


12-3.B The Energy to First Order 


We now ask, what is the effect, to first order, of $H'$ on the energy of the lowest-energy 
state of the electron? As described in the preceding section, this is obtained by 
calculating the average value of $H'$ using the unperturbed wavefunction: 


$$W_1^{(1)} = \int_0^L \left[\sqrt{2/L}\,\sin(\pi x/L)\right](ux/L)\left[\sqrt{2/L}\,\sin(\pi x/L)\right]dx \tag{12-29}$$


We now describe three ways to evaluate this integral. One way is to integrate explicitly; 
the other two ways involve simple inspection. 


1. Explicit Integration. Factoring constants from Eq. (12-29) gives 

$$W_1^{(1)} = (2u/L^2)\int_0^L x\,\sin^2(\pi x/L)\,dx \tag{12-30}$$


To achieve a common variable, we multiply $x$ and $dx$ each by $\pi/L$ and outside by 
$L^2/\pi^2$, thereby keeping the value unchanged: 

$$W_1^{(1)} = (2u/\pi^2)\int_{x=0}^{x=L} (\pi x/L)\,\sin^2(\pi x/L)\,d(\pi x/L) \tag{12-31}$$


Letting $\pi x/L = y$ and noting that $y = 0, \pi$ when $x = 0, L$, we have 


$$W_1^{(1)} = (2u/\pi^2)\int_0^{\pi} y\,\sin^2 y\,dy \tag{12-32}$$

Standard tables (see also Appendix 1) lead to a value of $\pi^2/4$ for the integral, and 

$$W_1^{(1)} = (2u/\pi^2)(\pi^2/4) = u/2 \tag{12-33}$$


To first order, then, the energy of the lowest-energy perturbed state is, in atomic 
units, 


$$W_1 = (\pi^2/2L^2) + u/2 \tag{12-34}$$


2. Evaluation by Inspection: First Method. Equation (12-34) is certainly a reasonable 
result since, as the potential is increased everywhere in the box (except at $x = 0$), 
we expect that the energy of the electron should also increase. The fact that the 
increase is such a simple quantity ($u/2$) suggests that there might be a simple way 
to understand this result, and this is indeed the case. Consider the distribution of 
the electron in the lowest unperturbed state, shown in Fig. 12-2. This distribution is 
symmetric about the midpoint of the line segment. Consequently, for each instant of 
time the unperturbed electron spends in element $dx_1$ of Fig. 12-2, it spends an equal 
instant in the symmetrically related element $dx_2$. In other words, for each instant the 
electron experiences a perturbation potential less than $u/2$, it experiences an instant 
of potential greater (by an equal amount) than $u/2$. Hence, the average potential 
must be precisely $u/2$. Since we know that $\psi^2$ is symmetric for every state in the 
unperturbed box, we can immediately extend our result and say that the first-order 
correction to the energy of every state is $u/2$. 

The ability to evaluate first-order energies by inspection is very useful. Even in 
cases where exact evaluation by this technique is not possible, it may still be useful 
in making an estimate or in checking the reasonableness of a computed result. 


3. Evaluation by Inspection: Second Method. A variation of the above approach is 
sometimes useful. We begin by recognizing that, whereas $H'$ is neither symmetric 
nor antisymmetric about the midpoint of the wire, we can make it antisymmetric by 
subtracting the constant $u/2$, as indicated in Fig. 12-3. 


Figure 12-2 ► $\psi^2$ ($n = 1$) for a particle in the unperturbed box: $\psi_1^2 = (2/L)\sin^2(\pi x/L)$. 

Figure 12-3 ► The function $ux/L$ is unsymmetric about $L/2$, but $ux/L - u/2$ is antisymmetric 
about $L/2$. 


By writing 


$$H' = ux/L - u/2 + u/2 = H'_{\rm anti} + u/2 \tag{12-35}$$


we express $H'$ as an antisymmetric function plus a constant. Our integral for $W_1^{(1)}$ 
becomes 


$$W_1^{(1)} = \int_0^L \psi_1 H'_{\rm anti}\,\psi_1\,dx + \int_0^L \psi_1 (u/2)\,\psi_1\,dx \tag{12-36}$$


The first integral vanishes because $\psi_1^2$ is symmetric. The second integral is just $u/2$ 
times unity since $\psi_1$ is normalized. Hence, $W_1^{(1)} = u/2$. 
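
The three evaluations above can be cross-checked by direct numerical integration. The sketch below is an illustration under assumed values (`L` and `U` are arbitrary choices and the function names are hypothetical). It applies a midpoint rule to $\int\psi_n^2 H'\,dx$ for the ramp of Eq. (12-28) and for its antisymmetric part from Eq. (12-35).

```python
import math

L = 2.0  # box length in a.u. (assumed value)
U = 0.1  # rise of the potential across the box in a.u. (assumed value)

def density(n, x):
    """psi_n^2 for the unperturbed box: (2/L) sin^2(n pi x / L)."""
    return (2.0 / L) * math.sin(n * math.pi * x / L) ** 2

def first_order_energy(n, h_prime, points=2000):
    """Midpoint-rule estimate of the integral of psi_n^2 * H' over the box."""
    dx = L / points
    return sum(density(n, (k + 0.5) * dx) * h_prime((k + 0.5) * dx) * dx
               for k in range(points))

def ramp(x):
    """H' = u x / L of Eq. (12-28)."""
    return U * x / L

def antisymmetric_part(x):
    """H'_anti = u x / L - u/2 of Eq. (12-35)."""
    return U * x / L - U / 2.0

ramp_shifts = [first_order_energy(n, ramp) for n in range(1, 5)]
anti_shifts = [first_order_energy(n, antisymmetric_part) for n in range(1, 5)]

for n, (w_ramp, w_anti) in enumerate(zip(ramp_shifts, anti_shifts), start=1):
    print(f"n = {n}: ramp {w_ramp:.6f} (u/2 = {U / 2:.6f}), antisymmetric part {w_anti:.1e}")
```

Every state should show the same shift $u/2$ for the ramp and a vanishing shift for the antisymmetric part, in line with the inspection arguments.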


EXAMPLE 12-2 A one-dimensional box potential is perturbed so that it is raised 
by a constant amount, $\delta$, on the left side of the box but is unchanged on the right 
side. What is the first-order change in energy for $\psi_1$? For $\psi_2$? 


SOLUTION ► One can argue, similarly to the reasoning in Example 12-1, that half of the 
distribution responds to the increase in potential, half to zero change, giving a net first-order 
increase of $\delta/2$. Or, one can turn the perturbation into an antisymmetric function plus a constant: 
$H' = H' - \delta/2 + \delta/2 = H'_{\rm anti} + \delta/2$, leading to $W_1^{(1)} = \delta/2$. The same result applies to $\psi_2$ and all 
higher states because $\psi_n^2$ is symmetric in the box for all $n$. ◄ 


12-3.C The First-Order Correction to $\psi_1$ 


How should we expect the lowest-energy wavefunction to change in response to the 
perturbation? Since we are dealing with the lowest-energy state, we might expect 
the electron to spend more time in the low-potential end of the box (the nonclassical 
result), and so the wavefunction should tend to become skewed, as shown in 
Fig. 12-4. In this figure it is demonstrated how the perturbed wavefunction can be 
resolved into an unperturbed wavefunction and a correction, or difference, function. 
Since this correction function must increase $\psi$ on the low potential side and decrease 
$\psi$ on the high potential side, it is clear that it must be close to antisymmetric in nature. 
According to Eq. (12-19), the first-order approximation $\phi_1^{(1)}$ to this correction function 
is formed by adding together small amounts of higher-energy wavefunctions. Comparing 
the correction function in Fig. 12-4 and these higher-energy wavefunctions 
(Fig. 12-5) leads us to expect that $\psi_2$ will be a heavy contributor, whereas $\psi_3$, being 
symmetric, will not contribute strongly. 


Figure 12-4 ► Sketches of the lowest-energy wavefunction (a) before and after perturbation and 
(b) the difference between them, $\psi_{\rm pert} - \psi_{\rm unpert}$. 

Figure 12-5 ► Sketches of the (a) second, (b) third, and (c) fourth ($n = 2, 3, 4$, respectively) 
wavefunctions $\psi$ for the particle in the unperturbed box. 


The mixing coefficient for $\psi_2$ is, according to Eq. (12-18), 


$$c_{21}^{(1)} = \frac{\int_0^L \left(\sqrt{2/L}\,\sin 2\pi x/L\right)(ux/L)\left(\sqrt{2/L}\,\sin \pi x/L\right)dx}{-3\pi^2/2L^2} = \frac{32L^2u}{27\pi^4} \tag{12-37}$$


Thus, $c_{21}^{(1)}$ is positive, and $\psi_2$ contributes to $\phi_1^{(1)}$ in the manner expected. The mixing 
coefficient for $\psi_3$ is given by 


$$c_{31}^{(1)} = \frac{\langle\psi_3|H'_{\rm anti}|\psi_1\rangle}{E_1 - E_3} + \frac{\langle\psi_3|u/2|\psi_1\rangle}{E_1 - E_3} = 0 \tag{12-38}$$


where we have used expression (12-35) for $H'$. The first integral vanishes because 
$\psi_1$ and $\psi_3$ are symmetric. The second integral vanishes because they are orthogonal. 
Clearly, no symmetric state will contribute to $\phi_1^{(1)}$. 


Since $\psi_4$ is antisymmetric, it can contribute to $\phi_1^{(1)}$. On evaluation, we find that $c_{41}^{(1)}$ 
is about 2% of $c_{21}^{(1)}$ (Problem 12-5). $c_{41}^{(1)}$ is so much smaller than $c_{21}^{(1)}$ for two reasons. 
First, the integral $\langle\psi_4|H'|\psi_1\rangle$ in the numerator of $c_{41}^{(1)}$ is much smaller than $\langle\psi_2|H'|\psi_1\rangle$ 
in $c_{21}^{(1)}$. This means that the shifting of charge produced by adding $\psi_4$ to $\psi_1$ is much less 
helpful for lowering the energy than is that produced by adding $\psi_2$ to $\psi_1$. Examination 
of $\psi_2$ and $\psi_4$ (Fig. 12-5) reveals why this is so. $\psi_2$ causes removal of charge from 
the right-hand half of the box and accumulation of charge in the left-hand half. $\psi_4$ 
causes removal of charge from the second and fourth quarters (numbering from the left) 
and buildup of charge in the first and third quarters. On balance $\psi_4$ helps, but charge 
buildup in the third quarter is not desirable nor is removal of charge from the second 
quarter, and so $\psi_4$ is much less helpful than $\psi_2$. The second reason for $c_{41}^{(1)}$ being so 
small is that $E_1 - E_4$ in the denominator of $c_{41}^{(1)}$ is five times as big as $E_1 - E_2$ in $c_{21}^{(1)}$. 
In general, mixing between states of widely different energies is discouraged by the 
formula for $c_{ji}^{(1)}$. 


The fact that a large contribution by $\psi_j$ to $\phi_i^{(1)}$ is favored by large $\langle\psi_j|H'|\psi_i\rangle$ and 
small $|E_i - E_j|$ is strikingly similar to the situation found for variational calculations 
(Chapter 7). There, the mixing between two basis functions $\chi_i$ and $\chi_j$ is favored if 
$\langle\chi_i|H|\chi_j\rangle = H_{ij}$ is large and if $|H_{ii} - H_{jj}|$ is small. 
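
The mixing coefficients discussed in this subsection are easy to evaluate numerically. The sketch below is an illustration rather than the book's procedure: `L` and `U` are assumed values, the helper names are hypothetical, and the matrix elements of $H' = ux/L$ are estimated with a midpoint rule before being inserted into Eq. (12-18).

```python
import math

L = 1.0  # box length in a.u. (assumed value)
U = 0.1  # rise of the potential across the box in a.u. (assumed value)

def psi(n, x):
    """Normalized particle-in-a-box eigenfunction psi_n on 0 <= x <= L."""
    return math.sqrt(2.0 / L) * math.sin(n * math.pi * x / L)

def energy(n):
    """Unperturbed box energy E_n = n^2 pi^2 / (2 L^2) in a.u."""
    return n ** 2 * math.pi ** 2 / (2.0 * L ** 2)

def h_element(k, i, points=2000):
    """Midpoint-rule estimate of <psi_k|H'|psi_i> with H' = u x / L."""
    dx = L / points
    total = 0.0
    for step in range(points):
        x = (step + 0.5) * dx
        total += psi(k, x) * (U * x / L) * psi(i, x) * dx
    return total

def mixing_coefficient(k, i):
    """First-order coefficient c_ki^(1) of Eq. (12-18)."""
    return h_element(k, i) / (energy(i) - energy(k))

c21 = mixing_coefficient(2, 1)
c31 = mixing_coefficient(3, 1)
c41 = mixing_coefficient(4, 1)
c21_closed_form = 32.0 * L ** 2 * U / (27.0 * math.pi ** 4)  # Eq. (12-37)

print(f"c21 = {c21:.6e} (closed form {c21_closed_form:.6e})")
print(f"c31 = {c31:.1e}")
print(f"c41 / c21 = {c41 / c21:.4f}")
```

The numerical `c21` should agree with Eq. (12-37), `c31` should vanish to rounding error, and the ratio `c41 / c21` should come out near 0.016, consistent with the "about 2%" quoted in the text.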


EXAMPLE 12-3 What sign should be expected for $c_{21}^{(1)}$ for the perturbation of 
Example 12-1? Of 12-2? 


SOLUTION ► Assuming that $\psi_1$ will behave anticlassically, and be skewed so as to decrease the 
probability distribution in the higher-energy region (left side), we expect $c_{21}^{(1)}$ to have the opposite 
sign of that in Eq. 12-37. Hence, it should be negative. This applies for both Examples 12-1 and 
12-2. ◄ 


12-3.D The Role of an Additive Constant in H’ 


Review of the results of Sections 12-3B and C will show that addition of a constant 
to $H'$ will change $W^{(1)}$ by the same constant for all states and have no effect on $\phi^{(1)}$. 
The change in energy of all states by a constant is equivalent to a relocation of the zero 
of energy and is normally of no interest. Our initial statement that the perturbation is 
produced by a uniform electric field left the additive constant in $H'$ unspecified. We 
chose to set $H' = 0$ at $x = 0$, but we could have made any of an infinite number of 
choices for $H'$ at $x = 0$. A more sensible choice would have been the antisymmetric 
function $H' = ux/L - u/2$ because this still leads to changes in wavefunctions due to 
the perturbation, but introduces no energy change to first order ($W_i^{(1)} = 0$ for all states 
using this $H'$). In other words, the antisymmetric function includes the relevant physics 
of the problem and excludes the trivial effects of a constant ($u/2$) potential change. 
Our choice of $H' = ux/L$ was made for pedagogical reasons. 


12-3.E The Calculation of $W_1^{(2)}$ 


Since the constant first-order contribution to the energies of all the states is not physically 
interesting, let us examine the second-order contribution to the ground-state energy, 
$W_1^{(2)}$. Equation (12-23) shows that this is related to the first-order correction to the 
wavefunction, $\phi_1^{(1)}$, which, as we have already seen, causes the wavefunction to become 
skewed toward the low-potential end of the box. It is clear from Eq. (12-22) that, in 
calculating the coefficients for $\phi_1^{(1)}$ we have already done most of the work needed to 
find $W_1^{(2)}$. We saw earlier that $\phi_1^{(1)}$ is made primarily from $\psi_2$. For simplicity, we will 
neglect all higher-energy contributions, and so 


$$\phi_1^{(1)} \cong c_{21}^{(1)}\psi_2 = (32L^2u/27\pi^4)\,\psi_2 \tag{12-39}$$

and, using Eq. (12-22), 

$$W_1^{(2)} \cong (32L^2u/27\pi^4)^2\,(-3\pi^2/2L^2) \tag{12-40}$$


Thus, the effect of $\psi_2$ is to lower the energy. 
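
How much is lost by keeping only $\psi_2$ in Eq. (12-40) can be estimated by carrying the sum of Eq. (12-22) over more states. The following sketch does so under stated assumptions: `L`, `U`, and the cutoff `max_state` are arbitrary illustrative choices, the helper names are hypothetical, and the matrix elements are obtained by a midpoint rule.

```python
import math

L = 1.0  # box length in a.u. (assumed value)
U = 0.1  # rise of the potential across the box in a.u. (assumed value)

def psi(n, x):
    """Normalized particle-in-a-box eigenfunction psi_n on 0 <= x <= L."""
    return math.sqrt(2.0 / L) * math.sin(n * math.pi * x / L)

def energy(n):
    """Unperturbed box energy E_n = n^2 pi^2 / (2 L^2) in a.u."""
    return n ** 2 * math.pi ** 2 / (2.0 * L ** 2)

def h_element(k, i, points=2000):
    """Midpoint-rule estimate of <psi_k|H'|psi_i> with H' = u x / L."""
    dx = L / points
    return sum(psi(k, (s + 0.5) * dx) * (U * (s + 0.5) * dx / L) * psi(i, (s + 0.5) * dx) * dx
               for s in range(points))

def second_order_terms(i, max_state=30):
    """Terms |c_ji^(1)|^2 (E_i - E_j) of Eq. (12-22), keyed by j (j != i)."""
    return {j: h_element(j, i) ** 2 / (energy(i) - energy(j))
            for j in range(1, max_state + 1) if j != i}

terms = second_order_terms(1)
w2_single = terms[2]             # psi_2 only, as in Eq. (12-40)
w2_total = sum(terms.values())   # truncated sum over states 2..max_state
w2_closed = (32.0 * L ** 2 * U / (27.0 * math.pi ** 4)) ** 2 * (-3.0 * math.pi ** 2 / (2.0 * L ** 2))

print(f"psi_2 term only : {w2_single:.6e} a.u.")
print(f"sum over states : {w2_total:.6e} a.u.")
print(f"fraction from psi_2: {w2_single / w2_total:.4f}")
```

For the ground state every term is negative, so the truncated sum is slightly lower than the single-term estimate; with these assumptions $\psi_2$ accounts for well over 99% of the total.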

We pause at this point to summarize our results. The perturbation raises the potential 
everywhere in the box. (This depends on our choice of an arbitrary constant.) The 
energy to first order is increased by the same constant amount for every state. This is 
easily seen by inspection, utilizing simple symmetry features of H' and the unperturbed 
wavefunctions. The wavefunction for the lowest-energy state is skewed, to first order, 
in a way that is energetically favorable as far as interaction with H’ is concerned. 
The second-order contribution to the energy for this state is negative, reflecting this 
energetically favorable shift of charge. Thus far, everything behaves sensibly. We next 
examine the behavior of the second-lowest energy state, where some important new 
features occur. 


EXAMPLE 12-4 For the $\psi_1$ case of Example 12-1, should the energy to second 
order be higher or lower than the unperturbed energy? 


SOLUTION ► The energy to second order is $E_1 + W_1^{(1)} + W_1^{(2)}$. We have already seen that 
$W_1^{(1)} = 0$. Thus the question comes down to asking whether $W_1^{(2)}$ is positive or negative. The 
first-order correction to the wavefunction shifts charge from the higher-energy regions, hence produces 
a negative $W_1^{(2)}$. ◄ 


12-3.F The Effects of the Perturbation on $\psi_2$ 


We begin by examining $\phi_2^{(1)}$, the first-order correction to $\psi_2$. Inspection of Eq. (12-18) 
leads at once to the observation that $c_{12}^{(1)} = -c_{21}^{(1)}$. This means that, since $\psi_2$ contributes 
to $\phi_1^{(1)}$ with a positive coefficient, $\psi_1$ contributes to $\phi_2^{(1)}$ with a negative coefficient. 
A sketch of $\psi_2$ minus a small amount of $\psi_1$ will show that this has the effect of shifting 
charge from the left half to the right half of the box. This is just the reverse of what we 
found for the lowest-energy state. 

The antisymmetry of Eq. (12-18) for interchange of i and k allows us to make the 
following general statement. Let a perturbation occur that raises or lowers the potential 
more in one region of space than in another. The first-order correction to a given 
wavefunction will contain higher-energy wavefunctions in a manner to cause charge 
to shift into regions of lowered (or less raised) potential and it will contain lower- 
energy wavefunctions in a manner to cause charge to shift into regions of raised (or less 
lowered) potential. 

We have not yet completed our construction of $\phi_2^{(1)}$. We must calculate coefficients 
for contributions from the higher-energy functions $\psi_3$, $\psi_4$, etc. The state $\psi_3$ should 
contribute fairly strongly since it has the proper symmetry and is not too distant in 
energy from $E_2$: 


$$c_{32}^{(1)} = \frac{\langle\psi_3|H'|\psi_2\rangle}{E_2 - E_3} = \frac{3\cdot 32L^2u}{5\cdot 25\pi^4} \tag{12-41}$$

Therefore, $\psi_3$ contributes to $\phi_2^{(1)}$ with a coefficient about two-thirds the magnitude of the 
contribution from $\psi_1$. The sign of $c_{32}^{(1)}$ is positive and a sketch of $\psi_2$ plus a small amount 
of $\psi_3$ will demonstrate that this causes charge shifting to the left in accord with our 
general statement above. 

Since $\psi_4$ contributes nothing (by symmetry) and $\psi_5$ is fairly distant in energy, we will 
neglect all contributions to $\phi_2^{(1)}$ above $\psi_3$. 

We have, then, two sizable contributions to $\phi_2^{(1)}$, each favoring charge shifts in 
opposite directions. Let us see how $W_2^{(2)}$ reflects this: 


$$W_2^{(2)} \cong \left(c_{12}^{(1)}\right)^2 (E_2 - E_1) + \left(c_{32}^{(1)}\right)^2 (E_2 - E_3) \tag{12-42a}$$

$$= \left[3\left(c_{12}^{(1)}\right)^2 - 5\left(c_{32}^{(1)}\right)^2\right]\pi^2/2L^2 \tag{12-42b}$$

$$\cong 1.26\,L^2u^2/2\pi^6 \tag{12-42c}$$


$W_2^{(2)}$ is the difference between energy contributions of opposite sign. The net result 
(energy increases) comes about in this case because $\psi_1$ contributes more heavily 
than $\psi_3$. 

The fact that contributing wavefunctions from lower and higher energies affect $W^{(2)}$ 
oppositely is made evident by Eq. (12-22). Hence, we can extend our general statement 
above by adding that second-order contributions to the energy from states below $\psi_i$ 
in energy cause the energy of the $i$th state to go up, whereas contributions from above 
cause it to go down. 

Because the unperturbed energies of the particle in the box increase as $n^2$, any state 
(except the lowest) is closer in energy to states below than to states above. Hence, for 
at least some kinds of perturbation, we might expect these states to "feel" the effects of 
states at lower energies more strongly and to rise in energy (as far as $W^{(2)}$ is concerned). 
This is what happens in this example. There are no states below the lowest, and so 
$W_1^{(2)}$ cannot be positive, but $W_2^{(2)}$ is positive and it turns out that $W_n^{(2)}$ is positive for 
all higher $n$ as well. 
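
The sign pattern of $W_n^{(2)}$ described above can be examined numerically. The sketch below is an illustration only: `L`, `U`, and the cutoff `max_state` are assumed values and the helper names are hypothetical. It evaluates the sum of Eq. (12-22) for the four lowest states and also forms the two-term estimate of Eq. (12-42a) for $n = 2$.

```python
import math

L = 1.0  # box length in a.u. (assumed value)
U = 0.1  # rise of the potential across the box in a.u. (assumed value)

def psi(n, x):
    """Normalized particle-in-a-box eigenfunction psi_n on 0 <= x <= L."""
    return math.sqrt(2.0 / L) * math.sin(n * math.pi * x / L)

def energy(n):
    """Unperturbed box energy E_n = n^2 pi^2 / (2 L^2) in a.u."""
    return n ** 2 * math.pi ** 2 / (2.0 * L ** 2)

def h_element(k, i, points=2000):
    """Midpoint-rule estimate of <psi_k|H'|psi_i> with H' = u x / L."""
    dx = L / points
    return sum(psi(k, (s + 0.5) * dx) * (U * (s + 0.5) * dx / L) * psi(i, (s + 0.5) * dx) * dx
               for s in range(points))

def second_order_energy(i, max_state=30):
    """Truncated sum of Eq. (12-22): sum over j != i of |H'_ji|^2 / (E_i - E_j)."""
    return sum(h_element(j, i) ** 2 / (energy(i) - energy(j))
               for j in range(1, max_state + 1) if j != i)

w2 = {n: second_order_energy(n) for n in (1, 2, 3, 4)}

# Two-term estimate for n = 2, Eq. (12-42a): psi_1 pushes up, psi_3 pushes down.
two_term = (h_element(1, 2) ** 2 / (energy(2) - energy(1))
            + h_element(3, 2) ** 2 / (energy(2) - energy(3)))
two_term_closed = 0.5 * (3.0 * (32.0 / 27.0) ** 2 - 5.0 * (96.0 / 125.0) ** 2) * L ** 2 * U ** 2 / math.pi ** 6

for n, value in w2.items():
    print(f"n = {n}: W(2) = {value:+.4e} a.u.")
print(f"two-term estimate for n = 2: {two_term:+.4e} a.u.")
```

With these assumptions only the lowest state is lowered in second order. The closed-form expression used for comparison follows from inserting Eqs. (12-37) and (12-41) into Eq. (12-42a).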

It is interesting to compare these results with those from classical physics. Classi- 
cally, the particle moves most slowly at the top of the potential gradient and therefore 
spends most of its time there. The lowest-energy state has responded in the opposite 
manner, in a way we might call anticlassical. The second and all higher states have 
responded classically. 


12-4 The Ground-State Energy to First-Order of Heliumlike Systems 


The hamiltonian for a two-electron atom or ion with nuclear charge $Z$ a.u. is (neglecting 
relativistic effects and assuming infinite nuclear mass) 


$$H(1,2) = -\tfrac{1}{2}\left(\nabla_1^2 + \nabla_2^2\right) - Z/r_1 - Z/r_2 + 1/r_{12} \tag{12-43}$$

This may be written as a sum of one-electron operators and a two-electron operator: 


$$H(1,2) = H(1) + H(2) + 1/r_{12} \tag{12-44}$$

where $H(i)$ is simply the hamiltonian for the hydrogenlike system with nuclear 
charge $Z$: 


$$H(i) = -\tfrac{1}{2}\nabla_i^2 - Z/r_i \tag{12-45}$$


Since we know the eigenvalues and eigenfunctions for $H(i)$, we can let $H(1) + H(2)$ 
be the unperturbed hamiltonian with $1/r_{12}$ the perturbation. Such a perturbation is not 
very small, but it is of interest to see how well the method works in such a case. We 
have, therefore, for the lowest-energy state of the system: 


$$H_0 = H(1) + H(2) \tag{12-46}$$
$$H' = 1/r_{12} \tag{12-47}$$
$$\psi_1 = (Z^3/\pi)\exp[-Z(r_1 + r_2)] \tag{12-48}$$
$$E_1 = -Z^2/2 - Z^2/2 = -Z^2 \tag{12-49}$$
$$W_1^{(1)} = \langle\psi_1|1/r_{12}|\psi_1\rangle \tag{12-50}$$


All quantities are in atomic units. The unperturbed wavefunction [Eq. (12-48)] is simply 
the product of two one-electron 1s AOs. Because this is an eigenfunction of the system 
in the absence of interelectronic repulsion, it is too contracted about the nucleus. 

The first-order correction to the energy is the repulsion between the two electrons 
in this overly contracted eigenfunction [Eq. (12-50)]. We have encountered this same 
repulsion integral in our earlier variational calculation on helium-like systems. The 
evaluation of this integral is described in Appendix 3. Its value is $5Z/8$. Hence, to first 
order, 


$$W_1 = E_1 + W_1^{(1)} = -Z^2 + 5Z/8 \tag{12-51}$$
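
Equation (12-51) is simple enough to tabulate directly. The sketch below evaluates it for $Z = 1$ to 10; the function names are hypothetical, and the only value not taken from this section is the exact hydrogen-atom ground-state energy of $-0.5$ a.u., used for a stability comparison at $Z = 1$.

```python
def unperturbed_energy(z):
    """E_1 = -Z^2 a.u., Eq. (12-49)."""
    return -float(z) ** 2

def first_order_correction(z):
    """W_1^(1) = 5Z/8 a.u., the value of the repulsion integral of Eq. (12-50)."""
    return 5.0 * z / 8.0

def energy_to_first_order(z):
    """W_1 = -Z^2 + 5Z/8 a.u., Eq. (12-51)."""
    return unperturbed_energy(z) + first_order_correction(z)

rows = [(z, unperturbed_energy(z), first_order_correction(z), energy_to_first_order(z))
        for z in range(1, 11)]
for z, e1, w1, total in rows:
    print(f"Z = {z:2d}  E1 = {e1:8.3f}  W(1) = {w1:6.3f}  "
          f"W1 = {total:9.4f}  W(1)/|E1| = {w1 / abs(e1):.4f}")

H_ATOM_ENERGY = -0.5  # exact ground-state energy of the H atom in a.u.
hydride_bound = energy_to_first_order(1) < H_ATOM_ENERGY
print("H- bound relative to H + e- at first order:", hydride_bound)
```

The last column shows the correction shrinking relative to the unperturbed energy as $Z$ grows, while the $Z = 1$ row lies above $-0.5$ a.u., so the first-order energy does not predict a bound hydride ion.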


In Table 12-1, this result is compared with exact energies for the first ten members of 
this series. Several points should be noted: 


1. The effect of the perturbation to first order is to increase the ground-state energy. 
This is expected since $1/r_{12}$ is always positive (i.e., repulsive). 


2. The energy to first order is never below the exact ground-state energy. This is a 
general property of perturbation calculations as is easily proved (Problem 12-1). 


3. The energy to first order is in error by a fairly constant amount throughout the series. 
For H⁻, this gives a substantial percentage of error and fails to show H⁻ stable 
compared with an H atom and an unbound electron. For higher $Z$, this error becomes 
relatively smaller since $1/r_{12}$ becomes relatively less important compared with $Z/r_i$. 
The assumption that $H'$ is a small perturbation is thus better fulfilled at large $Z$ 
and results in $W_1^{(1)}$ being a much smaller correction relative to the total energy 
$W_1$. Because "large" and "small" are relative terms, they can be misleading. Since 
energies of chemical interest are often small differences between large numbers (see 
the discussion at the end of Chapter 7), errors that were originally relatively small can 
become relatively large after the subtraction. Therefore, even though perturbation 
terminology would suggest that the results at $Z = 10$ are better than those at $Z = 1$, 
this may not be the case for some practical applications. 


12-5 Perturbation at an Atom in the Simple Hückel MO Method 


Perturbation theory can be used to estimate the effect of a change in the value of the 
coulomb integral $H_{kk}$ at carbon atom $k$. This is normally given the value of $\alpha$, but it 
might be desirable to consider a different value due, for example, to replacement of an 
attached hydrogen by some other atom or group. We take the unperturbed Hückel MOs 
and orbital energies as our starting point and let the perturbed value of $H_{kk}$ be $\alpha + \delta\alpha$. 
Therefore, 


$$H' = \delta\alpha \quad (\text{at center } k \text{ only}) \tag{12-52}$$

The first-order correction for the energy of $\phi_i$, the $i$th MO, is 

$$W_i^{(1)} = \langle\phi_i|H'|\phi_i\rangle \tag{12-53}$$

Now $\phi_i$ is a linear combination of the AOs $\chi$: 


$$\phi_i = \sum_j c_{ji}\chi_j \tag{12-54}$$


and so 


$$W_i^{(1)} = \sum_j\sum_l c_{ji}^* c_{li}\langle\chi_j|H'|\chi_l\rangle \tag{12-55}$$


but $H'$ is zero except at atom $k$, where it is $\delta\alpha$, and so the sum reduces to one term: 

$$W_i^{(1)} = c_{ki}^* c_{ki}\,\delta\alpha \tag{12-56}$$


Summing over all the MOs times the number of electrons in each MO gives the first-order 
correction to the total energy: 


$$W^{(1)} = \delta\alpha\sum_i n_i c_{ki}^* c_{ki} = \delta\alpha\,q_k \tag{12-57}$$


where $q_k$ is the $\pi$-electron density at atom $k$. Thus the energy change to first order is 
equal to the change in $H_{kk}$ times the unperturbed electron density at that atom. This 
can also be seen to be an immediate consequence of the alternative energy expression 


$$E = \sum_l q_l\alpha_l + 2\sum_{l<m} p_{lm}\beta_{lm} \tag{12-58}$$


which was derived in Chapter 8. 


EXAMPLE 12-5 Consider the methylene cyclopropene molecule, C4H4, in the
HMO approximation. (See Appendix 6 for data.) At which carbon will a perturbation
involving $\alpha$ affect the total $\pi$ energy the most? Calculate the energy to first
order if the value of $\alpha$ at that atom increases to $\alpha + 0.1000\beta$. Calculate the energy
change, to first order, of each of the four MOs.

SOLUTION » The total energy will be most affected if the perturbation occurs at the atom
having the greatest $\pi$-electron density, that is, at atom 4. To first order, the total $\pi$ energy
changes by $(0.1000\beta)(1.4881) = 0.1488\beta$, giving an energy to first order of
$4\alpha + 4.9624\beta + 0.1488\beta = 4\alpha + 5.1112\beta$. Each MO energy drops by (using Eq. (12-56)),
from lowest to highest, $0.00794\beta$, $0.06646\beta$, $0.0000\beta$, $0.02559\beta$. The sum of the first
two of these, times 2 (because of having two electrons each), gives $0.1488\beta$, which equals
the first-order change in total $\pi$ energy found above.
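
The numbers in Example 12-5 can be reproduced with a short script. The following is a minimal sketch and not part of the original text: it builds the Hückel matrix in units of $\beta$ (so that $E = \alpha + x\beta$), diagonalizes it with a small hand-written Jacobi routine, and applies Eqs. (12-56) and (12-57). The helper name `jacobi_eigh` and the atom numbering (ring atoms 1 to 3, exocyclic carbon 4 bonded to atom 1) are assumptions made for this illustration.

```python
import math

def jacobi_eigh(a, sweeps=100, tol=1e-22):
    """Eigenvalues and eigenvectors of a real symmetric matrix (cyclic Jacobi).
    Returns (values, vecs) with vecs[k][i] = coefficient of basis function k in vector i."""
    n = len(a)
    a = [row[:] for row in a]
    v = [[float(i == j) for j in range(n)] for i in range(n)]
    for _ in range(sweeps):
        if sum(a[p][q] ** 2 for p in range(n) for q in range(p + 1, n)) < tol:
            break
        for p in range(n - 1):
            for q in range(p + 1, n):
                if a[p][q] == 0.0:
                    continue
                tau = (a[q][q] - a[p][p]) / (2.0 * a[p][q])
                t = math.copysign(1.0, tau) / (abs(tau) + math.sqrt(tau * tau + 1.0))
                c = 1.0 / math.sqrt(t * t + 1.0)
                s = t * c
                for mat in (a, v):                 # rotate columns p and q
                    for k in range(n):
                        mp, mq = mat[k][p], mat[k][q]
                        mat[k][p], mat[k][q] = c * mp - s * mq, s * mp + c * mq
                for k in range(n):                 # rotate rows p and q of a
                    ap, aq = a[p][k], a[q][k]
                    a[p][k], a[q][k] = c * ap - s * aq, s * ap + c * aq
    return [a[i][i] for i in range(n)], v

# Assumed numbering: ring atoms 1-3, exocyclic carbon 4 attached to atom 1.
bonds = [(1, 2), (2, 3), (1, 3), (1, 4)]
n = 4
h = [[0.0] * n for _ in range(n)]
for r, s in bonds:
    h[r - 1][s - 1] = h[s - 1][r - 1] = 1.0        # beta = 1 unit, alpha = 0

x, coef = jacobi_eigh(h)
order = sorted(range(n), key=lambda i: -x[i])      # beta < 0: largest x is lowest in energy
occ = [2, 2, 0, 0]                                 # four pi electrons

q = [sum(n_e * coef[k][i] ** 2 for n_e, i in zip(occ, order)) for k in range(n)]
k_max = max(range(n), key=lambda k: q[k])          # atom with the greatest pi density
d_alpha = 0.1                                      # perturbation, in units of beta
mo_shifts = [coef[k_max][i] ** 2 * d_alpha for i in order]   # Eq. (12-56)
total_shift = d_alpha * q[k_max]                             # Eq. (12-57)
e_pi = sum(n_e * x[i] for n_e, i in zip(occ, order))         # coefficient of beta

print("pi densities:", [round(val, 4) for val in q])
print("most affected atom:", k_max + 1)
print("MO shifts (units of beta):", [round(val, 5) for val in mo_shifts])
print("total: 4 alpha +", round(e_pi, 4), "beta, first-order change", round(total_shift, 4), "beta")
```

Because the routine diagonalizes the full matrix, the same script can be pointed at any other bond list to locate the atom of greatest density and its first-order shifts.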


Calculation of first-order corrections to the MOs proceeds in a straightforward manner
using Eq. (12-19) (see Problem 12-17). One of the results of interest from such a
calculation is the change in $\pi$-electron density at atom $l$ due to a change in the coulomb
integral at atom $k$. The differential expression is

$$\delta q_l = (\partial q_l / \partial \alpha_k)\,\delta\alpha_k = \pi_{l,k}\,\delta\alpha_k \tag{12-59}$$

The quantity of interest, $\pi_{l,k}$, is called the atom-atom polarizability. We will now derive
an expression for $\pi_{l,k}$ in terms of MO coefficients and energies.

We assume that our MOs fall into a completely occupied set $\phi_1, \ldots, \phi_m$ and a
completely empty set $\phi_{m+1}, \ldots, \phi_n$. We take $\delta\alpha$ to be positive, which means that center
$k$ is made less attractive for electrons. According to Eq. (12-19) the $j$th MO is, to first
order,

$$\phi_j' = \phi_j + \sum_{i=1,\, i\neq j}^{n} \frac{\langle \phi_j | H' | \phi_i \rangle}{E_j - E_i}\,\phi_i \tag{12-60}$$

where $E_j$ is the orbital energy of $\phi_j$. As indicated above, the integral vanishes except
over AO $k$. Therefore,

$$\phi_j' = \phi_j + \sum_{i=1,\, i\neq j}^{n} \frac{c_{kj} c_{ki}\,\delta\alpha_k}{E_j - E_i}\,\phi_i \tag{12-61}$$
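Expanding $\phi_i$ in terms of AOs, we have

$$\phi_j' = \phi_j + \delta\alpha_k \sum_{i=1,\, i\neq j}^{n} \sum_{l=1}^{n} \frac{c_{kj} c_{ki}}{E_j - E_i}\, c_{li}\chi_l \tag{12-62}$$

We can write this as

$$\phi_j' = \phi_j + \sum_{l=1}^{n} c_{lj}'\chi_l \tag{12-63}$$

where

$$c_{lj}' = \delta\alpha_k \sum_{i=1,\, i\neq j}^{n} \frac{c_{kj} c_{ki} c_{li}}{E_j - E_i} \tag{12-64}$$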


The change in density at atom $l$ equals the square of the perturbed wavefunction minus
that of the unperturbed wavefunction and involves only the doubly occupied MOs 1 to $m$:

$$\delta q_l = 2\sum_{j=1}^{m} \left[ (c_{lj} + c_{lj}')^2 - c_{lj}^2 \right] \tag{12-65}$$

$$\delta q_l = 4\sum_{j=1}^{m} c_{lj} c_{lj}' \tag{12-66}$$

where we have neglected $(c_{lj}')^2$ because the perturbation is assumed small. Substituting
Eq. (12-64) into (12-66) gives

$$\delta q_l = 4\,\delta\alpha_k \sum_{j=1}^{m} \sum_{i=1,\, i\neq j}^{n} \frac{c_{lj} c_{kj} c_{ki} c_{li}}{E_j - E_i} \tag{12-67}$$


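This equation can be simplified further by recognizing that, for $j$ and $i < m + 1$, each
term with denominator $E_j - E_i$ has a mate with denominator $E_i - E_j$. Hence, the
terms differ only in sign and cancel. Therefore,

$$\sum_{j=1}^{m} \sum_{i=1,\, i\neq j}^{m} \frac{c_{lj} c_{kj} c_{ki} c_{li}}{E_j - E_i} = 0 \tag{12-68}$$

and so

$$\delta q_l = 4\,\delta\alpha_k \sum_{j=1}^{m} \sum_{i=m+1}^{n} \frac{c_{lj} c_{kj} c_{ki} c_{li}}{E_j - E_i} \tag{12-69}$$

Comparing Eqs. (12-69) and (12-59) gives

$$\pi_{l,k} = 4 \sum_{j=1}^{m} \sum_{i=m+1}^{n} \frac{c_{lj} c_{kj} c_{ki} c_{li}}{E_j - E_i} \tag{12-70}$$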


Related quantities, derivable in a similar manner, are the bond-atom polarizability,
$\pi_{st,r}$, and the bond-bond polarizability, $\pi_{tu,rs}$:

$$\pi_{st,r} = 2 \sum_{j=1}^{m} \sum_{k=m+1}^{n} \frac{c_{rj} c_{rk} (c_{sj} c_{tk} + c_{tj} c_{sk})}{E_j - E_k} \tag{12-71}$$

$$\pi_{tu,rs} = 2 \sum_{j=1}^{m} \sum_{k=m+1}^{n} \frac{(c_{rj} c_{sk} + c_{sj} c_{rk})(c_{tj} c_{uk} + c_{uj} c_{tk})}{E_j - E_k} \tag{12-72}$$

These refer, respectively, to the change in bond order $p_{st}$ induced by a perturbation
$\delta\alpha_r$, and to the change in bond order $p_{tu}$ induced by a change in $\beta_{rs}$.
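
As a hedged illustration of Eqs. (12-70) and (12-71), the sketch below evaluates both sums for butadiene, chosen here only because the HMO solutions of a linear chain are available in closed form. Energies are written as $E_j = \alpha + x_j\beta$, so the results are in units of $1/\beta$. The function names are hypothetical and the molecule is not one discussed in this section.

```python
import math

n, m = 4, 2   # butadiene: four AOs, two doubly occupied MOs

# Closed-form HMO results for a linear chain of n atoms (E_j = alpha + x_j * beta).
x = [2.0 * math.cos(j * math.pi / (n + 1)) for j in range(1, n + 1)]
c = [[math.sqrt(2.0 / (n + 1)) * math.sin(j * k * math.pi / (n + 1))
      for j in range(1, n + 1)]
     for k in range(1, n + 1)]          # c[k-1][j-1]: coefficient of AO k in MO j

def pi_atom_atom(l, k):
    """Atom-atom polarizability, Eq. (12-70), in units of 1/beta."""
    return 4.0 * sum(c[l - 1][j] * c[k - 1][j] * c[k - 1][i] * c[l - 1][i] / (x[j] - x[i])
                     for j in range(m) for i in range(m, n))

def pi_bond_atom(s, t, r):
    """Bond-atom polarizability, Eq. (12-71), in units of 1/beta."""
    return 2.0 * sum(c[r - 1][j] * c[r - 1][i]
                     * (c[s - 1][j] * c[t - 1][i] + c[t - 1][j] * c[s - 1][i]) / (x[j] - x[i])
                     for j in range(m) for i in range(m, n))

atom_row = [pi_atom_atom(l, 1) for l in range(1, n + 1)]
bond_row = [pi_bond_atom(1, 2, r) for r in range(1, n + 1)]

print("pi_(l,1):  ", [round(val, 4) for val in atom_row])
print("pi_(12,r): ", [round(val, 4) for val in bond_row])
print("row sums:  ", round(sum(atom_row), 12), round(sum(bond_row), 12))
```

Both row sums vanish because occupied and empty MOs are orthogonal. Physically, shifting every coulomb integral by the same amount changes neither the electron densities nor the bond orders.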

The atom-atom polarizability formula (12-70) makes it evident that a large contribution
to the change in electron density at atom $l$ due to a change in the coulomb
integral at atom $k$ is favored if a filled and an empty MO exist that have large absolute
coefficients on both atoms and are close in energy. However, another factor comes into
play that usually confounds this expectation, as the following example shows:


EXAMPLE 12-6 Consider the naphthalene molecule, C10H8, in the HMO approximation.
(See Appendix 6 for data.) If atom 1 were perturbed in a way to make it
more attractive to $\pi$ electrons, which of the atoms 2, 3, or 4 would you expect to
experience the greatest population change?

SOLUTION » The HOMO-LUMO contribution to the charge shift favors atom 4, since its
coefficients in these MOs are larger than those for atoms 2 or 3. However, the actual data (not given
in Appendix 6) are: $\pi_{1,2} = 0.21342$, $\pi_{1,3} = -0.01771$, $\pi_{1,4} = 0.13937$. So atom 2 experiences
the greatest population change, atom 3 the least. Why is this? It results from the fact that we
are summing products that can be positive or negative. Two atoms near each other are less likely
to have nodal planes separating them, so their coefficients are more likely to be of the same sign,
giving fewer sign changes in their products. So the sum has less tendency to cancel itself. This
is the way in which MO theory predicts a tendency for polarization effects to die off as distance
increases: the well-known "inductive effect." This dying-off is not monotonic, as can be seen
in the remaining atom-atom polarizabilities: $\pi_{1,5} = 0.02325$, $\pi_{1,6} = -0.00643$, $\pi_{1,7} = 0.03228$,
$\pi_{1,8} = -0.02674$, $\pi_{1,9} = 0.08889$, $\pi_{1,10} = -0.00356$.


In general, $\pi_{l,k}$ can have either sign. This means that making atom $k$ more attractive for
electrons will, to first order, cause the $\pi$ electron density to decrease at some carbons and
increase at others. (This is necessary since the total $\pi$-charge density must be conserved.)
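
The polarizabilities quoted in Example 12-6 and the conservation of total $\pi$ charge can be checked with Eq. (12-70). The sketch below is an illustration, not the authors' procedure: it assumes the conventional naphthalene numbering with atoms 9 and 10 at the ring fusion (9 bonded to 1 and 8, 10 bonded to 4 and 5) and uses a hand-written Jacobi diagonalizer, `jacobi_eigh`, introduced here. Energies are taken as $E = \alpha + x\beta$, so results come out in units of $1/\beta$. Sign conventions for tabulated polarizabilities differ between sources, so only magnitudes are compared with the quoted values.

```python
import math

def jacobi_eigh(a, sweeps=100, tol=1e-22):
    """Eigenvalues and eigenvectors of a real symmetric matrix (cyclic Jacobi).
    Returns (values, vecs) with vecs[k][i] = coefficient of basis function k in vector i."""
    n = len(a)
    a = [row[:] for row in a]
    v = [[float(i == j) for j in range(n)] for i in range(n)]
    for _ in range(sweeps):
        if sum(a[p][q] ** 2 for p in range(n) for q in range(p + 1, n)) < tol:
            break
        for p in range(n - 1):
            for q in range(p + 1, n):
                if a[p][q] == 0.0:
                    continue
                tau = (a[q][q] - a[p][p]) / (2.0 * a[p][q])
                t = math.copysign(1.0, tau) / (abs(tau) + math.sqrt(tau * tau + 1.0))
                c = 1.0 / math.sqrt(t * t + 1.0)
                s = t * c
                for mat in (a, v):                 # rotate columns p and q
                    for k in range(n):
                        mp, mq = mat[k][p], mat[k][q]
                        mat[k][p], mat[k][q] = c * mp - s * mq, s * mp + c * mq
                for k in range(n):                 # rotate rows p and q of a
                    ap, aq = a[p][k], a[q][k]
                    a[p][k], a[q][k] = c * ap - s * aq, s * ap + c * aq
    return [a[i][i] for i in range(n)], v

# Assumed numbering: 9 and 10 are the ring-fusion carbons.
bonds = [(1, 2), (2, 3), (3, 4), (4, 10), (10, 5), (5, 6),
         (6, 7), (7, 8), (8, 9), (9, 1), (9, 10)]
n, m = 10, 5                                       # ten AOs, five doubly occupied MOs
h = [[0.0] * n for _ in range(n)]
for r, s in bonds:
    h[r - 1][s - 1] = h[s - 1][r - 1] = 1.0

x, coef = jacobi_eigh(h)
order = sorted(range(n), key=lambda i: -x[i])      # beta < 0: largest x is lowest in energy
filled, empty = order[:m], order[m:]

def pi_atom_atom(l, k):
    """Eq. (12-70) with E = alpha + x*beta, i.e., in units of 1/beta."""
    return 4.0 * sum(coef[l - 1][j] * coef[k - 1][j] * coef[k - 1][i] * coef[l - 1][i]
                     / (x[j] - x[i]) for j in filled for i in empty)

pi_1 = {l: pi_atom_atom(l, 1) for l in range(1, n + 1)}
for l in range(1, n + 1):
    print(f"pi({l},1) = {pi_1[l]: .5f}")
print("sum over all atoms:", round(sum(pi_1.values()), 12))
```

The vanishing sum is the charge-conservation statement made above: whatever density atom 1 gains or loses must be balanced by the changes at the other nine atoms.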


12-6 Perturbation Theory for a Degenerate State


Equations (12-19), (12-21), and (12-24) have denominators containing $E_i - E_j$. If $\psi_i$
and $\psi_j$ are degenerate, this leads to difficulty, alerting us to the fact that our earlier
derivations do not take proper account of states of interest that are degenerate. The
problem results from our initial expansion of $\phi_i$ as a power series in $\lambda$. The leading
term in the expansion, $\phi_i^{(0)}$, is the wavefunction that $\phi_i$ becomes in the limit when
$\lambda \to 0$. In nondegenerate systems there is no ambiguity; $\phi_i^{(0)}$ has to be $\psi_i$. But if $\psi_i$ is
degenerate with $\psi_j$, any linear combination of them is also an eigenfunction. We need
a method to determine how $\psi_i$ and $\psi_j$ should be mixed together to form the correct
zeroth-order functions $\phi_i^{(0)}$ and $\phi_j^{(0)}$.

To find the conditions that $\phi_i^{(0)}$ and $\phi_j^{(0)}$ must satisfy, we return to the first-order
perturbation equation (12-11) but with $\phi_i^{(0)}$ in place of $\psi_i$:

$$(H' - W_i^{(1)})\phi_i^{(0)} + (H_0 - E_i)\phi_i^{(1)} = 0 \tag{12-73}$$

Multiplying from the left by $\phi_j^{(0)*}$ and integrating gives (taking $\phi_i^{(0)}$ and $\phi_j^{(0)}$ to be
orthogonal and $H_0$ hermitian)

$$\langle \phi_j^{(0)} | H' | \phi_i^{(0)} \rangle + (E_j - E_i)\langle \phi_j^{(0)} | \phi_i^{(1)} \rangle = 0 \tag{12-74}$$

If $E_i = E_j$, this gives

$$\langle \phi_j^{(0)} | H' | \phi_i^{(0)} \rangle = 0 \tag{12-75}$$


This means that the first-order perturbation equation is satisfied for degenerate states
only when the wavefunctions “diagonalize” the perturbation operator $H'$. Therefore,
our problem reduces to finding those linear combinations of $\psi_i$ and $\psi_j$ that will
diagonalize $H'$. We have already seen two ways to accomplish this in earlier chapters. One
way is to construct the matrix $\mathbf{H}'$, where $(\mathbf{H}')_{ij} = \langle \psi_i | H' | \psi_j \rangle$, and then find the unitary
matrix $\mathbf{C}$ that diagonalizes $\mathbf{H}'$ in the similarity transformation $\mathbf{C}^{\dagger}\mathbf{H}'\mathbf{C}$. The elements in
$\mathbf{C}$ are then the coefficients for the linear combinations of $\psi_i$ and $\psi_j$, and the diagonal
elements of the diagonalized matrix are equal to $\langle \phi_i^{(0)} | H' | \phi_i^{(0)} \rangle$ and $\langle \phi_j^{(0)} | H' | \phi_j^{(0)} \rangle$;
they are the first-order corrections to the energy of $\phi_i^{(0)}$ and $\phi_j^{(0)}$. The other method,
equivalent to the above, is to set up the determinantal equation

$$\begin{vmatrix} H'_{ii} - E & H'_{ij} \\ H'_{ji} & H'_{jj} - E \end{vmatrix} = 0 \tag{12-76}$$

and solve for the roots $E$. Substituting these back into the simultaneous equations
corresponding to the determinant gives the coefficients with which $\psi_i$ and $\psi_j$ should
be combined. The roots $E$ are the first-order corrections to the energy and are identical
to the diagonal matrix elements mentioned above.

In the event that the roots are equal, the perturbation has failed to split the degeneracy 
to first order and it is then necessary to proceed to higher order. We will not pursue this 
problem to higher orders, however. 

If there are n degenerate states, the above procedure is simply carried out with an 
n x n matrix or determinant. 
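
For two degenerate states the determinantal equation (12-76) can be solved in closed form. The sketch below is one possible way to do it, assuming a real symmetric $\mathbf{H}'$. The function name and the numerical inputs are hypothetical and serve only as an illustration.

```python
import math

def first_order_degenerate_2x2(h11, h12, h22):
    """Solve Eq. (12-76) for a real symmetric 2x2 perturbation matrix.

    Returns [(E_lower, coeffs), (E_upper, coeffs)], where coeffs = (c_i, c_j) are the
    weights of psi_i and psi_j in the corresponding proper zeroth-order function."""
    mean = 0.5 * (h11 + h22)
    half_gap = 0.5 * (h11 - h22)
    root = math.hypot(half_gap, h12)
    # Rotation angle that removes the off-diagonal element: tan(2*theta) = 2*h12/(h11 - h22)
    theta = 0.5 * math.atan2(2.0 * h12, h11 - h22)
    upper = (math.cos(theta), math.sin(theta))
    lower = (-math.sin(theta), math.cos(theta))
    return [(mean - root, lower), (mean + root, upper)]

# Hypothetical case: equal (zero) diagonal elements and an off-diagonal element of 0.03.
for energy, (c_i, c_j) in first_order_degenerate_2x2(0.0, 0.03, 0.0):
    print(f"W(1) = {energy:+.4f}   phi(0) = {c_i:+.4f} psi_i {c_j:+.4f} psi_j")

# If the off-diagonal element vanishes and the diagonal elements are equal, the two
# roots coincide and the degeneracy is not split to first order.
print(first_order_degenerate_2x2(0.01, 0.0, 0.01))
```

When the diagonal elements are equal, the proper combinations are the equal-weight sum and difference of the two functions, whatever the size of the off-diagonal element.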

We will now give an example of perturbation theory for a degenerate state. 


12-7 Polarizability of the Hydrogen Atom in the n = 2 States


In Chapter 7, the polarizability of the hydrogen atom in the 1s state was calculated by the
variation method (see Section 7-4 and Problem 7-13). We now use perturbation theory
for degenerate states to calculate to first order the polarizabilities of the $n = 2$ states.
The perturbation due to a $z$-directed uniform electric field is, in atomic units,

$$H' = -Fr\cos\theta \tag{12-77}$$

There are four degenerate states at the $n = 2$ level, giving us a 4 × 4 secular determinant.
If we choose the real AOs as our basis set and let $s$, $x$, $y$, $z$ represent these functions,
the determinantal equation is

$$\begin{vmatrix}
H'_{ss} - E & H'_{sz} & H'_{sx} & H'_{sy} \\
H'_{zs} & H'_{zz} - E & H'_{zx} & H'_{zy} \\
H'_{xs} & H'_{xz} & H'_{xx} - E & H'_{xy} \\
H'_{ys} & H'_{yz} & H'_{yx} & H'_{yy} - E
\end{vmatrix} = 0 \tag{12-78}$$

where

$$H'_{yz} = \langle y | H' | z \rangle, \text{ etc.} \tag{12-79}$$

The operator $H'$ is antisymmetric for reflection in the $xy$ plane, but symmetric for
reflection in the $xz$ or $yz$ planes. It follows from this that all integrals in Eq. (12-78)
vanish except for $H'_{sz}$ and $H'_{zs}$, which are equal. Assigning a value of $x$ (not to be
confused with the $x$ coordinate) to these integrals, our equation becomes

$$\begin{vmatrix}
-E & x & 0 & 0 \\
x & -E & 0 & 0 \\
0 & 0 & -E & 0 \\
0 & 0 & 0 & -E
\end{vmatrix} = 0 \tag{12-80}$$
The block diagonal form indicates that this is a product of one 2 × 2 determinant and
two 1 × 1's. Evidently, the proper zeroth-order wavefunctions must be a mixture of 2s
and $2p_z$ AOs, with $2p_x$ and $2p_y$ being acceptable as they stand. The roots of the 1 × 1
determinants are zero, the roots of the 2 × 2 are $E_+ = +x$, $E_- = -x$. These lead to
coefficients that produce the zeroth-order wavefunctions

$$\phi_+^{(0)} = (1/\sqrt{2})(2s + 2p_z) \tag{12-81}$$

$$\phi_-^{(0)} = (1/\sqrt{2})(2s - 2p_z) \tag{12-82}$$

The roots $+x$ and $-x$ are the first-order corrections to the energy. We now proceed
to calculate these quantities:

$$x = \langle 2s | H' | 2p_z \rangle = \cdots = 3F \tag{12-83}$$


Our perturbation calculation indicates that the n = 2 level splits into three levels 
under the influence of an electric field. Since this splitting occurs to first order, it is 
sometimes called a first-order Stark effect. Since x is proportional to F, the splitting 
is linear in F, as indicated in Fig. 12-6. 
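
The steps elided in Eq. (12-83) can be checked by numerical integration. The sketch below is an illustration under stated assumptions: it uses the standard normalized hydrogen radial functions in atomic units, with the phase convention in which the 2s radial function is proportional to $(2 - r)$, and takes the angular factor $\langle Y_{00} | \cos\theta | Y_{10} \rangle = 1/\sqrt{3}$ as known. With the opposite relative phase the sign of $x$ reverses, but the pair of roots $\pm x$ is unchanged.

```python
import math

def r_2s(r):
    """Normalized hydrogen 2s radial function (atomic units)."""
    return (2.0 - r) * math.exp(-r / 2.0) / (2.0 * math.sqrt(2.0))

def r_2p(r):
    """Normalized hydrogen 2p radial function (atomic units)."""
    return r * math.exp(-r / 2.0) / (2.0 * math.sqrt(6.0))

def simpson(f, a, b, n=4000):
    """Composite Simpson rule with an even number n of intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

# Radial part of <2s| r cos(theta) |2p_z>; the upper limit 60 a.u. stands in for infinity.
radial = simpson(lambda r: r_2s(r) * r_2p(r) * r ** 3, 0.0, 60.0)
angular = 1.0 / math.sqrt(3.0)              # <Y_00| cos(theta) |Y_10>
x_over_F = -radial * angular                # H' = -F r cos(theta), Eq. (12-77)

# First-order energies of the four n = 2 states, in units of F
levels = sorted([x_over_F, -x_over_F, 0.0, 0.0])
print("x / F =", round(x_over_F, 6))
print("first-order levels / F:", [round(v, 6) for v in levels])
```

The four first-order energies fall into three distinct values, in line with the three-level splitting described above.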

It is instructive to compare these results with the behavior of the 1s state. In the first
place, the effect of a uniform electric field on the 1s level is zero, to first order, because
$\langle 1s | H' | 1s \rangle$ vanishes for reasons of symmetry. Only when first-order corrections are
made to the 1s wavefunction are energy effects seen, and these occur in the second-order 
energy terms. Therefore, the 1s state gives a second-order Stark effect, but no first- 
order effect. The 1s state gives no first-order effect because the spherically symmetric 
zeroth-order wavefunction has no electric dipole to interact with the field. But the 
proper zeroth-order wave functions for some of the n = 2 states, given by Eqs. (12-81) 
and (12-82), do provide electric dipoles in opposite directions that interact with the 
field to produce first-order energies of opposite signs. 

Notice that the energy change for the second-order effect goes as $F^2$ [Eq. (7-62)],
whereas that for the first-order effect goes as $F$. These dependences are indicative of
an induced dipole and a permanent dipole, respectively. The induced dipole for the
1s state depends on the field strength $F$, since, as $F$ increases, the dipole moment
increases (due to mixing in higher states). This induced dipole, which depends on $F$,
then interacts with the field of strength $F$. Since both the size of the induced dipole and
the energy of interaction with the field depend on $F$ to the first power, the energy goes
as $F^2$. For the $n = 2$ wavefunctions $\phi_+^{(0)}$ and $\phi_-^{(0)}$, however, the dipole is permanent.†
(The mixing to produce these states occurs even in the limit as $F$ goes to zero.) This
permanent dipole interacts with the field to give an energy depending on $F$ instead
of $F^2$.

Figure 12-6 » Energy to first order of n = 2 level of hydrogen as a function of uniform electric
field strength (z-directed field). (Horizontal axis: field strength F.)

The $2p_x$ and $2p_y$ AOs are not affected to first order in the perturbation. At higher
orders, these AOs (as well as $\phi_+^{(0)}$ and $\phi_-^{(0)}$) mix in higher-energy AOs of symmetries
appropriate to produce induced dipoles. However, due to the equivalence of $2p_x$ and $2p_y$
with respect to $H'$, the induced dipole is the same in both cases so that the degeneracy
is still not lifted.

The qualitatively different behaviors of the n = 1 and n = 2 levels of hydrogen 
result from the fact that degenerate eigenfunctions can be mixed together at no energy 
expense, whereas the n = 1 state can produce a dipole only by mixing in higher-energy 
states. This suggests that the polarizability of an atom or molecule should be strongly 
dependent on the availability of fairly low-energy unoccupied orbitals or wavefunctions, 
and this is indeed the case. As we proceed down the series H$^-$, He, Li$^+$, Be$^{2+}$, ..., etc.,
we find that the distance in energy between the occupied state and the higher-energy
states increases. Also, we find that the systems become less and less polarizable. Again,
if we compare helium and beryllium, both of which are ordinarily considered to have
$s^2$ valence-state configurations, we find beryllium to be much more polarizable. This
is due to the presence of empty 2p orbitals lying fairly close in energy to the 2s AO 
in beryllium. In fact, these two atoms are strongly reminiscent of the n = 1, n = 2 
polarizabilities in hydrogen, the chief difference being that the 2s and 2p levels of 
beryllium are only close in energy—not actually degenerate. 


12-8 Degenerate-Level Perturbation Theory by Inspection


Degenerate and nondegenerate perturbation theories share the feature that the first-order 
correction to the energy is the expectation value of the perturbation operator using the 
unperturbed wavefunction. Often we can evaluate or estimate this expectation value by 
inspection. The special problem for degenerate wavefunctions or orbitals is that there 
is an infinite number of ways to express the unperturbed wavefunctions, and only one 
of these ways is proper for finding $W^{(1)}$. The mathematical procedure for finding the
proper set of functions tends to complicate and obscure the situation. However, it is 
often easy to deduce the proper unperturbed wavefunctions by inspection. 

The clue for doing this comes from Eq. (12-76). It is apparent that this equation, 
which leads to the first-order corrections to the energy and to the proper unperturbed 
(or zeroth-order) wavefunctions, is exactly like the secular determinantal equation we 
would use for a variational calculation. Since the secular equation comes from a 
calculus-based determination of energy extrema, we know that the energies that come 
out of Eq. (12-76) must be the minimum and maximum possible values and that the 
proper zeroth-order functions must be those that give the maximum and minimum 
expectation values for $H'$. This leads immediately to a powerful insight: For a given
perturbation, the proper zeroth-order degenerate wavefunctions to use for predicting
the effect on energies are that set giving the greatest difference in response.

†In the language of hybridization (Chapter 13), $\phi_+^{(0)}$ and $\phi_-^{(0)}$ are sp hybrids.

This rule can be seen operating in the hydrogen atom polarization example of the
preceding section. The proper combination of 2s and $2p_z$ AOs is the pair of sp hybrid
combinations giving the greatest energy increase and decrease. (The $2p_x$ and $2p_y$ AOs
do not mix any 2s or $2p_z$ into themselves because to do so will have no energy-raising
or -lowering effect.)

Another example is seen when changing the coulomb integral at an atom in the HMO- 
level treatment of a cyclic molecule. Such molecules always have some degenerate 
MOs, and so one must decide which linear combination is "proper" for evaluating 
the effect of this perturbation. The situation is analyzed most easily by recognizing 
that mixing such degenerate MOs together corresponds to rotating the positions of 
the nodes and antinodes in the molecule. So the question of finding the proper MOs 
becomes one of asking where the nodes and antinodes should be located. The greatest 
difference in response to changing the coulomb integral at an atom comes when one 
of the degenerate pair of MOs has a node (i.e., is zero) at that atom and the other 
MO has an antinode (i.e., has maximum absolute value) there. Suppose, for example, 
you are asked to evaluate the effect on a particular degenerate pair of MOs of such a 
perturbation at carbon number 3 in a cyclic molecule. When you look at those MOs 
in a tabulation, you may find that atom 3 is not at the node or antinode position for 
either MO. Instead, you might observe that the node of one MO and antinode of the 
other occur at atom number 4. You could set about mixing these MOs to move the 
node-antinode positions to atom 3, but it is much easier and just as correct to pretend 
that the perturbation is occurring at atom 4, since, in a cycle, all atoms are equivalent. 
This allows you to use the tabulated MOs to evaluate the perturbation (now at atom 4) 
without any modification. Once this is recognized, the problem can be completed by 
inspection, just as in nondegenerate cases: The MO energy for the MO that has its node 
at atom 4 is unaffected to first order by the perturbation, and the other is affected by an 
amount proportional to the square of the coefficient at atom 4. Thus, the MOs become 
nondegenerate and are described to zeroth order by these "proper" functions. 

It is important to realize that the identity of the proper unperturbed wavefunctions 
depends on the nature of the perturbation. Thus, for the cyclic system discussed above 
(which we now assume has an even number of atoms), stretching the molecule so that 
two bonds on opposite sides of the cycle get longer is treated perturbatively with a 
different pair of proper MOs. We can quickly guess which MOs those are by seeking 
the pair that respond most differently to the bond stretching. Since any degenerate MO 
in this even-alternant cycle has at least one nodal plane perpendicular to the molecular 
plane, there is always a rotational position that places a nodal plane across the center 
of the two bonds being stretched. The mate to this MO then has an antinode in these 
bonds. That means that one MO is antibonding in these bonds and the other MO is 
bonding. This in turn means that one MO's energy will drop as the bond lengthens and 
the other MO's energy will rise. This, then, is the proper MO set to use for evaluating 
this perturbation to first order. Furthermore, these MOs are nondegenerate after the 
perturbation and are the zeroth-order approximation to the MOs of the perturbed system. 

The two perturbations described above bring up a final important point. Suppose 
we have two different perturbations occurring simultaneously in a system involving 
degenerate MOs or state functions. The question then arises as to whether the proper 
zeroth-order functions are the same or different for the two perturbations. We have just
seen that, if the perturbations involve substituting a new atom for a carbon and stretch- 
ing a pair of bonds in a cyclic molecule, the proper zeroth-order MOs are different. This 
means that one of these perturbations favors a postperturbative pair of nondegenerate 
MOs that is different from the pair being favored by the other perturbation. In other 
words, these perturbations compete to produce different final MOs. The other possi- 
bility is that two perturbations have the same set of proper zeroth-order wavefunctions, 
which means that they cooperate to produce the same final MOs. The cooperative 
situation is the one that gives the largest energy effects, since a single set of proper 
zeroth-order functions is selected that produces the greatest difference in response to 
both perturbations simultaneously. Competing perturbations require a compromise 
among zeroth-order functions which involves diminution in the differences in energy 
response. 

The reader may have realized that there is another way to select the proper zeroth-order
wavefunctions, and that is to use symmetry. Recall that the proper wavefunctions
$\phi_i^{(0)}$ and $\phi_j^{(0)}$ are those that “diagonalize the determinant,” i.e., that cause
$\langle \phi_i^{(0)} | H' | \phi_j^{(0)} \rangle$ to equal zero. In each of the examples cited above, this can be seen to
be true. The H’ operator corresponding to changing the coulomb integral at an atom, 
and nowhere else, is unaffected by reflection through a plane containing that atom. The 
MO placing a maximum at that atom is symmetric for such a reflection, and the MO 
placing a node there is antisymmetric. Hence the integrand is overall antisymmetric and 
the integral vanishes, in accord with the requirement that must be satisfied by proper 
zeroth-order wavefunctions. A similar argument applies for stretching or shrinking 
bonds, with the reflection plane now occurring at the bond midpoints. As a useful guide, 
then, proper zeroth-order wavefunctions may be recognized as those having opposite 
symmetry for an operation by which the perturbation operator is unaffected. 


EXAMPLE 12-7 Consider the cyclopentadienyl radical in the HMO approximation.
Assume it to be planar and pentagonal. (See Appendix 6 for data.) If $\alpha_3$ is perturbed
to become $\alpha + 0.1000\beta$, what are the MO energy changes, to first order?

SOLUTION » The lowest-energy MO is nondegenerate, and all atoms have the same coefficient,
so $\Delta E_1 = (0.4472)^2(0.1000\beta) = 0.0200\beta$. The second two MOs are degenerate, so we need to
identify the proper zeroth-order combinations. One should be symmetric with respect to reflection
through atom 3, the other antisymmetric. None of the tabulated MOs have these properties about
atom 3, but they do have these properties for atom 1. Therefore, we proceed by pretending that atom
1 is the one being perturbed (it makes no difference). Then, $\Delta E_2 = (0.6325)^2(0.1000\beta) = 0.0400\beta$,
$\Delta E_3 = 0.0000\beta$, $\Delta E_4 = \Delta E_2$, $\Delta E_5 = \Delta E_3$.
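
The shortcut used in Example 12-7 can be verified numerically. The sketch below assumes the standard real (cosine and sine) forms of the HMOs of a regular ring, which are not listed in this section, and uses hypothetical helper names. It compares two routes: evaluating the shifts by inspection with MOs whose antinode or node sits at the perturbed atom, and solving the 2 × 2 determinant of Eq. (12-76) with MOs centered on atom 1 while the perturbation acts at atom 3.

```python
import math

N = 5            # ring size (cyclopentadienyl)
d_alpha = 0.1    # perturbation, in units of beta
site = 3         # perturbed atom (1-based)

def ring_mo(kind, j, center):
    """Real HMO of a regular N-ring with angular index j, placing its antinode
    (kind='cos') or node (kind='sin') at atom `center`."""
    f = math.cos if kind == "cos" else math.sin
    norm = math.sqrt(1.0 / N) if j == 0 else math.sqrt(2.0 / N)
    return [norm * f(2.0 * math.pi * j * (k - center) / N) for k in range(1, N + 1)]

def first_order_roots(mo_a, mo_b, atom, delta):
    """Roots of the 2x2 determinant, Eq. (12-76), for H' = delta at a single atom."""
    ca, cb = mo_a[atom - 1], mo_b[atom - 1]
    haa, hbb, hab = ca * ca * delta, cb * cb * delta, ca * cb * delta
    mean, root = 0.5 * (haa + hbb), math.hypot(0.5 * (haa - hbb), hab)
    return mean - root, mean + root

# Route 1: proper functions chosen by inspection (antinode or node at the perturbed atom).
shift_lowest = ring_mo("cos", 0, site)[site - 1] ** 2 * d_alpha
shift_antinode = ring_mo("cos", 1, site)[site - 1] ** 2 * d_alpha
shift_node = ring_mo("sin", 1, site)[site - 1] ** 2 * d_alpha
off_diagonal = ring_mo("cos", 1, site)[site - 1] * ring_mo("sin", 1, site)[site - 1] * d_alpha

# Route 2: MOs centered on atom 1, perturbation at atom 3, explicit 2x2 solution.
roots = first_order_roots(ring_mo("cos", 1, 1), ring_mo("sin", 1, 1), site, d_alpha)

print("nondegenerate MO shift:", round(shift_lowest, 4))
print("by inspection:         ", round(shift_node, 4), round(shift_antinode, 4))
print("from 2x2 determinant:  ", [round(r, 4) for r in roots])
```

Both routes give the same pair of first-order shifts, which is the sense in which it "makes no difference" which atom of the ring is treated as the perturbed one.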


12-9 Interaction Between Two Orbitals: An Important Chemical Model


Qualitative quantum-chemical discussion often relies on a simplified model wherein all 
interactions are neglected except for the primary one. For example, in considering how 
a Lewis base and a Lewis acid interact, one might consider only the highest occupied 
MO (HOMO) of the base (electron donor) and the lowest unfilled MO (LUMO) of the
acid (electron acceptor). Consideration of the ways two levels interact will provide
some useful rules of thumb and also consolidate some of our earlier findings.⁸

We label the unperturbed orbitals and energies $\psi_a$, $\psi_b$, and $E_a$, $E_b$. We will work
within the Hückel type of framework so that state energy differences may be written as
orbital energy differences. The orbitals are allowed to interact with each other by virtue
of close approach. We will assume that the perturbation felt by the orbitals is proportional
to the overlap between them. (If the systems involved are ions, strong electrostatic
interactions will exist as well. Therefore, we assume all systems to be neutral.)

Since the overlap of $\psi_a$ on $\psi_b$ is the same as that of $\psi_b$ on $\psi_a$, the energy change
to first order is the same for both levels and hence we ignore it (unless the levels are
degenerate, an eventuality we discuss shortly).

The second-order contributions to the energies are:

$$W_a^{(2)} = \frac{|\langle \psi_a | H' | \psi_b \rangle|^2}{E_a - E_b} \tag{12-84}$$

$$W_b^{(2)} = \frac{|\langle \psi_a | H' | \psi_b \rangle|^2}{E_b - E_a} \tag{12-85}$$

Thus, the second-order energy $W_a^{(2)}$ is positive if $E_a > E_b$, negative if $E_a < E_b$. In
other words, the higher-energy level is pushed up, or destabilized, through interaction
with the level below, and the lower-energy level is stabilized by interaction with the
level above. This behavior, which we noted also in connection with the perturbed
particle in a wire, may be summarized by the statement: interacting levels repel each
other. Equations (12-84) and (12-85) also make clear that the levels repel each other
more strongly the greater their interaction (numerator) and the smaller their energy
separation (denominator).

If $E_a = E_b$, we must set up a 2 × 2 first-order perturbation determinant using $\psi_a$
and $\psi_b$ as basis, and find the two roots. Just as was true in the polarizability calculation
of Section 12-7, we will find that one root lies above $E_a$, and the other lies an equal
distance below. Thus, for degenerate levels, the repulsion between levels becomes a
first-order effect.
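
The level-repulsion rule can be illustrated by comparing Eqs. (12-84) and (12-85) with the exact eigenvalues of the corresponding 2 × 2 problem. In the sketch below the energies and the interaction element are made-up numbers in arbitrary units, the diagonal elements of $H'$ are taken as zero (the common first-order shift is ignored, as in the text), and the function name is hypothetical.

```python
import math

def two_level(e_a, e_b, h_ab):
    """Return (second_order, exact) energies for two interacting levels.

    second_order is (E_a + W_a(2), E_b + W_b(2)) from Eqs. (12-84) and (12-85), or None
    for degenerate levels, where those formulas diverge. exact holds the eigenvalues of
    [[e_a, h_ab], [h_ab, e_b]], listed in the same (a, b) order."""
    if e_a == e_b:
        second_order = None
    else:
        w_a = h_ab ** 2 / (e_a - e_b)
        second_order = (e_a + w_a, e_b - w_a)
    mean = 0.5 * (e_a + e_b)
    root = math.hypot(0.5 * (e_a - e_b), h_ab)
    exact = (mean + root, mean - root) if e_a >= e_b else (mean - root, mean + root)
    return second_order, exact

# Hypothetical nondegenerate case: level a lies above level b.
pt2, exact = two_level(-10.0, -12.0, 0.5)
print("second order:", pt2)
print("exact:       ", tuple(round(e, 4) for e in exact))

# Hypothetical degenerate case: the splitting is symmetric and first order in h_ab.
_, exact_deg = two_level(-10.0, -10.0, 0.5)
print("degenerate:  ", exact_deg)
```

In the nondegenerate case the upper level moves up and the lower level moves down by nearly the second-order amounts. In the degenerate case the two roots lie equal distances above and below the original energy, with a splitting linear in the interaction element.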

The first-order mixing of $\psi_a$ and $\psi_b$ parallels the energy results in the expected
way. If the second-order energy stabilizes the level, the orbitals are mixed in a bonding 
fashion. Since we have seen that it is the lower-energy orbital that is stabilized, we can 
state that, if two orbitals interact, the lower-energy one of the two mixes into itself the 
higher-energy one in a bonding way, while the higher-energy orbital mixes into itself 
the lower one in an antibonding way [5]. In short, “the upper combination takes the 
node" [5]. 

As an example of the utility of this model, consider the norbornadiene molecule
(see Fig. 12-7a). According to our understanding of unsaturated systems, the double
bonds in this molecule should behave like isolated ethylene double bonds and not
like the conjugated bonds of butadiene. But these bonds are at an orientation and
proximity allowing significant overlap between $2p_\pi$ AO lobes beneath the molecule. If
we treat this system with our two-level model, the unperturbed MOs are the bonding
and antibonding $\pi$ MOs of two ethylene molecules (Fig. 12-7b). We expect the MO
overlap across the molecule to split each of these levels as indicated in Fig. 12-7c, the
bonding interaction producing stabilization in each case. (Combinations involving a
bonding MO on one side and an antibonding MO on the other are ruled out because
symmetry forces the interaction term to vanish. Even if this were not the case, the
energy gap between these unperturbed MOs would make such contributions negligible
compared to those from the degenerate MOs.)

⁸The discussion in this section follows closely that of Hoffmann [5]. That article reviews a variety of illustrative
applications of the two-orbital model.

Figure 12-7 » $\pi$ MOs in norbornadiene: (a) norbornadiene; (b) basis of unperturbed MOs; (c) basis
of perturbed MOs; (d) quadricyclene (after R. Hoffmann [5]).

A careful look at Fig. 12-7c reveals certain implications about norbornadiene. For 
instance, the ionization energy for norbornadiene should be smaller than that for 
norbornene, which has but one double bond. Experimental measurements support this
contention. Also, since the highest occupied MO of norbornadiene is antibonding 
between carbons 3 and 5 and also 6 and 2, whereas the lowest empty MO is bond- 
ing between these carbons, excitation of an electron from the former MO to the latter 
should promote formation of quadricyclene (Fig. 12-7d). This compound is a common 
product in the photochemistry of norbornadienes. 


12-10 Connection Between Time-Independent Perturbation Theory and Spectroscopic
Selection Rules


Molecules may change to higher- or lower-energy states under the influence of incident
light. Such processes are called, respectively, absorption and induced emission.
Perturbation theory can be used to study such transitions induced by an external oscillating
electromagnetic field. Here we briefly describe a rather simple connection between
selection rules and the perturbation theory we have discussed in this chapter.

Let a molecule initially be in a state with wavefunction $\psi_i$. We are interested in the
probability of a transition occurring to some final state with wavefunction $\psi_f$. A
time-dependent perturbation treatment (not given here, but see Section 6-16) indicates that
such a transition is probable only when the external field frequency $\nu$ satisfies the
conservation of energy relation

$$\nu = |E_f - E_i| / h \tag{12-86}$$


Even when this condition is satisfied, however, we may find experimentally that the 
transition is so improbable as to be undetectable. Evidently some factor other than 
satisfaction of Eq. (12-86) is also involved. 

Since the molecule is being subjected to light, it experiences fluctuating electric and 
magnetic fields. For ordinary (as opposed to magnetic) spectroscopy, the effects of 
the magnetic field are negligible compared to those of the electric field. Therefore, 
we ignore the former and imagine the molecule at a particular instantaneous value 
of an external electric field. As we have seen from earlier sections, this field causes 
polarization through the admixture of unperturbed wavefunctions $\psi_1$, $\psi_2$, etc., with $\psi_i$
to form a perturbed wavefunction $\phi_i$. For a normalized $\phi_i$, we have, therefore,

$$\phi_i = c_i\psi_i + c_1\psi_1 + c_2\psi_2 + \cdots \tag{12-87}$$

At a later time, when the perturbation is over, the system returns to an unperturbed
state. The probability of returning to the initial state is given by $c_i^{*}c_i$. The probability
of going instead to the final state described by $\psi_f$ is given by $c_f^{*}c_f$. If $c_f$ is zero, there
is no tendency for a transition to that final state to occur and the transition is said to be
forbidden. If $c_f$ is nonzero, the transition is allowed.

Since the coefficient $c_f$ is given to first order by [see Eq. (12-18)]

$$c_f = \frac{\langle \psi_f | H' | \psi_i \rangle}{E_i - E_f} \tag{12-88}$$

where $H'$ is the perturbation operator for the electric field component of the light,
we can say at once that a transition between two states is forbidden (to first order) if
$\langle \psi_f | H' | \psi_i \rangle$ vanishes.
One example of a forbidden transition is that between a singlet state and a triplet
state of a system. We can see this at once since $H'$ is an electric field and does not
interact with spin magnetic moment. Therefore the orthogonality between $\psi_i$ and $\psi_f$
due to spin functions is uninfluenced by $H'$ and the integral $\langle \psi_f | H' | \psi_i \rangle$ must vanish.
In effect, the perturbation $H'$ causes the initial singlet state to become polarized by
mixing in other singlet state functions, but gives no impetus for mixing in states of
different multiplicity. Transitions between states of different multiplicity are said to be
spin forbidden.

Another example of a forbidden transition is that between two different s-type states
of a hydrogen atom. Such states have spherically symmetric wavefunctions, but $H'$
(the electric field) is antisymmetric for reflection through a plane (to within an additive
constant), and so $\langle \psi_f | H' | \psi_i \rangle$ must vanish for reasons of symmetry. It is easy
to generalize this argument to other states of the hydrogen atom (p to p, d to d,
etc.) and also to certain other atoms (e.g., the alkali metals) electronically similar to
hydrogen.
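
The symmetry argument for hydrogen can be checked by brute-force integration. The sketch below is only an illustration: it assumes the standard real hydrogen wavefunctions in atomic units and a field along $z$, so that $H'$ is proportional to $z = r\cos\theta$, and evaluates $\langle \psi_f | z | \psi_i \rangle$ on a simple midpoint grid. The grid sizes, the cutoff radius, and the function names are arbitrary choices made here.

```python
import math

# Real hydrogen wavefunctions in atomic units; ct stands for cos(theta).
def psi_1s(r, ct):
    return math.exp(-r) / math.sqrt(math.pi)

def psi_2s(r, ct):
    return (2.0 - r) * math.exp(-r / 2.0) / (4.0 * math.sqrt(2.0 * math.pi))

def psi_2pz(r, ct):
    return r * ct * math.exp(-r / 2.0) / (4.0 * math.sqrt(2.0 * math.pi))

def z_integral(psi_f, psi_i, r_max=40.0, n_r=800, n_t=100):
    """Midpoint-rule estimate of <psi_f| z |psi_i> for real, axially symmetric functions."""
    dr, dt = r_max / n_r, math.pi / n_t
    total = 0.0
    for a in range(n_r):
        r = (a + 0.5) * dr
        for b in range(n_t):
            theta = (b + 0.5) * dt
            ct, st = math.cos(theta), math.sin(theta)
            # volume element r^2 sin(theta) dr dtheta; the phi integral gives 2*pi
            total += psi_f(r, ct) * (r * ct) * psi_i(r, ct) * r * r * st
    return 2.0 * math.pi * total * dr * dt

s_to_s = z_integral(psi_2s, psi_1s)     # vanishes by symmetry: forbidden
s_to_p = z_integral(psi_2pz, psi_1s)    # nonzero: allowed for z-polarized light
print(f"<2s|z|1s>   = {s_to_s:.2e}")
print(f"<2p_z|z|1s> = {s_to_p:.4f}")
```

The s-to-s integrand is antisymmetric under reflection through the $xy$ plane, so its contributions cancel in pairs. The s-to-p integrand is symmetric, so its integral survives.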

Molecular electronic transitions can be understood from the same standpoint. The 
intense π* ← π transition in ethylene is a simple example.⁹,¹⁰ Let us imagine that the 
molecule is oriented as shown in Fig. 12-8. Suppose we could somehow orient all our 
molecules this way (in a host matrix or in a crystal) and that we then subjected the sample 
to plane-polarized light. We will consider what should happen for light polarized in 
each of the directions x, y, z. First, let us consider the integral ⟨π|z|π*⟩. The ethylene 
molecule has three reflection planes of symmetry, so we can examine the symmetry 
of the integrand with respect to each of these three reflections. Remember that, if the 
integrand is antisymmetric for any one of these reflections, the integral vanishes. The 
function π can be seen, from inspection of Fig. 12-8, to be symmetric for reflection in 
the xz and yz planes, antisymmetric for reflection in the xy plane. These observations 
are shown in Table 12-2, along with similar conclusions regarding the symmetries of 
the functions π*, x, y, and z. The symbols “s” and “a” stand for “symmetric” and 
“antisymmetric.” Our integral ⟨π|z|π*⟩ can now be seen to have an integrand that 
is symmetric for reflection in the xz plane but antisymmetric for reflection in the xy 


Figure 12-8 » π and π* MOs for ethylene. The C–C bond is coincident with the x axis and all 
nuclei lie in the xy plane. 


⁹ Here, π* refers to the antibonding π MO, not to complex conjugation. 
¹⁰ In this discussion, we consider molecules of high symmetry, so that reflection planes of symmetry exist. For 
molecules of lower symmetry, one must resort to actual calculation of the c_f in Eq. (12-88). 
TABLE 12-2 » Symmetries of Functions under Reflection through 
Cartesian Coordinate Planes 


                          Symmetry operation 

Function     xy reflection     xz reflection     yz reflection 
π            a                 s                 s 
π*           a                 s                 a 
z            a                 s                 s 
x            s                 s                 a 
y            s                 a                 s 
π z π*       aaa = a           sss = s           ssa = a 
π x π*       asa = s           sss = s           saa = s 
π y π*       asa = s           sas = a           ssa = a 


and yz planes. Hence, this integral vanishes, and this transition is forbidden insofar 
as light polarized perpendicular to the molecular plane is concerned. According to 
Table 12-2, symmetry also causes ⟨π|y|π*⟩ to vanish. However, symmetry does not 
force ⟨π|x|π*⟩ to vanish. Therefore, the π* ← π transition is allowed and is polarized 
along the C–C axis. Indeed, because of the significant spatial extension of the π and 
π* MOs (compared to AOs), the integral ⟨π|x|π*⟩ is relatively large, which means that 
the transition is not only allowed, but is intense. 
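
The bookkeeping in Table 12-2 amounts to multiplying parities plane by plane. The following is a minimal Python sketch of that bookkeeping, not a general symmetry package. The dictionary PARITY and the helper names are illustrative choices, and +1/−1 are assumed to encode the table's “s”/“a” entries for reflection in the xy, xz, and yz planes, in that order.

```python
# Parities under reflection in the (xy, xz, yz) planes, transcribed from
# Table 12-2: +1 means symmetric ("s"), -1 means antisymmetric ("a").
PARITY = {
    "pi":      (-1, +1, +1),
    "pi_star": (-1, +1, -1),
    "x":       (+1, +1, -1),
    "y":       (+1, -1, +1),
    "z":       (-1, +1, +1),
}


def integrand_parity(bra, op, ket):
    """Parity of the product bra*op*ket under each of the three reflections."""
    return tuple(a * b * c for a, b, c in zip(PARITY[bra], PARITY[op], PARITY[ket]))


def not_forced_to_vanish(bra, op, ket):
    """True if the integrand is symmetric for all three reflections."""
    return all(p == +1 for p in integrand_parity(bra, op, ket))


if __name__ == "__main__":
    for axis in "xyz":
        print(axis, integrand_parity("pi", axis, "pi_star"),
              not_forced_to_vanish("pi", axis, "pi_star"))
```

Only the x component survives, in agreement with the last three rows of the table. Symmetry can show that an integral must vanish, but it says nothing about how large a surviving integral is.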

It is physically reasonable that the π* ← π transition should be polarized along the 
C–C axis. If some π* character is mixed with π, it is easy to see that this results in a 
shift of π charge from one carbon to the other, that is, a shift along the x axis. Conversely, an 
electric field in the x direction will cause polarization along the x axis, hence mix π* 
character into the π MO. When the perturbation is removed, a finite probability exists 
that the molecule will go to the state wherein π* is occupied. 

In the above example, we imagined all ethylene molecules to be identically oriented. 
Under those conditions, we would observe a maximum in π* ← π absorption when our 
incident light was polarized parallel to the molecular axis and zero absorption (ideally) 
when the polarization axis was perpendicular to the molecular axis. If the light is 
unpolarized, or if the ethylene is randomly oriented, as in liquid or gaseous states, the 
absorption is isotropic and allowed because there is always a certain degree of “overlap” 
between the molecular axes of most of the molecules and the direction of the electric 
field due to the light. 
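
The “overlap” argument can be made quantitative under one standard assumption that is not spelled out in the text: the absorption of a single molecule is taken to be proportional to cos²θ, where θ is the angle between the molecular axis and the electric field direction. The sketch below (the function name is hypothetical) averages cos²θ over random orientations by simple midpoint quadrature.

```python
import math


def mean_cos2(n=100_000):
    """Average of cos^2(theta) over all orientations on a sphere.

    The weight sin(theta)/2 is the normalized probability density of the
    polar angle theta for randomly oriented axes.
    """
    h = math.pi / n
    total = 0.0
    for k in range(n):
        theta = (k + 0.5) * h          # midpoint of the k-th interval
        total += math.cos(theta) ** 2 * math.sin(theta) * h
    return total / 2.0


if __name__ == "__main__":
    print(mean_cos2())   # close to 1/3
```

Under this assumption a randomly oriented sample absorbs about one third as strongly as a perfectly aligned sample in light polarized along the molecular axis, and the result no longer depends on the polarization direction.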

The example discussed above illustrates a simple and powerful rule for recognizing 
whether or not a given atomic or molecular orbital dipolar transition is allowed (assum- 
ing electron spin agreement): A dipole transition is allowed between orbitals that have 
opposite symmetry for one and only one reflection plane.¹¹ The polarization of the 
transition will be perpendicular to that reflection plane. 


¹¹ This means that, for an allowed transition, one orbital has one and only one nodal plane where the other 
does not. 
EXAMPLE 12-8 Selection rules for allowed dipolar transitions in a hydrogen atom 
include Δl = ±1, Δm_l = 0, ±1. Show that these rules are equivalent to the symmetry 
rule stated above.ᵃ Consider the following cases: 2s ← 1s, 2p_x ← 1s, 
3d_xy ← 2s, 3d_xz ← 2p_z, 3d_xy ← 2p_z. 


ᵃ There is yet another way to rationalize these selection rules. Photons are bosons with spin quantum 
number of 1, hence they carry spin angular momentum. The total angular momentum of the atom or molecule 
that absorbs a photon must change to obey the conservation requirement consistent with the vector addition 
of orbital and photon angular momenta. The Δl rule handles that for one-electron atoms. Furthermore, 
for a unit change of l, the z-component of the absorber's orbital angular momentum can change by +1, 0, 
or −1 a.u. The Δm_l rule takes care of that. This rationalizes the atomic selection rules, but doesn’t yield 
information on polarization or intensity as readily as the more physical approach described in the text, and 
it is less easily applied to molecules. Generally, however, the more ways one understands a thing, the better. 


SOLUTION » The 2s ← 1s transition is forbidden by the Δl rule (because Δl = 0) and also by 
the symmetry rule (because neither wavefunction has a nodal plane). This makes it easy to see 
that the Δl rule applies because it forces one of the orbitals to have one and only one more nodal 
plane than the other. The 2p_x ← 1s transition is allowed by the Δl rule. Since 2p_x is a linear 
combination of 2p_{+1} and 2p_{−1}, the Δm_l rule is also satisfied, so the transition is allowed. When 
we examine symmetry, we note that there is only one plane of disagreement, the yz plane, so this 
approach also indicates that the transition is allowed, and that it is polarized in the x direction 
(perpendicular to the yz plane). The 3d_xy ← 2s transition is forbidden by the Δl rule since Δl = 2. 
The symmetry approach indicates that the transition is forbidden because the 3d_xy AO has two nodal 
planes, whereas 2s has none. The 3d_xz ← 2p_z transition has Δl = 1. 3d_xz is a combination of 3d_{+1} 
and 3d_{−1}, and 2p_z is 2p_0, so Δm_l = ±1. Therefore, this is an allowed transition. The symmetry 
approach shows 3d_xz with two nodal planes (xy and yz) and 2p_z with one (xy). Therefore, there 
is only one plane of symmetry disagreement (yz), so the transition is allowed and is x polarized. 
The 3d_xy ← 2p_z transition has Δl = +1. 3d_xy is a linear combination of 3d_{+2} and 3d_{−2}, and 
2p_z is 2p_0, so Δm_l = ±2. Therefore, the transition is forbidden. The symmetry analysis shows 
3d_xy having two nodal planes (xz and yz), and 2p_z having one (xy). Therefore, they disagree in 
symmetry in three planes, and the transition is forbidden. This last case makes it clear why the Δm_l 
rule is needed: When satisfied, it guarantees that all but one of the nodal planes in the two AOs 
coincide. (If Δm_l = ±2, then, taking the z axis to be vertical, one AO has two more vertical nodal 
planes than the other, so there must be at least two reflection planes of symmetry disagreement.) 
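
The symmetry rule used in this example can be sketched in a few lines of Python. The sketch assumes that each real AO is labeled by the Cartesian monomial in its angular factor ("" for s, "x" for p_x, "xy" for d_xy, and so on). The function names are hypothetical, and radial factors are ignored because they do not affect reflection symmetry.

```python
PLANES = ("xy", "xz", "yz")
# Axis perpendicular to each reflection plane; reflection flips that axis.
NORMAL = {"xy": "z", "xz": "y", "yz": "x"}


def parities(monomial):
    """Reflection parities (+1 or -1) of a Cartesian monomial such as 'xz'."""
    return tuple(-1 if monomial.count(NORMAL[p]) % 2 else +1 for p in PLANES)


def dipole_polarization(upper, lower):
    """Polarization axis if the two orbitals differ in symmetry for exactly
    one reflection plane, otherwise None (transition forbidden by the rule)."""
    differing = [p for p, a, b in zip(PLANES, parities(upper), parities(lower))
                 if a != b]
    return NORMAL[differing[0]] if len(differing) == 1 else None


if __name__ == "__main__":
    cases = [("", ""), ("x", ""), ("xy", ""), ("xz", "z"), ("xy", "z")]
    for upper, lower in cases:
        print(repr(upper), "<-", repr(lower), dipole_polarization(upper, lower))
```

The five cases reproduce the conclusions of the solution: forbidden, x polarized, forbidden, x polarized, forbidden.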


12-10.A Problems 


12-1. Prove that the energy to first order for the lowest-energy state of a perturbed 
system is an upper bound for the exact energy of the lowest-energy state of the 
perturbed system, that is, that E_0 + W_0^(1) ≥ W_0. 


12-2. A one-dimensional box potential is perturbed as sketched in Fig. P12-2. From a 
consideration of the first three wavefunctions of the unperturbed system, ψ1, ψ2, 
ψ3, which will have its energy increased most, to first order, and which least? 
No explicit calculation is necessary to answer this question. 


Figure P12-2 » 


12-3. Evaluate by inspection the first-order contributions to the energies for the states 
shown in Fig. P12-3. In every case the unperturbed state is a particle in the 
indicated state in the one-dimensional, infinitely deep, square well. 


le) (f) 


Figure P12-3 » a) H' — sine function as shown, y; = y1. b) H’ as shown, y; = V4. c) H’ as shown; 
Vj = V1. d) H’ as shown; y; = V. e) H' = cosine as shown, y; = Va. f) H’ as shown; yj = V». 


12-4. a) Evaluate ws” for the particle in a box with the perturbing potential shown in 
Fig. P12-4. 
b) For the perturbation above: 


1) What sign would you expect for e? Describe the effect of this on $1; 
on wo. 
2) What sign would you expect for c9? Describe the effect of this on d»; 
Figure P12-4 » 


12-5. Calculate vi for the perturbed particle in a wire example discussed in Section 12-3. 
Show that eu is only about 2% as large as e Use 

\int_0^{\pi} y \sin(my) \sin(ny) \, dy = \left[ (-1)^{m+n} - 1 \right] \frac{2mn}{(m^2 - n^2)^2}, \quad m, n = 1, 2, \ldots, \; m \neq n 


12-6. An electron moves in a harmonic potential, V = ½kx². What is the effect, to first 
order, on the energies of superimposing an electric field, V′ = Ex? Explain your 
reasoning. 


12-7. Calculate the energy to first order of He+ in its lowest-energy state. Use the 
hydrogen atom in its ground state as your zeroth-order approximation. Use 
atomic units. Predict the signs (plus, zero, minus) of OU and c . Explain 
your reasoning. 


12-8. The previous problem gives an energy to first order of −3/2 a.u. for He+, using 
H as a starting point. Now reverse the process and try to get the ground-state 
energy of H to first order, using He+ as starting point. Discuss the reasonableness 
of your answer in terms of a lower bound. 


12-9. A hydrogen atom in its ground state is perturbed by imposition of a uniform 
z-directed electric field of strength F atomic units: Field = −Fz = −Fr cos θ. 

a) What is the first-order change in energy experienced by the atom? Show 
your logic. 

b) For the first-order correction to the ground-state wavefunction, we can consider 
c_{2s,1s}^(1) and c_{2p_z,1s}^(1) to be the coefficients for mixing in 2s and 2p_z character. 
Predict the signs of these coefficients and sketch their effects on the 
ground-state wavefunction. 

c) Predict the sign of W^(2) and explain your reasoning. 
12-10. What is the Hückel MO π-electron energy to first order of the molecule (I) if 
C4 is perturbed by H′ = 0.1β? (See Appendix 6 for Hückel data.) 


12-11. Consider the fulvene molecule (II) in the HMO approximation (see Appendix 6). 
At which carbon will a perturbation involving α affect the total π-electron energy 
the least? Calculate to first order the energy of fulvene with α_6 = α + 0.5β. If 
a computer and Hückel program are available to you, calculate the energy for 
this perturbed molecule directly and compare with your first-order result. 


12-12. Produce an expression for π_{k,k}, the atom-atom self-polarizability, from 
Eq. (12-70). Can this, like π_{k,l}, have either sign? What can you infer about the 
sign of δq_k when δα_k is positive? Discuss the physical sense of this. 


12-13. Using data from Example 12-6, calculate the self-atom polarizability, π_{1,1}, for 
carbon 1 in naphthalene. 


12-14. Consider the hexatriene system. (See Appendix 6 for data.) One of the two 
central atoms is perturbed so that α becomes replaced by α + Aβ on that one 
atom only. 

a) What is the energy change, to first order, of the total π energy of hexatriene? 
b) Which of the occupied MOs is perturbed the most (i.e., has the largest first-order 
contribution to the energy) by this perturbation? 


12-15. Answer the following questions from examination of HMO data for benzcyclopentadienyl 
radical, C9H7, given in Appendix 6. 

a) What is the total π energy to first order if atom 5 is replaced by a new atom 
X which contributes the same number of π electrons as carbon did and has 
α_X = α + 0.5β and also has β_CX = 0.9β? 

b) For the positive ion of this system, how do you think the self-atom polarizability 
should compare qualitatively at atoms 1 and 2? Make your prediction 
by inspection, rather than computation, and explain your thinking. 


12-16. Use simple Hückel MOs for butadiene to calculate to first order the change in 
π energy that would result from closing cis-butadiene to cyclobutadiene (III). 
Repeat the approach for closing hexatriene to benzene (IV). Which of these 
two systems benefits most from cyclic as opposed to linear topology? For each 
system, compare your energy to first order with the actual Hückel energy (see 
Appendix 6 for data). 


12-17. The unperturbed Hückel MO energy levels of the allyl system are sketched in 
Fig. P12-17. The system is perturbed so that H22 = α + cβ where c is positive. 

a) Sketch the effects of this perturbation, to first order, on the energy levels. 

b) Calculate to first order the perturbed MOs φ1, φ2, φ3 in terms of the 
unperturbed MOs ψ1, ψ2, ψ3. Sketch the results in a manner that makes 
clear the nature of the change in each MO. 


Figure P12-17 » Unperturbed allyl HMO energy levels, from highest to lowest: α − √2β, α, α + √2β. 
The carbon atoms are numbered 1, 2, 3 along the chain. 


12-18. Calculate the atom–atom polarizabilities π_{1,2} and π_{1,3} for the allyl cation. What 
do your results indicate will happen to the π-electron densities at atoms 2 and 
3 if atom 1 becomes more attractive? 


12-19. The cyclopropenyl system is perturbed so that the Hückel matrix element H22 = 
α + cβ, where c is positive. 

a) Ascertain, by calculation or inspection, the appropriate zeroth-order degenerate 
wavefunctions for this situation. 
b) Sketch the effects of the perturbation on the orbital energies to first order. 


12-20. Given the HMO data in Appendix 6 for the cyclopentadienyl radical, calculate 
to first order all the orbital energies you would obtain if atom 2 were changed 
so that α2 became α + 0.5β. 


12-21. Imagine two allyl radicals coming together to form benzene. Each allyl has 
three π MOs and benzene has six. 

a) Sketch an energy level diagram showing how interaction between allyl 
orbitals can be related to the MO energies of benzene, at the simple Hückel level. 

b) Show with sketches how the three lowest-energy zeroth-order allyl combination 
orbitals are related to the three lowest-energy benzene MOs. 

c) Estimate to first order the energies of the benzene-like MOs coming from 
the nonbonding MOs of the allyl pair. Compare these to the relevant MO 
energies of benzene. (Remember that, when combining fragments, you must 
renormalize the functions.) 

d) What is the energy to first order of the lowest MO of the allyl combination, 
and how does it compare to the lowest benzene orbital energy? 


12-22. Discuss the possible transitions π3 ← π2 and π4 ← π2 in butadiene. (Here π1 is 
the lowest-energy MO, etc.) 

a) Are they dipole-allowed transitions? 
12-23. Calculate the dipole moment in the z direction for the states of Eq. (12-81). 
Now construct a variational function of the form 

φ = cos(α) 2s + sin(α) 2p_z 

and maximize the z component of the dipole moment as a function of α. Are the 
states of Eq. (12-81) those of maximum dipole magnitude? Discuss why this is reasonable. 


12-24. The benzyl radical, C7H7 (V), has the Hückel energies and ground-state configuration 
given in Table P12-24. 
The radical is trapped and oriented in an external reference system as shown. 
Light polarized in the x direction is beamed on the system, and the frequency 
varied until an absorption is observed. Assuming this to result from excitation 
of the unpaired electron, to which level has the electron been promoted? 


TABLE P12-24ᵃ » 

Level   Energy    Electrons   c1       c2      c3       c4       c5       c6       c7 
7       −2.10β    0           −0.238   0.500   −0.406   0.354    0.336    0.354    −0.406 
6       −1.26β    0           −0.397   0.500   −0.116   −0.354   0.562    −0.354   −0.116 
5       −1.00β    0           0        0       −0.500   0.500    0        −0.500   0.500 
4       0.00      1           −0.756   0       0.378    0        −0.378   0        0.378 
3       1.00β     2 
2       1.26β     2 
1       2.10β     2 

ᵃ Coefficients for 2p_z basis functions forming the four highest MOs are listed. Overlap has been 
assumed to be negligible. 


12-25. The simple Hückel energies, occupation numbers, and coefficients for MOs 
in naphthalene (VI) are listed in Table P12-25. Assume that a single crystal 
of naphthalene is oriented so that each molecule is aligned with respect to an 
external coordinate system (VI). 

a) Light polarized in the x direction is beamed on the crystal. Assuming that 
the electron is excited from the highest occupied MO, to which empty MOs 
could it be promoted by the x-polarized light? 

b) Which transitions from the highest occupied MO would be allowed for 
y-polarized light? 

c) Which ones would be allowed for z-polarized light? 

d) Are any transitions from the highest occupied MO forbidden for nonpolarized 
light? 
12-26. A hydrogen atom in the 2s state is metastable, but, when passed through an 
electric field, its tendency to relax to the 1s state is greatly enhanced. Explain. 


12-27. Using EHMO data for formaldehyde given in the Problems section of Chap- 
ter 10, decide which empty MOs can be reached by allowed dipolar transitions 
of an electron out of MO 7. For each allowed transition, state the polarization 
with respect to the molecular axis. 


12-28. Square cyclobutadiene has a degenerate pair of nonbonding MOs. Calculate 
the energies of these MOs, to first order, when two carbons are replaced by 
atoms that contribute the same number of π electrons as carbon did and are 
treated at the HMO level by replacing α with α + 0.5β, and the substitutions 
occur at a) opposite and b) adjacent positions. Discuss your results in terms 
of cooperative vs. competing perturbations. Which compound should be more 
stable? 


12-29. Calculate to first order the HMO energy changes for the occupied MOs of 
benzene that occur when the molecule is stretched so that an opposing pair of 
bonds have resonance integrals of 0.9β instead of 1.0β. 


12-30. The intensity of a transition between initial (ψ_i) and final (ψ_f) states is dependent 
upon the transition dipole moment: μ_if = e ∫ ψ_i r ψ_f dτ. Calculate μ_if in 
atomic units for 2p_0 ← 1s in a hydrogen-like ion having atomic number Z. 
Assume that the electron has the same spin in both states. (See Appendix ?? 
for the definition of 1 a.u. of electric dipole moment.) 


Multiple Choice Questions 


(Try to answer these without referring to the text.) 
1. For the n = 2 states of a hydrogen atom in a uniform electric field of strength F, 


a) the first-order correction to the energy equals zero for all four states. 

b) there are four different values of the energy to first order. 

c) the first-order correction to the energy is proportional to F for two states. 
d) the first-order correction to the energy is proportional to F² for two states. 
e) None of the above is a true statement. 


2. Which one of the following statements is true for a hydrogen atom in its ground 
state that is placed in a uniform z-directed electric field? 


a) The first-order correction to the energy equals zero. 

b) The proper zeroth-order wavefunctions are (1/√2)(1s ± 2p_z). 

c) The second-order correction to the energy is positive. 

d) The first-order correction to the wavefunction contains some 2s AO. 
e) None of the above statements is true. 
3. The hydrogen atom transition 3d, «— 2 py is 


a) dipole allowed and z-polarized. 
b) dipole allowed and x-polarized. 
c) dipole allowed and y-polarized. 
d) dipole allowed and x y-polarized. 
e) forbidden. 


4. Which of the following integrals over all space vanish by symmetry for a hydrogen 
atom with nucleus at the coordinate origin? 1) f 1s y2p;dv 2) f 2pz z3dy,dv 
3) 42pn9 x30 dv 


a) 1 only. 

b) 1 and 2 only. 
c) 1 and 3 only. 
d) 2 and 3 only. 
e) 1,2, and 3. 
Chapter 13 


Group Theory 


13-1 Introduction 


It should be evident by this point that symmetry requirements impose important con- 
straints on MOs, and that one can often use symmetry arguments to tell what the MOs 
of a molecule must look like, in advance of calculation. We have also seen that one 
can often use symmetry arguments to tell whether or not an integral vanishes. Thus, 
we have made much use of symmetry already, but in an informal way. The formal 
use of symmetry, through group theory, will be described in this chapter. Knowledge 
of group theory augments one's power to use symmetry as a shortcut, and familiarity 
with formal symmetry notation is necessary to follow the literature of many areas of 
chemistry, particularly inorganic and quantum chemistry, and molecular spectroscopy. 


13-2 An Elementary Example 


Consider the following four operations, or commands: 

1. Left face (L). 
2. Right face (R). 
3. About face (A). 
4. Remain as you are (E). 


These four operations constitute a group in the mathematical sense, and provide a 
convenient example with which to illustrate some definitions and terminology of group 
theory. The operations are called the elements of the group. Because there are four 
elements, this group is said to be of order four. 

How do we know that these four operations constitute a group? Before answering this 
question, it is necessary to describe what is meant by a product of elements. Products 
of group elements are written in the usual algebraic manner. That is, the symbols are 
written side by side without an algebraic symbol. Just as xy is understood to be the 
product of x and y, LR is understood to be the product of “left face" and “right face.” 
But elements of a group need not commute, and so the ordering of symbols in a product 
is important in group theory. The sequence of operations is understood to read from 
right to left in the product. Thus, L R means "right face" followed by “left face.” 

If we imagine a drill soldier carrying out the sequence of operations implied by 
LR, the result is a return of the soldier to his original position. That is, the product of 
TABLE 13-1 » Products of Elements in the Group of Four Commands 


                         First operation 

Second operation     E     L     R     A 
E                    E     L     R     A 
L                    L     A     E     R 
R                    R     E     A     L 
A                    A     R     L     E 


operations L R gives the same result as the single operation E. This is written L R = E. 
It is possible to write similar equations for all the product combinations in our group, 
and to arrange the results as in Table 13-1. An important fact to notice from this group 
multiplication table is that there is no sequence of operations in the group that cannot 
be accomplished by a single operation. All the possible positions open to the drill 
soldier through a sequence of two moves are also achievable through a single move. 
In other words, no product of elements takes us out of the group—the group exhibits 
closure. 

The reciprocal, or inverse of an operation is that subsequent operation which returns 
the soldier to his original position. Hence, L is the reciprocal of R, which is expressed 
as LR = E, or L = R⁻¹. Examination of Table 13-1 shows that every column has E 
appearing once, which means that every element of our group has an inverse in the 
group. 

The associative law is obeyed if, in general, the sequence of operations A(BC) = 
(AB)C, where the parentheses enclose the pair of elements the product of which is to 
be evaluated first. We can test whether our elements satisfy this law by trying out all the 
combinations. For example L(RA) = LL = A and (LR)A = EA = A. One can quickly 
show that this group satisfies the associative law for all combinations of elements. 

We are now in a position to define a group. A group is a set of elements that meets 
the following requirements: 


1. The group contains the identity element (traditionally symbolized E) which corre- 
sponds to “make no change." 


2. The group exhibits closure with respect to “multiplication.” The product of any two 
elements in the group is a single element in the group. 


3. There is a reciprocal in the group for every element of the group. 
4. The elements of the group obey the associativity law (AB)C = A(BC). 


We mentioned that, in general, group elements need not commute. That is, it is 
possible that AB ≠ BA. However, in this example, the elements do all commute. This 
results in Table 13-1 being symmetric about the main diagonal. Groups in which all 
the elements commute are called abelian groups. 
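
The four requirements can be checked mechanically for the drill-soldier group. The sketch below is one possible encoding and not the only one: each command is represented by the number of quarter-turns to the left that it produces (an assumption made here for convenience), so that a product is addition modulo 4. The names TURNS and mul are illustrative.

```python
from itertools import product

# Number of quarter-turns to the left produced by each command.
TURNS = {"E": 0, "L": 1, "A": 2, "R": 3}
NAME = {turns: name for name, turns in TURNS.items()}


def mul(second, first):
    """Product 'second first': perform `first`, then `second`."""
    return NAME[(TURNS[first] + TURNS[second]) % 4]


G = list(TURNS)
closure = all(mul(a, b) in G for a, b in product(G, G))
identity = all(mul("E", a) == a == mul(a, "E") for a in G)
inverses = {a: next(b for b in G if mul(b, a) == "E") for a in G}
associative = all(mul(a, mul(b, c)) == mul(mul(a, b), c)
                  for a, b, c in product(G, G, G))
abelian = all(mul(a, b) == mul(b, a) for a, b in product(G, G))

if __name__ == "__main__":
    print(closure, identity, inverses, associative, abelian)
```

With this encoding mul("L", "R") gives "E" and mul("L", "L") gives "A", matching the corresponding entries of Table 13-1.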
13-3 Symmetry Point Groups 


Our example of four operations of a drill soldier differs from the kinds of groups most 
commonly used in quantum chemistry because it is not a symmetry group. Assuming 
that the soldier begins by facing north, we are able to distinguish four distinct possible 
subsequent orientations for him (facing the four compass directions). After a symmetry 
operation, an object is understood to be indistinguishable from the object before the 
operation. 

Whether or not an operation is a symmetry operation must be decided within the 
context of the object being operated on. For example, a square (devoid of identifying 
marks enabling us to distinguish one corner from another; see Fig. 13-1a) may be 
rotated by 90° about an axis perpendicular to the center (Fig. 13-1b), and be indistinguishable 
from its starting configuration. However, rotation by 120° does not lead to 
an indistinguishable configuration (Fig. 13-1c). 

Therefore, rotation by 90° is a symmetry operation for a square, but rotation by 120° 
is not. However, for an equilateral triangle, rotation by 90° is not a symmetry operation, 
while rotation by 120° is. For a circle, both rotations are symmetry operations. The 
only operation that is a symmetry operation for every object is the identity operation E. 
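
The square and triangle statements can be tested numerically. This is a minimal sketch in which a regular polygon is represented only by its vertices (so the circle is not covered), and a rotation counts as a symmetry operation if every rotated vertex lands on some original vertex. The function names and the tolerance are illustrative choices.

```python
import math


def polygon(n):
    """Vertices of a regular n-gon centered at the origin."""
    return [(math.cos(2 * math.pi * k / n), math.sin(2 * math.pi * k / n))
            for k in range(n)]


def rotate(point, degrees):
    """Rotate a point about the origin by the given angle."""
    t = math.radians(degrees)
    x, y = point
    return (x * math.cos(t) - y * math.sin(t),
            x * math.sin(t) + y * math.cos(t))


def is_symmetry_rotation(vertices, degrees, tol=1e-9):
    """True if the rotation maps the vertex set onto itself."""
    def close(p, q):
        return math.hypot(p[0] - q[0], p[1] - q[1]) < tol
    return all(any(close(rotate(p, degrees), q) for q in vertices)
               for p in vertices)


if __name__ == "__main__":
    for name, n in (("square", 4), ("triangle", 3)):
        for angle in (90, 120):
            print(name, angle, is_symmetry_rotation(polygon(n), angle))
```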

Another sort of symmetry operation pertains to infinite networks such as occur in 
idealized models of crystals. Here we can define operations that move an infinite 
line of cells or atoms to the left or right by a unit number of “steps,” producing a 
configuration indistinguishable from the initial one (Fig. 13-2). Symmetry operations 
can be usefully categorized into those that leave at least one point in space unmoved 
(rotations, reflections, inversion) and those that do not (translations). The latter category 
is of importance in the fields of crystallography and solid-state physics. The former 
category is useful when we deal with systems that do not undergo unending periodic 
repetition in space, and it is with this category that we will be concerned in this chapter. 

If we pick some object or shape of finite size and construct a group from symmetry 
operations for that shape, we have a symmetry point group for that object. We now 
illustrate how this is done for a ball-and-stick model of the ammonia molecule in its 
equilibrium nuclear configuration. It is not difficult to find operations that do no more 
than interchange identical hydrogen nuclei. There are a number of possible rotations 
about the z axis (Fig. 13-3). One could rotate by 120°, 240°, 360°, 480°, etc., either 


Figure 13-1 » Effects of rotations on a square: (a) starting configuration, (b) after rotation by 90°, 
(c) after rotation by 120°. 


Figure 13-2 » Segment of an infinite repeating sequence of identical “cells” (a move to the right 
by two units is indicated). 


Figure 13-4 » Three symmetry reflection planes for ammonia. 


clockwise or counterclockwise. Then there are three reflection planes. One of these 
is the xz plane (labeled “A” in Fig. 13-4) and the other two are planes containing the 
z axis and one of the other N-H bonds. Also there is the identity operation E, which 
we know that we need to satisfy the general requirements for a group. 

There is a problem with the rotations, and this is that we could produce an infinite 
number of them. How do we know when to stop? The answer is that we select only 
those that avoid “redundancies” among our indistinguishable configurations. Redundancies 
occur when two operations lead to “identical” indistinguishable configurations. 
For example, rotation by 360° is redundant with the identity operation because both 
operations leave the protons (which we imagine for the moment to be distinguishable) 
in the same locations. Similarly, rotation clockwise by 240° is redundant with rotation 
counterclockwise by 120° (Fig. 13-5). If we remove all such redundancies, we are left 
with only two rotations. 

There is arbitrariness as to how we wish to describe these. If we let the first be 
clockwise rotation by 120°, the second could be described as either clockwise rotation 


Figure 13-5 » Identical configurations result from the two rotations shown (counterclockwise 
by 120° and clockwise by 240°). 


by 240° or counterclockwise rotation by 120°. We will use the latter description. (Note 
that the clock is assumed to face +∞ on the axis about which rotation occurs; see 
Fig. 13-3.) 

Our list of symmetry operations is now as follows: 


1. Identity (E). 
2. Reflection through xz plane (A). 
3. Reflection through plane B (B). 
4. Reflection through plane C (C). 
5. 120° clockwise rotation about z axis (D). 
6. 120° counterclockwise rotation about z axis (F). 


We now proceed to set up the multiplication table for these operations. It is sug- 
gested that the reader do this as an exercise, checking the result against Table 13-2. 
Construction of the multiplication table is more challenging in this case than in our 
earlier example. 

Let us now use this table to see if the requirements for a group are satisfied by these 
symmetry elements. Since every product of two operations is equivalent to one of the 
operations of the set (i.e., since every spot in the table is occupied by one of our six 
symbols) the set exhibits closure. Also, the identity element is present in the set and 
occurs once in each column and row (Problem 13-2), and so every element has an 
inverse in the set. The associative law is satisfied, as one can establish by trying various 
examples, e.g., D(CB) = DF = E, (DC)B= BB = E, so that D(C B) = (DC)B. (In 
general, symmetry operations satisfy the associativity law.) Therefore, our set of six 
symmetry operations constitutes a symmetry point group of order 6. 

Inspection of Table 13-2 reveals that it is not symmetric across its principal diagonal. 
For example, CF = B, and FC = A; this is not an abelian group. One might inquire 
whether this group is the smallest that we can set up for the ammonia model. Inspection 
of the multiplication table should convince the reader that the following subsets meet 
the requirements for a group: E; E, D, F; E, A; E, B; E, C. These subgroups can be 


TABLE 13-2 » Multiplication Table for 
Symmetry Operations of an Ammonia Molecule 


                             First operation 

Second operation     E     A     B     C     D     F 
E                    E     A     B     C     D     F 
A                    A     E     D     F     B     C 
B                    B     F     E     D     C     A 
C                    C     D     F     E     A     B 
D                    D     C     A     B     F     E 
F                    F     B     C     A     E     D 
distinguished from the full group by virtue of the fact that none of them exhausts all 
the possible physically achievable, nonredundant, indistinguishable configurations for 
the molecule. 
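
The group properties and the list of subgroups can be verified by brute force. In the sketch below each operation is modeled as a permutation of the three hydrogen positions. The particular assignment of permutations to the letters A through F is an assumption, chosen only so that the products quoted in the text (CF = B, FC = A) come out right; it is not read off the figures.

```python
from itertools import combinations, product

# Each operation sends the nucleus at position i to position OPS[name][i].
OPS = {
    "E": (0, 1, 2),
    "A": (1, 0, 2),
    "B": (2, 1, 0),
    "C": (0, 2, 1),
    "D": (2, 0, 1),
    "F": (1, 2, 0),
}
NAME = {perm: name for name, perm in OPS.items()}


def mul(second, first):
    """Product 'second first': perform `first`, then `second`."""
    f, s = OPS[first], OPS[second]
    return NAME[tuple(s[f[i]] for i in range(3))]


def is_group(names):
    """A finite subset containing E that is closed under products is a group."""
    names = set(names)
    return "E" in names and all(mul(a, b) in names
                                for a, b in product(names, names))


others = [n for n in OPS if n != "E"]
subgroups = [set(c) | {"E"}
             for r in range(len(others) + 1)
             for c in combinations(others, r)
             if is_group(set(c) | {"E"})]

if __name__ == "__main__":
    print(mul("C", "F"), mul("F", "C"))        # non-commuting pair
    for s in subgroups:
        print(sorted(s))
```

The search returns the five subsets named in the text together with the full group of order 6.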


EXAMPLE 13-1 BH3, like ammonia, has a three-fold symmetry axis. However, 
BH3 is planar. As a result, there is an extra symmetry operation, reflection through 
the molecular plane, that does not move any nuclei. Thus, both the identity 
operation and this reflection leave all nuclei unmoved. Is this an example of 
redundant operations? 


SOLUTION » These operations are not redundant. Even though all the nuclei stay fixed for 
both operations, reflection interchanges the spaces above and below the plane, whereas the identity 
operation changes nothing. Two operations are redundant only if they move all points in space in 
the same way. 


13-4 The Concept of Class 


Imagine that we are subjecting our ammonia model to the various symmetry operations 
and that someone in a parallel universe is subjecting his ammonia model to symmetry 
operations too. Suppose that the parallel universe differs from ours in that everything is 
reflected through the xz plane. We shall refer to this as “the mirror A universe.” We now 
pose the following problem. Suppose we initially have a particular configuration and 
our “mirror A” man has the corresponding configuration (achieved by reflecting through 
plane A) as shown in Fig. 13-6. If the mirror A man now performs some symmetry 
operation on his model, say, counterclockwise rotation by 120°, F, he ends up with 
the final configuration shown at the lower right of the figure. (We continue to define 
“clockwise” from clocks in our universe.) For us to arrive at the corresponding final 
orientation, what operation must we perform? We symbolize this unknown operation 
X. We can find out what our final configuration must be by taking the reciprocal of 
reflection A on the mirror man's final configuration. (A takes us from here to there; 
$A^{-1}$ takes us from there to here.) But $A^{-1} = A$, so we obtain the result at the lower 
left of the figure. It is evident that X must be D, clockwise rotation by 120°. This is 
not surprising. It is related to the fact that, when you turn counterclockwise before a 
mirror, your image appears to turn clockwise. 

Figure 13-6 » Operation X in this universe parallels operation F in the mirror A universe. 

We can repeat the solution to this problem in a more formal way by requiring that the 
operation X followed by A (into the mirror A universe) bring us to the same configuration 
as operation A followed by F. That is, we seek X such that $AX = FA$. Multiplying 
both sides from the left by $A^{-1}$ gives $X = A^{-1}FA$. This is merely a mathematical 
statement of our above discussion. It says, “What operation is equivalent to the process 
of reflection through plane A followed by counterclockwise rotation by 120° followed 
by the inverse of reflection through plane A?” Using our multiplication table, we see 
that $A^{-1} = A$, and so $X = AFA = AB = D$. Therefore, $D = A^{-1}FA$. 

We can find the mirror A universe equivalents to all the symmetry operations in the 
group in a similar way. Thus, 


$$A^{-1}EA = E, \quad A^{-1}AA = A, \quad A^{-1}BA = C,$$
$$A^{-1}CA = B, \quad A^{-1}DA = F, \quad A^{-1}FA = D$$


There is no need to stop here. We can imagine other parallel universes corresponding 
to reflections B and C and rotations D and F. (The E universe is the one we inhabit.) 
We can use the same sort of technique to accumulate corresponding operations for all 
these universes. The results are given in Table 13-3. 

If we examine this table, we note certain patterns. The operation E in our universe 
corresponds to E in all the universes. The reflections (A, B, and C) always correspond 
to reflections, and the rotations (D and F) always correspond to rotations. This makes 
physical sense. If we “do nothing” (E) in our universe, we expect the people in the other 
universes to do nothing also. If we rotate by 120°, we expect the people to perform 
rotations by 120° too, although not necessarily always in the same direction. The 
argument for reflection is the same. These are examples of three classes of operation. 
The formal definition of class is as follows: If P and Q in a group have the property 
that $X^{-1}PX = P$ or $Q$ and $X^{-1}QX = P$ or $Q$ for all members X in the group, then 
P and Q belong to the same class. Physically, operations in the same class are of the 
“same kind”: all reflections, all rotations, etc. 

TABLE 13-3 » $X = Y^{-1}ZY$ as a Function of Y and Z 
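
The correspondence $X = Y^{-1}ZY$ and the resulting classes can be generated by a short program. The sketch below is a hedged illustration, not part of the original treatment. It assumes a permutation model of the six ammonia operations in which the labels follow the text but the particular permutations are our own choice, picked so that products quoted in the text (for example AB = D and FA = B) hold. The helper names `mul`, `inverse`, `conjugate`, and `classes` are hypothetical.

```python
# Illustrative permutation model (an assumption): position i is moved to p[i].
OPS = {
    "E": (0, 1, 2),
    "A": (0, 2, 1),
    "B": (2, 1, 0),
    "C": (1, 0, 2),
    "D": (1, 2, 0),
    "F": (2, 0, 1),
}
NAME = {perm: name for name, perm in OPS.items()}


def mul(x, y):
    """Product XY: apply Y first, then X."""
    px, py = OPS[x], OPS[y]
    return NAME[tuple(px[py[i]] for i in range(3))]


def inverse(x):
    """Return the operation that undoes X."""
    return next(y for y in OPS if mul(x, y) == "E")


def conjugate(y, z):
    """Return X = Y^-1 Z Y, the partner of Z as seen from the 'Y universe'."""
    return mul(inverse(y), mul(z, y))


def classes():
    """Collect operations that are carried into one another by conjugation."""
    seen, result = set(), []
    for z in OPS:
        if z not in seen:
            cls = {conjugate(y, z) for y in OPS}
            result.append(sorted(cls))
            seen |= cls
    return result


# One row per universe Y; entries are Y^-1 Z Y for Z = E, A, B, C, D, F.
for y in OPS:
    print(y, [conjugate(y, z) for z in OPS])
print(classes())
```

In this model the row for Y = A reproduces the six relations listed earlier for the mirror A universe, and the three classes come out as the identity, the three reflections, and the two rotations.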


EXAMPLE 13-2 BH3, like ammonia, has three planes of reflection symmetry that 
contain the three-fold rotational axis. However, BH3, being planar, also has a 
reflection operation through the molecular plane. Is this reflection operation in the 
same class as the other three? 


SOLUTION » No. In both molecules, the three planes each interchange two hydrogens and 
keep one fixed in space. The extra reflection plane in BH3 keeps all three hydrogens fixed. If we 
reflect so as to keep all hydrogens fixed, there is no way a person in a symmetry-related universe 
could make a corresponding reflection by using a reflection that interchanges two hydrogens and 
keeps one fixed. ◄ 


We shall see that classes are important subdivisions of groups. Note, however, that 
a class need not be a subgroup. For instance, D and F constitute a class, but, as the 
class does not include E, it is not a subgroup. 

In discussing the concept of class, it is unnecessary to postulate parallel universes, 
and the reader should not be disturbed by this pedagogical device. The people in the 
"other universes" are merely working with ammonia models that have been reflected or 
rotated with respect to the model orientation that we chose in Fig. 13-3. Operations in 
the same class are simply operations that become interchanged if our coordinate system 
is subjected to one of the symmetry operations of the group. 


13-5 Symmetry Elements and Their Notation 


There are five kinds of symmetry operations that one can utilize to move an object 
through a maximum number of indistinguishable configurations. One is the trivial 
identity operation E. Each of the other kinds of symmetry operation has an associated 
symmetry element¹ in the object. For example, our ammonia model has three reflection 
operations, each of which has an associated reflection plane as its symmetry element. 
It also has two rotation operations and these are associated with a common rotation 
axis as symmetry element. The axis is said to be three-fold in this case because the 
associated rotations are each one-third of a complete cycle. In general, rotation by 
$2\pi/n$ radians is said to occur about an n-fold axis. Another kind of operation, one we 
have encountered before, is inversion, and it has a point of inversion as its symmetry 
element. Finally, there is an operation known as improper rotation. In this operation, 
we first rotate the object by some fraction of a cycle about an axis, and then reflect it 
through a plane perpendicular to the rotation axis. The axis is the symmetry element 
and is called an improper axis. 

The following two examples should help clarify the nature of these symmetry ele- 
ments. Consider first the ethane molecule in its eclipsed conformation (I). 


¹ This term is not to be confused with a group element. 


[Structure (I): eclipsed ethane, with the threefold axis indicated.] 


The following symmetry elements can be identified. (It is important that you satisfy 
yourself that you see these elements in the sketch.) 


1. One three-fold axis coincident with the C-C bond. 

2. Three two-fold axes perpendicular to the C-C bond and intersecting its midpoint. 
3. Three reflection planes, each containing the C-C bond and a pair of C-H bonds. 
4. One reflection plane perpendicular to the C-C bond and bisecting it. 

5. No point of inversion. 

6. One three-fold improper axis coincident with the C-C bond. 


[Note that these operations are to be applied to the rigid molecule. Movement of 
only one methyl group and not the other (i.e., internal rotation) is not allowed.] 

Now let us consider staggered ethane (II) (i.e., one methyl group rotated 60° from 
its eclipsed position). Its symmetry elements are 

[Structure (II): staggered ethane, with the threefold axis indicated.] 

1. One three-fold axis coincident with the C-C bond. 
2. Three two-fold axes perpendicular to the C-C bond and intersecting its midpoint. 
3. Three reflection planes, each containing the C-C bond and a pair of C-H bonds. 
4. No reflection plane perpendicular to the C-C bond. 
5. One point of inversion at the midpoint of the C-C bond. 
6. One six-fold improper axis coincident with the C-C bond. 


In going from the eclipsed to the staggered conformation, we have lost a reflection 
plane perpendicular to the C-C bond, gained a point of inversion, and changed the 
order of the improper axis. 

Usually it is not difficult to “see” most elements of symmetry in molecules, the 
exception being improper axes, which tend to be a little tricky. The six-fold improper 
rotation in staggered ethane is not too hard to envision when it is applied once. If we 
rotate clockwise by 60° and reflect, each hydrogen on one methyl group replaces a 
hydrogen on the other methyl group. It is a little 
more difficult when we try to imagine applying this operation twice in succession. The 
reader should recognize that two 60° rotations and two reflections amount to a simple 
120° rotation, so that each hydrogen replaces a neighboring hydrogen on its own methyl group. 


EXAMPLE 13-3 BH3 has a three-fold axis and a reflection plane in the molecular 
plane. Does BH3 have a three-fold improper rotation axis as a necessary member 
of its group? 


SOLUTION » A three-fold improper axis coincident with the “regular” three-fold rotational axis 
appears to have the correct symmetry characteristics: It rotates each hydrogen to the next position 
and then interchanges the spaces above and below the plane. Do we need it in the group? Only if 
there is no other single operation that accomplishes the same thing. There is no such operation, so 
this improper axis is indeed a nonredundant symmetry element. ◄ 


A notation for these symmetry elements (and for the operations related to them) has 
come to be generally accepted in chemistry (exclusive of crystallography, which uses a 
different notation). This notation is summarized in Table 13-4. The symmetry elements 
of eclipsed ethane would be, by these conventions, $C_3$, $3C_2$, $\sigma$ (perpendicular to C-C), 
$3\sigma$ (containing C-C), $S_3$. It often occurs that several classes of reflection planes and 
axes for proper or improper rotations are present in a system. In eclipsed ethane, we 
can imagine that the three reflection planes containing the C-C bond might all be in 
the same class, that is, might be interchanged by a symmetry operation. In fact, it is 
easy to see that they are interchanged by rotations about the threefold axis. However, 
none of these could ever be equivalent to the reflection perpendicular to the C-C bond 
because this reflection interchanges the carbons, whereas the others do not. For this 
reason, we keep separate tally of the different classes of reflection in eclipsed ethane, 
rather than simply writing “$4\sigma$.” 

There is a geometric convention that aids discussion of molecules having axes of 
rotation. One looks for a unique axis $C_n$ (usually the axis of highest order) and imagines 
this axis to be vertical (coincident with the z axis). Then a reflection plane perpendicular 
to this axis is horizontal and is labeled $\sigma_h$. Planes containing this axis are necessarily 
vertical and are subdivided into dihedral and vertical planes. Dihedral planes must 
contain the unique reference axis, $C_n$, and must also bisect the angles between two-fold 
axes perpendicular to $C_n$. Such planes are labeled $\sigma_d$. Vertical planes that do not 
bisect two-fold axes² are labeled $\sigma_v$. In eclipsed ethane, the $C_3$ axis is the principal 
axis, so the C-C bond is oriented vertically. Therefore, our symmetry elements are 
labeled $C_3$, $3C_2$, $\sigma_h$, $3\sigma_v$, $S_3$. (The vertical planes are labeled $\sigma_v$ because they contain, 
rather than bisect, the two-fold axes that are perpendicular to the principal axis.) For 
staggered ethane, we have $C_3$, $3C_2$, $i$, $3\sigma_d$, $S_6$. (Here the vertical planes do not contain 
the two-fold axes, but do bisect them. Hence, we label them $\sigma_d$. Making a simple 
sketch will aid in clarifying this distinction.) 

TABLE 13-4 » Symbols for Symmetry Elements and Operations 

Symmetry operation                               Symmetry element             Symbol 
“Do nothing” (identity)                          None                         $E$ 
Rotation by $2\pi/n$ radians                     An n-fold (proper) axis      $C_n$ 
Reflection through plane                         A plane                      $\sigma$ 
Inversion through a point                        A point                      $i$ 
Rotation through $2\pi/n$ radians followed by    An n-fold (improper) axis    $S_n$ 
  reflection through a plane perpendicular 
  to the rotation axis 

Once one has recognized the set of symmetry elements associated with a given 
object, it is a straightforward matter to list the symmetry operations associated with the 
set. Simplest are the operations associated with elements $\sigma$ and $i$, because each such 
element gives rise to only one operation. Proper and improper axes are somewhat more 
complicated. Let us return to our ammonia molecule for illustration of this. There we 
had a threefold axis $C_3$ and we noted that we could rotate by $2\pi/3$ ($C_3^+$) to get one 
configuration, and $4\pi/3$ [$C_3^+ C_3^+ = (C_3^+)^2$] to get another. Alternatively we could choose 
to rotate by $2\pi/3$ and $-2\pi/3$ ($C_3^-$), the latter easily being shown to be equivalent to 
$(C_3^+)^2$. But if we rotate by $(C_3^+)^3$, we return to our original configuration. That is, 
$(C_3^+)^3 = E$. Therefore, a $C_3$ axis produces two unique operations. The reader can easily 
generalize this to the statement that a $C_n$ axis yields $n - 1$ unique symmetry operations 
$C_n, C_n^2, \ldots, C_n^{n-1}$, where we assume rotation to be in the clockwise direction in 
all cases. Benzene has a $C_6$ axis with five associated nonredundant symmetry operations 
($C_6, C_6^2, C_6^3, C_6^4, C_6^5$). Now $C_6^3$ is equivalent to a rotation by $\pi$ radians, an operation 
we normally write as $C_2$. Similarly, $C_6^2 = C_3$. The result of such reductions is the set of 
operations $C_6^+, C_3^+, C_2, C_3^-, C_6^-$ for the $C_6$ axis of benzene. There are several classes of 
rotation here. If we use a compressed notation, we can write this set as $2C_6$, $2C_3$, $C_2$. 
This indicates that the presence of a sixfold axis implies the existence of coincident 
twofold and threefold axes. However, these implied elements are not listed for such a 
system since their operations are all contained in the set of operations of the $C_6$ axis. 
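
The reduction of the powers of a $C_n$ axis to their simplest symbols is easy to mechanize. The following Python sketch is offered only as an illustration; the function names `reduce_rotation` and `axis_operations` and the plain-text labels such as `C3+` are our own conventions, not notation from the text.

```python
from math import gcd


def reduce_rotation(n, k):
    """Reduce C_n^k to the simplest symbol for the same rotation angle."""
    k %= n
    if k == 0:
        return "E"
    g = gcd(n, k)
    m, j = n // g, k // g  # C_n^k = C_m^j with j/m in lowest terms
    if m == 2:
        return "C2"        # a half turn has no sense of rotation
    if j == 1:
        return f"C{m}+"
    if j == m - 1:
        return f"C{m}-"    # one step short of a full turn = one step backward
    return f"C{m}^{j}"


def axis_operations(n):
    """List the n - 1 nonredundant operations generated by a C_n axis."""
    return [reduce_rotation(n, k) for k in range(1, n)]


print(axis_operations(3))  # ['C3+', 'C3-']
print(axis_operations(6))  # ['C6+', 'C3+', 'C2', 'C3-', 'C6-']
```

The second line of output is the benzene set discussed above, which compresses to $2C_6$, $2C_3$, $C_2$.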

Improper axes can also be associated with several symmetry operations. We noted 
earlier that $S_6$, applied twice in succession, results in a simple $2\pi/3$ rotation about 
the $S_6$ axis. In other words, we can write the set $S_6, S_6^2, S_6^3, S_6^4, S_6^5$ as $S_6^+, C_3^+, S_2, C_3^-, S_6^-$. 
We stop at $S_6^5$ because $S_6^6 = E$ due to the combination of $C_6^6$ and an even number of 
reflections. $S_6^3$ is equivalent to $S_2$ because it contains three rotations by $2\pi/6$ and an 
odd number of reflections, and $S_2$ means one rotation by $\pi$ and one reflection. The 
operation $S_2$, however, is easily shown to be equivalent to an inversion, and so we 
have, using a compressed notation, $2S_6$, $2C_3$, $i$ associated with the $S_6$ axis. Since we 
have already explicitly listed the elements $C_3$ and $i$ in our set of elements for staggered 
ethane (or any other system containing an $S_6$ axis) only the $2S_6$ operations are unique 
to the $S_6$ axis. The generalization of this case is that an $S_{2n}$ axis with odd n implies that 
elements $C_n$ and $i$ are also present. Of the $2n - 1$ operations associated with $S_{2n}$, $n - 1$ 
are preempted by the $C_n$ axis and 1 by the element $i$, leaving $n - 1$ operations to be 
attributed to the $S_{2n}$ axis. [If n = 1, we have $S_2$, $C_1$, and $i$ as elements. But $C_1 = E$, so 
we ignore it. There is only one operation here ($2 \cdot 1 - 1 = 1$) and it is preempted by $i$. 
Therefore, $S_2$ has no unique operations and it is not listed as a symmetry element.] For 
$S_{2n}$ with n even, $i$ is not implied and there are n unique operations. 


² In certain cases there will be two geometrically nonequivalent sets of vertical planes that cannot be distinguished 
on the basis of bisecting two-fold axes (e.g., in square-planar XeF4). In such cases the convention is to take the 
$\sigma_v$ planes to be those containing the larger number of atoms. 

EXAMPLE 13-4 How many operations must be associated with $S_4$ in the square-planar 
molecule XeCl4? 

SOLUTION » This molecule possesses a $C_2$ and a $C_4$ axis coincident with $S_4$. $C_2$ usurps $S_4^2$ 
since the latter has two reflections that cancel, making it just like $C_4^2$. ($C_2$ also usurps $C_4^2$.) $S_4^4 = E$. 
Only two operations, $S_4$ and $S_4^3$, survive as unique symmetry operations. ◄ 


In eclipsed ethane, we have the element $S_3$, which generates operations $S_3$, 
$S_3^2$, $S_3^3$, $S_3^4$, $S_3^5$, $S_3^6$, ... In deciding where to stop here, we note that $S_3^3 \neq E$ because we 
have here an odd number of reflections. Therefore, $S_3^3 = \sigma$, where we understand this 
to be reflection through the plane perpendicular to the $S_3$ axis. For eclipsed ethane, 
this is $\sigma_h$. The operation $S_3^6$ has an even number of reflections and is equal to E. How 
about $S_3^4$? This has an even number of reflections, so it is identical to $C_3^4$, which is the 
same as $C_3^3 C_3 = EC_3 = C_3$. Thus, our $S_3$ element has operations that are consistent 
with the presence of $C_3$ and $\sigma_h$ elements. We note that eclipsed ethane was indeed 
found to have these elements. Once again, we allow these elements to preempt their 
operations from $S_3$. This leaves only two unique operations, $S_3^+$ and $S_3^-$, corresponding 
to clockwise and counterclockwise improper rotations by $2\pi/3$ radians. In general, 
$S_{2n+1}$ will occur only when $C_{2n+1}$ and $\sigma$ (perpendicular to $C_{2n+1}$) are also present. 
These preempt a total of $2n + 1$ operations from the total of $2(2n + 1) - 1 = 4n + 1$ 
we can achieve with $S_{2n+1}$ alone; and we are left with $2n$ operations for $S_{2n+1}$. The 
above conclusions are summarized in Table 13-5. 
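
The counting rules for improper axes can be checked by brute force. In the sketch below (an illustration under our own naming, not the book's procedure), the k-th power of an $S_m$ operation is treated as k rotations by 1/m of a turn together with k reflections through the perpendicular plane, so only the parity of k matters for the reflection part.

```python
def improper_axis_operations(m):
    """Classify the powers S_m^k for k = 1 up to the last power before E."""
    period = m if m % 2 == 0 else 2 * m  # smallest k for which S_m^k = E
    labels = {}
    for k in range(1, period):
        turn = k % m       # net rotation in units of 1/m of a full turn
        odd = k % 2 == 1   # True when one net reflection remains
        if not odd:
            labels[k] = "proper rotation"    # preempted by the coincident C axis
        elif turn == 0:
            labels[k] = "sigma_h"            # reflection only
        elif 2 * turn == m:
            labels[k] = "i"                  # half turn plus reflection
        else:
            labels[k] = "improper rotation"  # attributed to the S_m axis itself
    return labels


def unique_count(m):
    """Number of operations left to the S_m axis after preemption."""
    labels = improper_axis_operations(m)
    return sum(label == "improper rotation" for label in labels.values())


for m in (2, 3, 4, 6):
    print(m, improper_axis_operations(m), unique_count(m))
```

For m = 6 the third power is labeled as inversion and two operations remain; for m = 3 the third power is the horizontal reflection and again two remain; for m = 2 nothing remains, in line with the statement that $S_2$ is not listed as a separate element.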

The set of symmetry operations is contrasted to the set of symmetry elements for 
eclipsed and staggered ethane in Table 13-6. Note that there are 12 symmetry operations 
in each case. By setting up the multiplication table for either of these sets of 12 
operations, we can show that the mathematical requirements for a group are satisfied. 
Thus, each of these sets of symmetry operations constitutes a separate group of order 12. 

The multiplication table for the ammonia model is given in Table 13-7 in terms of 
the symmetry symbols just described. (A, B, C are taken to be $\sigma_1$, $\sigma_2$, $\sigma_3$, respectively.) 

It would be possible, using what has been described up to this point, to construct 
a list of symmetry operations for any object we please. Then we could test the set to 
see if it satisfied the various requirements for a group. In practice, this is not done. 
It turns out that (1) only a rather limited number of distinct kinds of symmetry are 
possible, and (2) in each case, the symmetry operations for the object do form a group. 
The practical question, then, is, given an object, to which of the known groups does it 
belong? Once this is settled, we merely look up the tabulated properties of that group 
and save ourselves the effort of working through all the details. 

TABLE 13-5 » Number of Operations Associated with Symmetry Elements 

Element        (Other elements present)            No. operations 
$i$            none                                $1$ 
$\sigma$       none                                $1$ 
$C_n$          none                                $n - 1$ 
$S_{2n}$       n odd: $C_n$ and $i$ present        $n - 1$ 
               n even: $C_n$ present               $n$ 
$S_{2n+1}$     $C_{2n+1}$, $\sigma$                $2n$ 

TABLE 13-6 » Symmetry Operations and Elements in Ethane 

Molecule     Elements        Operations 
Eclipsed                     $E$ 
             $C_3$           $2C_3$ 
             $3C_2$          $3C_2$ 
             $\sigma_h$      $\sigma_h$ 
             $3\sigma_v$     $3\sigma_v$ 
             $S_3$           $2S_3$ 
             Group order: 12 
Staggered                    $E$ 
             $C_3$           $2C_3$ 
             $3C_2$          $3C_2$ 
             $i$             $i$ 
             $3\sigma_d$     $3\sigma_d$ 
             $S_6$           $2S_6$ 
             Group order: 12 

TABLE 13-7 » Multiplication Table for the Ammonia Molecule 
(columns: first operation; rows: second operation) 

$$
\begin{array}{c|cccccc}
 & E & \sigma_1 & \sigma_2 & \sigma_3 & C_3^+ & C_3^- \\ \hline
E & E & \sigma_1 & \sigma_2 & \sigma_3 & C_3^+ & C_3^- \\
\sigma_1 & \sigma_1 & E & C_3^+ & C_3^- & \sigma_2 & \sigma_3 \\
\sigma_2 & \sigma_2 & C_3^- & E & C_3^+ & \sigma_3 & \sigma_1 \\
\sigma_3 & \sigma_3 & C_3^+ & C_3^- & E & \sigma_1 & \sigma_2 \\
C_3^+ & C_3^+ & \sigma_3 & \sigma_1 & \sigma_2 & C_3^- & E \\
C_3^- & C_3^- & \sigma_2 & \sigma_3 & \sigma_1 & E & C_3^+
\end{array}
$$


13-6 Identifying the Point Group of a Molecule 


One can decide to which point group a molecule belongs by systematically looking for 
certain symmetry elements. Each symmetry point group has a unique group symbol, so 
basically one tries to figure out which group symbol is appropriate for a given molecule. 
A flowchart that serves this purpose is displayed in Fig. 13-7. As an illustration, we will 
work out the group symbol for staggered ethane. The first few questions in the flowchart 
check to see whether the molecule belongs to one of several special groups of very 
high symmetry. Since staggered ethane is not linear, and since it does not possess the 
symmetry of any of the regular polyhedra, we move to the question concerning proper 
rotation axes. We have noted earlier that there are several proper rotation axes in this 
molecule. We next consider which one should be selected as the reference axis. The 
axis of highest order is threefold and there is only one of these; therefore, we choose 
it as our reference axis. This defines n as 3 for the remainder of the flowchart, and 
determines the vertical direction. Next we look to see if there is an $S_6$ axis coincident 
with the $C_3$ axis, and we find that there is. Checking to see if symmetry elements other 
than $C_3$, $S_6$, and $i$ exist, we find that they do indeed, and that among them are three 
twofold axes perpendicular to the reference axis $C_3$. This leads us along the “D branch” 
of the diagram, which means that, whatever else we find, the major symbol for our 
group will be D. Next we look for a $\sigma_h$ reflection plane. Finding none, we next seek 
a set of reflection planes containing $C_3$ and also bisecting the various $C_2$ axes. Such 
dihedral planes are present, so our group symbol is $D_{3d}$ for staggered ethane. The 
reader may verify that eclipsed ethane follows the same route out to the D branch, but 
there we do find a $\sigma_h$ (horizontal) plane, and so eclipsed ethane has $D_{3h}$ symmetry. 
A simple exercise, suggested at this point, is to ascertain the symmetry symbol for 
ammonia. (The answer is given in the next section.) 

Figure 13-7 » Flow scheme for group symbols. Box text, in reading order: 

Does the molecule possess six $C_5$ axes? 
Does the molecule possess four $C_3$ axes? ($T$, $T_h$, $T_d$, $O$, or $O_h$; examine character tables to choose among these.) 
Are there one or more proper rotation axes? 
Is there a reflection plane? 
Selection of reference axis $C_n$ (defines vertical direction): Is there an axis having higher order than any other proper axis? (Select this as reference axis $C_n$.) Of the several axes having highest order, is there one which is geometrically unique? (Choose any of the equivalent axes as $C_n$.) 
($C_n$ and n now defined.) Is there an $S_{2n}$ axis collinear with $C_n$? 
Are there any symmetry elements present other than $C_n$, $S_{2n}$, or $i$? 
Are there n twofold axes perpendicular to $C_n$? No: C branch. Yes: D branch. 
C branch: Is there a reflection plane ($\sigma_h$) perpendicular to $C_n$? Is there a set of n reflection planes ($\sigma_v$) containing $C_n$? 
D branch: Is there a reflection plane ($\sigma_h$) perpendicular to $C_n$? Is there a set of n reflection planes containing $C_n$ and bisecting the n twofold axes? 
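
The part of the flow scheme that follows selection of the reference axis lends itself to a few lines of code. The sketch below is a simplified illustration, not a transcription of Fig. 13-7: it skips the special high-symmetry groups and the case with no proper axis, takes the yes/no answers as arguments whose names are our own (hypothetical), and uses the standard group symbols for the outcomes not worked through in the text.

```python
def group_symbol(n, s2n_only=False, n_perp_c2=False,
                 sigma_h=False, n_vertical_planes=False):
    """Return a group symbol once the reference axis C_n has been chosen.

    All flags are answers supplied by the user (hypothetical argument names):
      s2n_only          -- a collinear S_2n exists and there is nothing
                           beyond C_n, S_2n, and i
      n_perp_c2         -- n twofold axes perpendicular to C_n
      sigma_h           -- a reflection plane perpendicular to C_n
      n_vertical_planes -- n reflection planes containing C_n
                           (dihedral on the D branch, vertical on the C branch)
    """
    if s2n_only:
        return f"S{2 * n}"
    branch = "D" if n_perp_c2 else "C"
    if sigma_h:
        return f"{branch}{n}h"
    if n_vertical_planes:
        return f"{branch}{n}" + ("d" if branch == "D" else "v")
    return f"{branch}{n}"


# Staggered ethane: three perpendicular C2 axes, no sigma_h, dihedral planes.
print(group_symbol(3, n_perp_c2=True, n_vertical_planes=True))
# Eclipsed ethane: three perpendicular C2 axes and a sigma_h plane.
print(group_symbol(3, n_perp_c2=True, sigma_h=True, n_vertical_planes=True))
# Ammonia: no perpendicular C2 axes, no sigma_h, three vertical planes.
print(group_symbol(3, n_vertical_planes=True))
```

Testing the horizontal plane before the vertical planes mirrors the order of the questions on each branch of the flow scheme.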


EXAMPLE 13-5 What is the group symbol for (planar) BH3? 

SOLUTION » Starting at the top of Fig. 13-7, the answers “no, no, no, yes” bring us to the box 
for selection of the reference axis. There is one three-fold (proper) rotational axis, and no other of 
higher order, so this defines n (= 3) and “vertical.” Coming out of the box we get answers “no, 
yes, yes,” giving $D_{3h}$ as the group symbol. Notice that we arrive at this conclusion without having 
had to use all the symmetry elements of the molecule ($S_3$ or vertical reflection planes). ◄ 


13-7 Representations for Groups 


We have seen that the group of symmetry operations for the ammonia molecule leads 
to a particular group table of product operations (Table 13-7). Let us now see if we 
can assign a number or matrix to each symmetry operation such that the products of 
numbers satisfy the same group multiplication table relationships as do the products 
of symmetry operations. If we can find such a set of numbers or matrices, we say we 
have a representation for the group. 

It is always possible to produce a trivial representation by simply assigning the 
number +1 to each operation. Then any operator relation, such as $\sigma_1\sigma_2 = C_3^+$, is 
necessarily satisfied by the numbers since $1 \cdot 1 = 1$. Nontrivial representations exist too 
(except for the $C_1$ group). For example, the group for molecules (like ammonia) having 
$C_{3v}$ symmetry can be represented by the set of numbers +1, −1, −1, −1, +1, +1, 
respectively, for the operations $E$, $\sigma_1$, $\sigma_2$, $\sigma_3$, $C_3^+$, $C_3^-$ in Table 13-7. Then the relation 
$\sigma_2 C_3^+ = \sigma_3$ is paralleled by the representation since (−1)(+1) = −1, etc. It is also 
possible to find a representation for the $C_{3v}$ group wherein each operation corresponds 
to a 2 × 2 matrix. This is shown in Table 13-8, along with the two other representations 
already mentioned. The reader is encouraged to test that, for instance, $\sigma_2 C_3^+ = \sigma_3$ in 
this matrix representation. 
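
The suggested test of the matrix representation can be carried out for every product at once. The following Python sketch is illustrative: the 2 × 2 matrices are entered as we read them from Table 13-8, the labels `s1`, `s2`, `s3` stand for the three reflections, and the helper names are hypothetical. Each product of two matrices is matched against the six matrices to recover the operation it represents.

```python
from math import sqrt, isclose

h, s = 0.5, sqrt(3) / 2
# Two-dimensional matrices as read from Table 13-8 (each given row by row).
GAMMA3 = {
    "E":   ((1, 0), (0, 1)),
    "s1":  ((1, 0), (0, -1)),
    "s2":  ((-h, s), (s, h)),
    "s3":  ((-h, -s), (-s, h)),
    "C3+": ((-h, s), (-s, -h)),
    "C3-": ((-h, -s), (s, -h)),
}


def matmul(a, b):
    """Multiply two 2 x 2 matrices stored as nested tuples."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )


def identify(m):
    """Return the label of the operation whose matrix equals m."""
    for name, g in GAMMA3.items():
        if all(isclose(m[i][j], g[i][j], abs_tol=1e-12)
               for i in range(2) for j in range(2)):
            return name
    raise ValueError("product is not one of the six matrices")


def product_name(x, y):
    """Label of the product XY (the matrix of Y acts first)."""
    return identify(matmul(GAMMA3[x], GAMMA3[y]))


labels = list(GAMMA3)
print("     " + " ".join(f"{c:>4}" for c in labels))
for r in labels:
    print(f"{r:>4} " + " ".join(f"{product_name(r, c):>4}" for c in labels))
```

The printed array has the same layout as the group multiplication table, with the second operation labeling the rows, so the two can be compared entry by entry.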


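TABLE 13-8 » Representations for the $C_{3v}$ Group 

$$
\begin{array}{c|cccccc}
 & E & \sigma_1 & \sigma_2 & \sigma_3 & C_3^+ & C_3^- \\ \hline
\Gamma_1 & 1 & 1 & 1 & 1 & 1 & 1 \\
\Gamma_2 & 1 & -1 & -1 & -1 & 1 & 1 \\
\Gamma_3 &
\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} &
\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} &
\begin{pmatrix} -\tfrac{1}{2} & \tfrac{\sqrt{3}}{2} \\ \tfrac{\sqrt{3}}{2} & \tfrac{1}{2} \end{pmatrix} &
\begin{pmatrix} -\tfrac{1}{2} & -\tfrac{\sqrt{3}}{2} \\ -\tfrac{\sqrt{3}}{2} & \tfrac{1}{2} \end{pmatrix} &
\begin{pmatrix} -\tfrac{1}{2} & \tfrac{\sqrt{3}}{2} \\ -\tfrac{\sqrt{3}}{2} & -\tfrac{1}{2} \end{pmatrix} &
\begin{pmatrix} -\tfrac{1}{2} & -\tfrac{\sqrt{3}}{2} \\ \tfrac{\sqrt{3}}{2} & -\tfrac{1}{2} \end{pmatrix}
\end{array}
$$

TABLE 13-9 » A Two-Dimensional Unit Matrix Representation for the $C_{3v}$ Group 

$$
\begin{array}{c|cccccc}
 & E & \sigma_1 & \sigma_2 & \sigma_3 & C_3^+ & C_3^- \\ \hline
\Gamma' &
\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} &
\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} &
\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} &
\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} &
\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} &
\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}
\end{array}
$$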


The symbol $\Gamma_i$ is often used to stand for the ith representation of a group. $\Gamma_i(R)$ 
refers to the representation of the particular operation R in the ith representation. Thus, 
$\Gamma_2(\sigma_3) = -1$. In our ammonia example, $\Gamma_1$ and $\Gamma_2$ are one-dimensional representations 
and $\Gamma_3$ is two-dimensional. The representation $\Gamma_1$ is really the simplest member of a set 
of equally trivial representations, namely all representations made from unit matrices. 
Thus, the representation $\Gamma'$ (Table 13-9) is just as successful a representation as is 
$\Gamma_1$. We could obviously use unit matrices of arbitrary dimension in constructing a 
representation. We can think of such representations as being “built up” from $\Gamma_1$: 

$$\Gamma'(R) = \begin{pmatrix} \Gamma_1(R) & 0 \\ 0 & \Gamma_1(R) \end{pmatrix}, \text{ etc.}$$


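It is possible to extend this approach by building up representations from mixtures 
of $\Gamma_1$, $\Gamma_2$, $\Gamma_3$. Thus, 

$$\Gamma''(R) = \begin{pmatrix} \Gamma_2(R) & 0 & 0 \\ 0 & \Gamma_1(R) & 0 \\ 0 & 0 & \Gamma_3(R) \end{pmatrix}$$

(a 4 × 4 matrix, since $\Gamma_3(R)$ is a 2 × 2 block) 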

would produce the representation given in Table 13-10. It is easy to show that these 
matrices multiply together in a way that parallels the group table for symmetry operations, 
since block diagonal matrices always multiply in a block-for-block manner. 
Hence, the upper left 1 × 1 blocks ($\Gamma_2$) multiply among themselves, the 1 × 1 blocks 
in the 2,2 positions ($\Gamma_1$) multiply among themselves, and the 2 × 2 blocks at the lower 
right ($\Gamma_3$) multiply among themselves. Since each of these sets conforms to the group 
table, it follows that their composite $\Gamma''$ will also. It is evident that there is no limit to 
the number of representations we could build up in this manner. 

TABLE 13-10 » A $C_{3v}$ Group Representation Built from Other Representations 

$$
\Gamma''(E) = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix}, \quad
\Gamma''(\sigma_1) = \begin{pmatrix} -1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & -1 \end{pmatrix}, \quad
\Gamma''(\sigma_2) = \begin{pmatrix} -1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -\tfrac{1}{2} & \tfrac{\sqrt{3}}{2} \\ 0 & 0 & \tfrac{\sqrt{3}}{2} & \tfrac{1}{2} \end{pmatrix}
$$

Representations like $\Gamma'$ and $\Gamma''$ are called reducible representations because they 
are composites of one or more smaller-dimensional representations and hence can be 
decomposed into those smaller representations. Any representation that cannot be 
reduced, or decomposed, into smaller representations is said to be irreducible. 

How can we tell whether a given representation is reducible or not? If it is already 
one dimensional, it is obviously irreducible. If it is multidimensional and, like $\Gamma'$ or 
$\Gamma''$, uniformly block diagonalized throughout the whole set of symmetry operations, 
it is plainly reducible. The trouble comes when it is multidimensional but not block 
diagonalized, for example, $\Gamma_3$. It is sometimes the case that such representations are 
reducible. To illustrate this point, let us go back to $\Gamma''$, which we know is reducible, and 
imagine transforming this representation by multiplying every matrix from the right by 
$\beta$ and from the left by $\beta^{-1}$, where $\beta$ is some 4 × 4 unitary matrix (see Chapter 9). This 
will give us a new set of six 4 × 4 matrices that no longer will necessarily be block 
diagonalized. Yet it is easy to show that these new matrices, $\Gamma'''$, are still a representation 
for our group. We prove this by showing that, if three matrices from $\Gamma''$ (call them $A''$, 
$B''$, $C''$) are related by $A''B'' = C''$, then the corresponding transformed matrices $A'''$, 
$B'''$, $C'''$ satisfy $A'''B''' = C'''$. If this is true, then the ability of $\Gamma''$ to correspond to the 
group table relationships is retained by $\Gamma'''$. The proof is as follows: 

Let 

$$A''B'' = C''$$

Then 

$$A'''B''' = \beta^{-1}A''\beta\,\beta^{-1}B''\beta = \beta^{-1}A''B''\beta = \beta^{-1}C''\beta = C'''$$


The point we are trying to make here is that a reducible representation like $\Gamma''$ can 
be put forth in many guises, each corresponding to a different choice of $\beta$, and many 
of these will not be block diagonal in form. (Representations that differ only by a 
unitary transformation are said to be equivalent. Thus, $\Gamma''$ and $\Gamma'''$ are equivalent 
representations.) To show that such a representation as $\Gamma'''$ is reducible and also to 
reveal its component representations, we could back transform it by finding the matrix 
$\beta$ and calculating 

$$\beta\,\Gamma'''\beta^{-1} = \beta\beta^{-1}\,\Gamma''\,\beta\beta^{-1} = \Gamma''$$


TABLE 13-10 (continued) 

$$
\Gamma''(\sigma_3) = \begin{pmatrix} -1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -\tfrac{1}{2} & -\tfrac{\sqrt{3}}{2} \\ 0 & 0 & -\tfrac{\sqrt{3}}{2} & \tfrac{1}{2} \end{pmatrix}, \quad
\Gamma''(C_3^+) = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -\tfrac{1}{2} & \tfrac{\sqrt{3}}{2} \\ 0 & 0 & -\tfrac{\sqrt{3}}{2} & -\tfrac{1}{2} \end{pmatrix}, \quad
\Gamma''(C_3^-) = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -\tfrac{1}{2} & -\tfrac{\sqrt{3}}{2} \\ 0 & 0 & \tfrac{\sqrt{3}}{2} & -\tfrac{1}{2} \end{pmatrix}
$$

Therefore, deciding whether or not a multidimensional representation is reducible is the 
same as deciding whether it can be uniformly block-diagonalized through a common 
unitary transformation. It turns out that there is no single unitary transformation capable 
of diagonalizing all the 2 × 2 matrices of $\Gamma_3$, and so $\Gamma_3$ is an irreducible two-dimensional 
representation. (We shall show later that this can be established without “trying out” an 
infinite number of transformations.) For any group, a goal is to find all the inequivalent, 
irreducible representations possible. Anything beyond this is superfluous information. 
For our $C_{3v}$ (ammonia) group, $\Gamma_1$, $\Gamma_2$, and $\Gamma_3$ exhaust the possibilities and constitute 
a complete set of inequivalent irreducible representations. 
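
One quick way to see why no single transformation can diagonalize every matrix of $\Gamma_3$ is to note that diagonal matrices always commute, and that a common similarity transformation preserves products. If all six matrices could be diagonalized together, they would therefore have to commute with one another. The following worked check is a supplementary sketch of our own, using the matrices as read from Table 13-8, and is not the general argument promised above:

```latex
\Gamma_3(\sigma_1)\,\Gamma_3(\sigma_2)
= \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}
  \begin{pmatrix} -\tfrac{1}{2} & \tfrac{\sqrt{3}}{2} \\ \tfrac{\sqrt{3}}{2} & \tfrac{1}{2} \end{pmatrix}
= \begin{pmatrix} -\tfrac{1}{2} & \tfrac{\sqrt{3}}{2} \\ -\tfrac{\sqrt{3}}{2} & -\tfrac{1}{2} \end{pmatrix}
= \Gamma_3(C_3^+),
\qquad
\Gamma_3(\sigma_2)\,\Gamma_3(\sigma_1)
= \begin{pmatrix} -\tfrac{1}{2} & \tfrac{\sqrt{3}}{2} \\ \tfrac{\sqrt{3}}{2} & \tfrac{1}{2} \end{pmatrix}
  \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}
= \begin{pmatrix} -\tfrac{1}{2} & -\tfrac{\sqrt{3}}{2} \\ \tfrac{\sqrt{3}}{2} & -\tfrac{1}{2} \end{pmatrix}
= \Gamma_3(C_3^-).
```

The two products differ, so these matrices do not commute and cannot be brought to diagonal form by one and the same transformation.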


13-8 Generating Representations from Basis Functions 


The reader may wonder how one goes about discovering nontrivial representations like 
$\Gamma_3$.³ A convenient way to do this will now be described, and we will show at this 
stage the connection between quantum mechanics and the group theory of symmetry 
operations. 


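[Structure (III): a CFClBr-CFClBr molecule drawn in the conformation that has a point of inversion; atoms on the two ends are labeled a and b.] 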


Consider the molecule shown in (III). This molecule has but one nontrivial symmetry 
element—a point of inversion. According to our flowchart, this places it in the C; point 
group. The only symmetry operations here are E and i. Now consider two functions, 
fi and f2. Let fı be located on one end of the molecule. For instance, let fı be 1sp,, 
a ls AO centered on the fluorine atom on the left side of the molecule. Let /? be a 
similar function on the other side of the molecule, Isg,. Now let us see what happens 
to these functions when they are acted upon by our symmetry operations E and i: 


Ef—fí, Eh-f. ifh-f» ih=f 


We see that fj and f» are interchanged by inversion. Let us try to find numbers to 
represent these results. Clearly, replacing E by +1 will give the correct result. However, 
to obtain the effect of operation by i we need to interchange fj and f2. We cannot 
achieve this by multiplying by a number, since fı and fz are linearly independent. If, 
however, we rewrite the effect of i as 


it becomes clear that 7 can be represented by the matrix Ca: Thus, use of the functions 
Isp, and lsp, has generated a two-dimensional representation shown in Table 13-11. 
(The representation for E has been put into a 2 x 2 matrix form to be in dimensional 
agreement with the representation for i.) 1sp, and Isp, are called a basis for I'. Clearly, 


³ Indeed, the reader may wonder why we even worry about all this. Patience.

TABLE 13-11 » A Representation for the $C_i$ Group

$C_i$       $E$                                               $i$                                               Basis
$\Gamma$    $\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$    $\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$    $(1s_{F_a}, 1s_{F_b})$


any pair of identical functions symmetrically placed with respect to the point of inversion
would generate the same two-dimensional representation, and so there is nothing
very special about $1s_{F_a}$ and $1s_{F_b}$ as a basis. Is $\Gamma$ reducible? It is easily argued that it
must be. Any unitary 2 x 2 transformation will have no effect on the matrix for $E$ since
$U^{-1}\mathbf{1}U = U^{-1}U = \mathbf{1}$. We are thus at liberty to look for any unitary transformation $U$ that
diagonalizes the second matrix. Since this matrix is real and symmetric (hence Hermitian),
it can be diagonalized by a unitary transformation, and it is if we take

$$\begin{pmatrix} 1/\sqrt{2} & 1/\sqrt{2} \\ -1/\sqrt{2} & 1/\sqrt{2} \end{pmatrix} \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} \begin{pmatrix} 1/\sqrt{2} & -1/\sqrt{2} \\ 1/\sqrt{2} & 1/\sqrt{2} \end{pmatrix} = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$$

Now that our 2 x 2 representation has been diagonalized to two 1 x 1 "blocks," we can
rewrite our representations as shown in Table 13-12.
Our first choice of basis generated a reducible representation. What bases would we
need to generate the 1 x 1 representations $\Gamma_1$ and $\Gamma_2$? To generate $\Gamma_1$, we need a basis
function that turns into itself when it is inverted. One possibility is $f_1' = 1s_{F_a} + 1s_{F_b}$.


If we sketch this function (Fig. 13-8a), it is evident that it is regenerated unchanged by
inversion. The mathematical demonstration is

$$i f_1' = i(1s_{F_a} + 1s_{F_b}) = i\,1s_{F_a} + i\,1s_{F_b} = 1s_{F_b} + 1s_{F_a} = f_1'$$

TABLE 13-12 » Reduced Representations for the $C_i$ Group

$C_i$         $E$    $i$     Basis
$\Gamma_1$    1      1       ?
$\Gamma_2$    1      $-1$    ?

Figure 13-8 » Basis functions for irreducible representations of the $C_i$ group. [Panels (a) and (b).]

$f_1'$ is symmetric for operations $E$ and $i$. For $\Gamma_2$, we need some function $f_2'$ that turns
into minus itself upon inversion. $f_2' = 1s_{F_a} - 1s_{F_b}$ would serve, as Fig. 13-8b shows,
and as is demonstrated mathematically by

$$i f_2' = i(1s_{F_a} - 1s_{F_b}) = i\,1s_{F_a} - i\,1s_{F_b} = 1s_{F_b} - 1s_{F_a} = -f_2'$$

$f_2'$ is symmetric for $E$ and antisymmetric for $i$. (Notice that the way in which $1s_{F_a}$
and $1s_{F_b}$ needed to be mixed to produce a diagonal representation is indicated, to within
an overall sign, by the coefficients in the columns of the matrix we used to diagonalize
the original 2 x 2 representation. The matrix $U$ not only diagonalizes our $\Gamma$, it also tells
us how to mix the original bases in order to arrive at bases for the irreducible representations.)
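The diagonalization of the inversion matrix can be checked numerically. The following is a minimal sketch in plain Python; the helper names `matmul` and `transpose` and the variable names are illustrative assumptions, not notation from the text. It builds the transformation matrix from the two normalized column vectors $(1, 1)/\sqrt{2}$ and $(-1, 1)/\sqrt{2}$ and applies the similarity transformation.

```python
import math

def matmul(a, b):
    """Multiply two matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(a):
    """Return the transpose of a matrix stored as nested lists."""
    return [list(row) for row in zip(*a)]

s = 1 / math.sqrt(2)

# Columns of U hold the mixing coefficients of the two original functions.
U = [[s, -s],
     [s,  s]]

# Matrix representing inversion in the basis (1s on F_a, 1s on F_b).
inversion = [[0, 1],
             [1, 0]]

# U is real and orthogonal, so its inverse is simply its transpose.
diagonal = matmul(transpose(U), matmul(inversion, U))

for row in diagonal:
    print([round(x, 12) for x in row])
```

The diagonal entries that come out are the two one-dimensional representations of $i$: +1 for the sum of the two AOs and -1 for their difference.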

This example demonstrates an important fact: In order to generate a one-dimensional
representation for a group, we need a basis function that is either
symmetric or antisymmetric for every symmetry operation in the group. This provides
the point of connection with quantum mechanics. We showed much earlier (Chapter 2) 
that any nondegenerate wavefunction (or MO) must be symmetric or antisymmetric 
for every operation that leaves the hamiltonian unchanged. Since symmetry operations 
leave the hamiltonian unchanged (they merely interchange identical nuclei), it follows 
that nondegenerate wavefunctions or orbitals are symmetric or antisymmetric for 
every operation in the symmetry point group of the molecule. This means that every 
nondegenerate wavefunction or orbital is a basis for a one-dimensional representation 
for its molecular point group. Indeed, it can be proved that every wavefunction or 
orbital, even if degenerate, is a member of a basis for an irreducible representation for 
the point group of the molecule. The irreducible representation produced by an n-fold 
degenerate set of orbitals will be n-dimensional. (We indicate how this comes about in 
a later section.) Therefore, knowing wavefunctions enables one to generate represen- 
tations. More importantly, knowing representations allows us to say something about 
wavefunctions. For example, the representation table (Table 13-12) for the $C_i$ group
(which is now complete: this group has only two irreducible inequivalent representations)
tells us the following: (1) The molecule FClBrC-CBrClF in the conformation
pictured has no degeneracies due to symmetry in its electronic states or in MO energies
(if we do a calculation at the MO level). We can tell this because only one-dimensional
representations exist for this group. (2) Every wavefunction for an electronic state
or orbital must be either symmetric or antisymmetric for inversion. Therefore, the
following AO combinations are feasible for MOs insofar as symmetry is concerned:


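$$c\,(1s_{F_a} - 1s_{F_b}),$$
$$c_1(1s_{F_a} + 1s_{F_b}) + c_2(1s_{Cl_a} + 1s_{Cl_b}) + c_3(1s_{Br_a} + 1s_{Br_b}) + c_4(1s_{C_a} + 1s_{C_b}) + c_5(2s_{F_a} + 2s_{F_b}) + \cdots$$

The following are disallowed:

$$c\,1s_{F_a},$$
$$c_1(2s_{F_a} + 2s_{F_b}) + c_2(2s_{Cl_a} - 2s_{Cl_b})$$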
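The symmetry test applied to these AO combinations can be sketched in a few lines. This is only an illustration: the function name `inversion_parity`, the label format (for example `"1s_Fa"`), and the sample coefficients are assumptions introduced here, and inversion is modeled simply as swapping every atom labeled "a" with its partner labeled "b".

```python
def inversion_parity(coeffs):
    """Return +1, -1, or None for an AO combination under inversion.

    coeffs maps AO labels ending in 'a' or 'b' to coefficients. Inversion
    is assumed to move each AO on an 'a' atom onto the matching 'b' atom.
    """
    swapped = {}
    for label, c in coeffs.items():
        partner = label[:-1] + ("b" if label.endswith("a") else "a")
        swapped[partner] = c
    labels = set(coeffs) | set(swapped)
    if all(swapped.get(k, 0) == coeffs.get(k, 0) for k in labels):
        return +1   # symmetric: allowed
    if all(swapped.get(k, 0) == -coeffs.get(k, 0) for k in labels):
        return -1   # antisymmetric: allowed
    return None     # neither: disallowed by symmetry

# Hypothetical coefficient values chosen only to mirror the patterns above.
allowed_1 = {"1s_Fa": 1.0, "1s_Fb": -1.0}
allowed_2 = {"1s_Fa": 0.4, "1s_Fb": 0.4, "1s_Cla": 0.2, "1s_Clb": 0.2}
disallowed_1 = {"1s_Fa": 1.0}
disallowed_2 = {"2s_Fa": 0.3, "2s_Fb": 0.3, "2s_Cla": 0.5, "2s_Clb": -0.5}

for name, combo in [("allowed_1", allowed_1), ("allowed_2", allowed_2),
                    ("disallowed_1", disallowed_1), ("disallowed_2", disallowed_2)]:
    print(name, inversion_parity(combo))
```

A combination is rejected either because it sits on one end of the molecule only, or because it mixes a symmetric piece with an antisymmetric one.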


Our halogenated ethane molecule is a good starting example because it belongs to
such a simple group. But let us now return to the $C_{3v}$ group of the ammonia molecule
and continue developing the relations between group theory and MO theory. For
convenience, the molecule is again sketched (Fig. 13-9) to show its orientation with
respect to Cartesian axes. (The z coordinate is the principal axis.)

Figure 13-9 » Orientation of a model of ammonia in Cartesian space.


The normal practice in group theory is to use Cartesian coordinates or linear combinations
of such coordinates as basis functions⁴ for generating many of the representations
of a group. Therefore, we begin by examining the z coordinate to see what becomes of it
under the various symmetry operations in the $C_{3v}$ group. The results are easily seen to be

$$E z = +1z, \quad \sigma_1 z = +1z, \quad \sigma_2 z = +1z,$$
$$\sigma_3 z = +1z, \quad C_3^+ z = +1z, \quad C_3^- z = +1z$$

Thus, z is a basis for the totally symmetric representation $\Gamma_1$ of Table 13-8. The
coordinates x and y can also be used as bases. Here things get more complicated.
Clockwise rotation of x by $2\pi/3$ radians (to give $x'$) causes it to end up in a position
where it must be expressed as a resultant of both x and y (Fig. 13-10). Simple
trigonometry requires that, for a general rotation through the angle $\phi$, the unit vector
$x'$ has an x coordinate of $\cos\phi$ and a y coordinate of $\sin\phi$. Similarly, a rotation of
y to a new position designated $y'$ must yield a new x coordinate of $-\sin\phi$ and a new
y coordinate of $\cos\phi$. Thus, for a rotation through the angle $\phi$, we have

$$C_\phi \begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} x' \\ y' \end{pmatrix} = \begin{pmatrix} x\cos\phi + y\sin\phi \\ -x\sin\phi + y\cos\phi \end{pmatrix}$$


Figure 13-10 » Result of clockwise rotation of x by $2\pi/3$ to produce the new vector $x'$. [$C_3^+$ acting on x; $\phi = -2\pi/3$ or $+4\pi/3$.]

⁴ The function corresponding to a coordinate is not exactly the same thing as the coordinate itself. The z
coordinate is a ray running perpendicular to the xy plane through the coordinate origin. The function z is the
altitude above (or below) the xy plane at every point, regardless of whether it is on the z axis; $z^2$ is the square of
the altitude at every point, etc. The behavior of these functions upon rotation, reflection, etc. is the same as that
of the coordinate or product of coordinates.

The rotation operator $C_\phi$ is thus represented as

$$C_\phi = \begin{pmatrix} \cos\phi & \sin\phi \\ -\sin\phi & \cos\phi \end{pmatrix} \qquad (13\text{-}1)$$


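In the case at hand, clockwise rotation by $2\pi/3$ radians is a decrease of $2\pi/3$ in $\phi$.
Substituting $\phi = -2\pi/3$ for $C_3^+$ and $\phi = +2\pi/3$ for $C_3^-$ gives us the following 2 x 2
representations:

$$C_3^+: \begin{pmatrix} -\frac{1}{2} & -\frac{\sqrt{3}}{2} \\ \frac{\sqrt{3}}{2} & -\frac{1}{2} \end{pmatrix}, \qquad C_3^-: \begin{pmatrix} -\frac{1}{2} & \frac{\sqrt{3}}{2} \\ -\frac{\sqrt{3}}{2} & -\frac{1}{2} \end{pmatrix}$$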
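The two rotation matrices follow directly from Eq. (13-1). A minimal sketch, assuming only the sign convention stated in the text (clockwise rotation corresponds to a negative angle); the function name `rotation_matrix` is an illustrative choice.

```python
import math

def rotation_matrix(phi):
    """Matrix of Eq. (13-1): the rows express x' and y' in terms of x and y."""
    c, s = math.cos(phi), math.sin(phi)
    return [[c, s],
            [-s, c]]

c3_plus = rotation_matrix(-2 * math.pi / 3)   # clockwise rotation
c3_minus = rotation_matrix(+2 * math.pi / 3)  # counterclockwise rotation

for name, m in [("C3+", c3_plus), ("C3-", c3_minus)]:
    print(name, [[round(x, 4) for x in row] for row in m])
```

The off-diagonal entries are $\pm\sqrt{3}/2 \approx \pm 0.866$, with opposite sign patterns for the two rotations.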


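For reflection $\sigma_1$, it is easy to see that x is unmoved and y goes into minus itself.
Maintaining our dimensionality of two, this gives

$$\sigma_1: \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$$

For $\sigma_2$, x and y again move into positions $x'$ and $y'$, expressible as resultants of the
original x and y vectors (Fig. 13-11). Thus, $x' = -\frac{1}{2}x + \frac{\sqrt{3}}{2}y$, $y' = \frac{\sqrt{3}}{2}x + \frac{1}{2}y$, and $\sigma_2$
has the representation

$$\sigma_2: \begin{pmatrix} -\frac{1}{2} & \frac{\sqrt{3}}{2} \\ \frac{\sqrt{3}}{2} & \frac{1}{2} \end{pmatrix}$$

Similarly, $\sigma_3$ has the representation

$$\sigma_3: \begin{pmatrix} -\frac{1}{2} & -\frac{\sqrt{3}}{2} \\ -\frac{\sqrt{3}}{2} & \frac{1}{2} \end{pmatrix}$$

The operation $E$ does not move either x or y and is represented by the two-dimensional
unit matrix. This completes our use of x and y, and we see that together they generate
the two-dimensional representation $\Gamma_3$. What should we use next? We have not yet
generated $\Gamma_2$, and so we know that we need to look for another basis. A basis that is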


Figure 13-11 » Result of reflection through the $\sigma_2$ plane of x and y to produce new vectors $x'$ and $y'$.

Figure 13-12 » Effect of a reflection on the direction of $\phi$.


frequently employed in group theory is a direction of rotation about some symmetry
axis. In our case, the symmetry axis is the z axis, and we let $R_z$ stand for a direction of
rotation about this axis. (Which direction we choose, clockwise or counterclockwise,
is arbitrary.) Now we consider how $R_z$ is affected by the symmetry operations. $C_3^+$
and $C_3^-$ have no effect since they merely shift the origin of the $\phi$ coordinate but do not
affect the direction in which $\phi$ increases or decreases. Put another way, the direction
of motion of the hands of a clock is not affected by rotating the clock about an axis
perpendicular to its face. However, reflections $\sigma_1$, $\sigma_2$, and $\sigma_3$ will cause the direction
to be reversed (Fig. 13-12). Therefore,

$$E R_z = +1R_z, \quad \sigma_1 R_z = -1R_z, \quad \sigma_2 R_z = -1R_z,$$
$$\sigma_3 R_z = -1R_z, \quad C_3^+ R_z = +1R_z, \quad C_3^- R_z = +1R_z$$

and we see that $R_z$ is a basis for $\Gamma_2$.


13-9 Labels for Representations 


Thus far, we have labeled our representations $\Gamma_1$, $\Gamma_2$, etc. We will now describe rules
for a more meaningful symbolism, one that has become standard. The rules are as
follows:


1. A one-dimensional representation is given the main symbol A or B. A is used if the
representation is symmetric for rotation by $2\pi/n$ about the n-fold principal axis, B
if it is antisymmetric. (For $C_1$, $C_s$, and $C_i$ groups, which have no principal axis, the
symbol is always A.) A two-dimensional representation has the main symbol E.
Three-dimensional representations are symbolized T (or sometimes F), and
four-dimensional representations are symbolized G.


2. Subscripts may be applied as follows. If there are $C_2$ axes perpendicular to the
principal axis, then a subscript 1 (2) means that the basis for the representation is
symmetric (antisymmetric) for such a rotation. In the absence of such $C_2$ axes,
vertical reflection planes are used instead, if present. If there is an inversion center,
subscripts g (gerade) and u (ungerade) refer to the basis for the representation being
respectively symmetric or antisymmetric for inversion.


3. Superscripts may be applied as follows: If there is a $\sigma_h$ plane, a single prime means
the basis for the representation is symmetric for that reflection; a double prime
means it is antisymmetric.

TABLE 13-13 » Representation Table for the $C_{3v}$ Group

$C_{3v}$    $E$    $\sigma_1$    $\sigma_2$    $\sigma_3$
$A_1$    1    1    1    1
$A_2$    1    $-1$    $-1$    $-1$
$E$    $\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$    $\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$    $\begin{pmatrix} -\frac{1}{2} & \frac{\sqrt{3}}{2} \\ \frac{\sqrt{3}}{2} & \frac{1}{2} \end{pmatrix}$    $\begin{pmatrix} -\frac{1}{2} & -\frac{\sqrt{3}}{2} \\ -\frac{\sqrt{3}}{2} & \frac{1}{2} \end{pmatrix}$


Use of these conventions enables us to write our $C_{3v}$ representation as shown in
Table 13-13. (The symbol E in the left-most column of the table should not be confused
with the $E$ in the top row. The former labels a two-dimensional representation, the latter
refers to the identity operation.) At the right-hand side of the table are listed, for each
representation, the bases described above plus a few others. It is convenient for chemical
applications to list all Cartesian combinations up to the second power, as has been done
here. One can go to powers as high as one pleases ($z^3$ is a basis for $A_1$), but first and
second powers are of most common practical use in chemistry. The use of parentheses
and commas for bases for the E representation [e.g., (xz, yz)] indicates which pairs of
functions may be selected as bases for this two-dimensional representation.


13-10 Some Connections Between the Representation Table and Molecular Orbitals


It is possible, by inspecting Table 13-13, to predict certain properties of MOs or
wavefunctions for a molecule having $C_{3v}$ symmetry. Suppose that we did an MO calculation
on ammonia. What does this table tell us to expect? In the first place, it tells us that
there are only three MO symmetry types possible. We might find nondegenerate MOs
having $A_1$ symmetry (totally symmetric), or $A_2$ symmetry (antisymmetric for reflections).
It is also possible for MOs to exist that form bases for E representations. Since
such bases are intermixed by some operations, however, these MOs cannot be either
symmetric or antisymmetric for every operation. Therefore, they must be degenerate.
And, since the intermixing occurs only within pairs of such bases, the MOs must be
doubly degenerate. Thus, inspection of the representation table tells us at once that
nondegenerate and doubly degenerate MOs are possible for ammonia. Since the symmetry
requirements apply to wavefunctions as well as to one-electron MOs, the table tells
us also that ammonia can have states with wavefunctions whose spatial symmetries
are $A_1$, $A_2$, or E. It is conventional to label MOs with lowercase symmetry symbols,
$a_1$, $a_2$, e, and state functions with uppercase symbols.

The representation table also tells us which basis AOs can appear in the various
MOs. For instance, if we used a set of valence AOs for NH3, we would have a $2p_z$
AO on nitrogen, oriented along the $C_3$ axis of the molecule, and also $2p_x$ and $2p_y$ AOs
perpendicular to $C_3$. Now $2p_x$, $2p_y$, and $2p_z$ transform like x, y, and z, respectively
(if they are centered at a common point on the $C_3$ axis); just as z is symmetric for
all symmetry operations of the group, so is $2p_z$. This means that the $2p_z$ AO can be
expected to appear only in MOs with $a_1$ symmetry. $2p_x$ and $2p_y$ must appear together
in e-type MOs. What about 2s on nitrogen? Intuitively, we know that it, like $2p_z$, is
unaffected by all the operations, and so it should appear only in $a_1$ MOs. The table
indicates this by listing $z^2$ and $x^2 + y^2$ as bases for the $A_1$ representation. Since these
have the same symmetry, their sum retains the symmetry, and so $x^2 + y^2 + z^2 = r^2$ is
also a basis for $A_1$. Since $r^2$ is spherically symmetric, this indicates that any spherically
symmetric function on nitrogen is a basis for the $A_1$ representation.

TABLE 13-13 (continued)

$C_{3v}$    $C_3^+$    $C_3^-$    Bases
$A_1$    1    1    $z$, $x^2 + y^2$, $z^2$
$A_2$    1    1    $R_z$
$E$    $\begin{pmatrix} -\frac{1}{2} & -\frac{\sqrt{3}}{2} \\ \frac{\sqrt{3}}{2} & -\frac{1}{2} \end{pmatrix}$    $\begin{pmatrix} -\frac{1}{2} & \frac{\sqrt{3}}{2} \\ -\frac{\sqrt{3}}{2} & -\frac{1}{2} \end{pmatrix}$    $(x, y)$, $(R_x, R_y)$, $(x^2 - y^2, xy)$, $(xz, yz)$

If we were to use an expanded basis set of AOs, including 3d AOs on nitrogen,
the group table tells us that $3d_{z^2}$ would appear in the nondegenerate $a_1$ MOs and that
$3d_{x^2-y^2}$, $3d_{xy}$, $3d_{xz}$, and $3d_{yz}$ would appear in degenerate e-type MOs. (Recall that
$3d_{z^2}$ is really $3d_{3z^2-r^2}$, or $3d_{2z^2-x^2-y^2}$.)

What about the 1s AOs on the H’s? Can the representation table tell us in which MOs 
these will appear? It turns out that it can, and that they go into both a1- and e-type MOs, 
but we will defer showing how this can be told from the table until later in the chapter. 

Readers may begin to appreciate the usefulness of group theory in quantum chemistry
when they consider that, simply by assigning ammonia to the $C_{3v}$ group and
looking up the representation table, we are able to say that a minimum-valence basis
set MO calculation will produce nondegenerate, totally symmetric ($a_1$) MOs containing
$N_{2s}$, $N_{2p_z}$, and $H_{1s}$ AOs, and doubly degenerate (e) MOs containing $N_{2p_x}$, $N_{2p_y}$,
and $H_{1s}$ AOs. (No $a_2$ MOs will appear because none of our AOs are a basis for that
representation.)


13-11 Representations for Cyclic and Related Groups


Cyclic groups are the groups $C_2, C_3, C_4, \ldots, C_n$ containing only the n - 1 rotation
operations and the identity operation $E$. We devote a separate section to these because
there are some special problems connected with finding and labeling representations
for these groups. (This section is off the mainstream of development of this chapter
and may be skipped if desired.)

Let us consider the operations associated with the n-fold proper axis oriented along
the z axis and ask what will become of the function $f = \exp(i\phi)$ as it is rotated clockwise
about this axis by $2\pi/n$ radians. (We entertain the idea that $\exp(i\phi)$ might be a convenient
basis for a representation since such functions were found to be eigenfunctions for
the particle-in-a-ring problem in Chapter 2.) Since the clockwise direction is opposite
to the normal direction of the $\phi$ coordinate, the effect of the rotation is to put $f(\phi)$ where
$f(\phi - 2\pi/n)$ used to be. To see how the function after rotation compares to that before
rotation, we must compare $\exp(i\phi)$ with $\exp[i(\phi - 2\pi/n)]$. That is, the representation
$R_f$, such that $C_n^+ \exp(i\phi) = R_f \exp(i\phi)$, is given by $\exp(i\phi)/\exp[i(\phi - 2\pi/n)]$, or
$\exp(2\pi i/n)$, which equals $\cos(2\pi/n) + i\sin(2\pi/n)$. For various fractions of a cycle
(i.e., various n), $R_f$ takes on different values:

n:        1    2       3                  4     5                  6                 7                  8
$R_f$:    1    $-1$    $\exp(2\pi i/3)$    $i$    $\exp(2\pi i/5)$    $\exp(\pi i/3)$    $\exp(2\pi i/7)$    $\exp(\pi i/4)$

TABLE 13-14a » Partial Representation for the $C_3$ Group

$C_3$    $E$    $C_3$               $C_3^2$             Basis
$A$      1      1                   1                   $z$, $R_z$
         1      $\exp(2\pi i/3)$    $\exp(4\pi i/3)$    $\exp(i\phi)$

TABLE 13-14b » Partial Representation for the $C_3$ Group

$C_3$    $E$    $C_3$         $C_3^2$         $\epsilon = \exp(2\pi i/3)$
$A$      1      1             1               $z$, $R_z$
         1      $\epsilon$    $\epsilon^*$    $\exp(i\phi)$
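The phase factors $R_f = \exp(2\pi i/n)$ for n = 1 through 8 can be reproduced with the standard `cmath` module. This is a small sketch; the function name `ring_phase` is an illustrative assumption.

```python
import cmath
import math

def ring_phase(n):
    """Factor multiplying exp(i*phi) after a clockwise rotation by 2*pi/n."""
    return cmath.exp(2j * math.pi / n)

for n in range(1, 9):
    r = ring_phase(n)
    # Print real and imaginary parts, i.e. cos(2*pi/n) and sin(2*pi/n).
    print(n, round(r.real, 4), round(r.imag, 4))
```

Each value has unit modulus, which is why the absolute square (rather than the plain square) is the natural quantity to sum when such complex representations are involved.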


The important point here is that $f = \exp(i\phi)$ is a basis for a one-dimensional representation
for $C_n$ since a rotation turns $f$ into a constant times $f$ and not into some other
function. For the $C_3$ group, then, we could write a partial representation table as shown
in Table 13-14a. Now $\exp(4\pi i/3)$ is equal to $\exp(-2\pi i/3)$, and $\exp(-2\pi i/3)$ is the
complex conjugate of $\exp(2\pi i/3)$. If we let $\epsilon = \exp(2\pi i/3)$, we can write the table as
shown in Table 13-14b. If $f = \exp(i\phi)$ is a satisfactory basis, $f^* = \exp(-i\phi)$ is also
acceptable, since it is linearly independent of $f$. If one calculates the representations
for n = 1, 2, 3, ... as before, one finds that the numbers $R_{f^*}$ are the complex conjugates
of $R_f$. Therefore, we can immediately expand our table as shown in Table 13-14c.
Since the existence of $\exp(i\phi)$ as a basis for a one-dimensional representation always
implies that $\exp(-i\phi)$ exists as a basis, these sorts of one-dimensional representations
always occur in pairs. It is conventional to combine these with braces and refer to
them with the symbol E, which we claimed earlier is reserved for two-dimensional
representations. Note, however, that a pair of one-dimensional representations is not
the same as a two-dimensional representation, and we must broaden our definition of
the symbol E to include both types of situation.

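TABLE 13-14c » Representation for the $C_3$ Group

$C_3$    $E$    $C_3$           $C_3^2$         $\epsilon = \exp(2\pi i/3)$
$A$      1      1               1               $z$, $R_z$
$E$      1      $\epsilon$      $\epsilon^*$    $\exp(i\phi)$
         1      $\epsilon^*$    $\epsilon$      $\exp(-i\phi)$

TABLE 13-15 » Representation for the $C_3$ Group

$C_3$    $E$    $C_3$    $C_3^2$    Basis
$A$      1      1        1          $z$, $R_z$
$E$      $\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$    $\begin{pmatrix} -\frac{1}{2} & -\frac{\sqrt{3}}{2} \\ \frac{\sqrt{3}}{2} & -\frac{1}{2} \end{pmatrix}$    $\begin{pmatrix} -\frac{1}{2} & \frac{\sqrt{3}}{2} \\ -\frac{\sqrt{3}}{2} & -\frac{1}{2} \end{pmatrix}$    $(x, y)$

Our representation table for the $C_3$ group now looks almost the way one would find
it in a standard tabulation. However, instead of listing $\exp(i\phi)$ and $\exp(-i\phi)$ as bases,
the convention is to list x and y. In relating Cartesian to spherical polar coordinates,
$x = r\sin\theta\cos\phi$ and $y = r\sin\theta\sin\phi$. When we are concerned only with changes in
$\phi$, x goes as $\cos\phi$, y as $\sin\phi$, and, since $\sin\phi$ and $\cos\phi$ are expressible as linear
combinations of $\exp(i\phi)$ and $\exp(-i\phi)$, it follows that x and y are equivalent to $\exp(\pm i\phi)$
as bases. If we had started out with x and y as bases, we would have found that, for
rotations, these are intermixed, leading to a truly two-dimensional representation. In
fact, we worked out the effects of $C_3^+$ and $C_3^-$ on x and y earlier for the $C_{3v}$ group. There
we found that $C_3$ and $C_3^2$ ($= C_3^-$) were represented by the two-dimensional matrices
shown in Table 13-15. But this E representation is reducible to the two one-dimensional
representations through the unitary matrix

$$U = \frac{1}{\sqrt{2}} \begin{pmatrix} 1 & i \\ i & 1 \end{pmatrix}$$

For $C_3$, for example,

$$U^{\dagger} \begin{pmatrix} -\frac{1}{2} & -\frac{\sqrt{3}}{2} \\ \frac{\sqrt{3}}{2} & -\frac{1}{2} \end{pmatrix} U = \begin{pmatrix} -\frac{1}{2} - \frac{i\sqrt{3}}{2} & 0 \\ 0 & -\frac{1}{2} + \frac{i\sqrt{3}}{2} \end{pmatrix}$$

which can be separated into two one-dimensional representations:

$E$    $C_3$                                   $C_3^2$
1      $-\frac{1}{2} - \frac{i\sqrt{3}}{2}$    $-\frac{1}{2} + \frac{i\sqrt{3}}{2}$
1      $-\frac{1}{2} + \frac{i\sqrt{3}}{2}$    $-\frac{1}{2} - \frac{i\sqrt{3}}{2}$

Since $\epsilon = \exp(2\pi i/3) = \cos(2\pi/3) + i\sin(2\pi/3) = -\frac{1}{2} + \frac{i\sqrt{3}}{2}$, we recognize that this
pair of one-dimensional representations is the same as the pair we found earlier.
Furthermore, we note that $U$ is precisely the transformation that turns x (i.e., $\cos\phi$) and y
(i.e., $\sin\phi$) back into $\exp(\pm i\phi)$ (to within a constant multiplier):

$$U \begin{pmatrix} x \\ y \end{pmatrix} = \frac{1}{\sqrt{2}} \begin{pmatrix} 1 & i \\ i & 1 \end{pmatrix} \begin{pmatrix} \cos\phi \\ \sin\phi \end{pmatrix} = \frac{1}{\sqrt{2}} \begin{pmatrix} \cos\phi + i\sin\phi \\ i\cos\phi + \sin\phi \end{pmatrix} = \begin{pmatrix} (1/\sqrt{2})\exp(i\phi) \\ (i/\sqrt{2})\exp(-i\phi) \end{pmatrix}$$

Thus, (x, y) produce a reducible, two-dimensional representation equivalent to the
irreducible representation given by $\exp(\pm i\phi)$. The reason for listing (x, y) as bases
is simply that most applications of the table are made to real functions (e.g., chemists
usually prefer to work with $2p_x$ and $2p_y$ AOs rather than with $2p_{+1}$ and $2p_{-1}$). Our
final form for the table, then, is that shown in Table 13-16, where additional bases have
been listed.

TABLE 13-16 » Irreducible Representations for the $C_3$ Group

$C_3$    $E$    $C_3$           $C_3^2$         $\epsilon = \exp(2\pi i/3)$
$A$      1      1               1               $z$, $R_z$, $x^2 + y^2$, $z^2$
$E$      1      $\epsilon$      $\epsilon^*$    $(x, y)$, $(R_x, R_y)$,
         1      $\epsilon^*$    $\epsilon$      $(x^2 - y^2, xy)$, $(yz, xz)$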
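The reduction of the two-dimensional (x, y) representation of $C_3$ can be verified numerically. The sketch below assumes the unitary matrix $U = (1/\sqrt{2})\begin{pmatrix} 1 & i \\ i & 1 \end{pmatrix}$ that appears in the transformation of (x, y) into $\exp(\pm i\phi)$, and applies it as $U^{\dagger} M U$; the helper names `matmul` and `dagger` are illustrative, and the order in which $\epsilon$ and $\epsilon^*$ appear on the diagonal depends on that choice of convention.

```python
import math

def matmul(a, b):
    """Multiply two matrices stored as nested lists (entries may be complex)."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def dagger(a):
    """Conjugate transpose of a matrix stored as nested lists."""
    return [[a[j][i].conjugate() for j in range(len(a))]
            for i in range(len(a[0]))]

r = 1 / math.sqrt(2)
h = math.sqrt(3) / 2

U = [[r, 1j * r],
     [1j * r, r]]

# Matrix for C3 (clockwise rotation by 2*pi/3) in the (x, y) basis.
C3 = [[-0.5, -h],
      [h, -0.5]]

epsilon = complex(-0.5, h)  # exp(2*pi*i/3)

reduced = matmul(dagger(U), matmul(C3, U))
for row in reduced:
    print([complex(round(z.real, 6), round(z.imag, 6)) for z in row])
```

The off-diagonal elements vanish, and the two diagonal elements are $\epsilon^*$ and $\epsilon$, the same pair of one-dimensional representations generated by $\exp(\pm i\phi)$.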

The preceding discussion has been within the context of the $C_n$ groups. One might
ask what sort of symmetry operation is needed to make it impossible for $\exp(i\phi)$ to
be a basis for a one-dimensional representation. The answer is, any operation that
causes a reversal in the direction of the coordinate $\phi$. For then, $\exp(i\phi) \to \exp(-i\phi)$,
and our basis function has turned into another independent function rather than into a
constant times itself. Therefore, the presence of any symmetry operation that reverses
the direction of motion of the hands of a clock will suffice to prevent representations of
the $\epsilon$, $\epsilon^*$ sort. Operations that reverse clock direction are $\sigma_d$, $\sigma_v$, and $C_2$ (perpendicular
to the principal axis). Clock direction is unaffected by $\sigma_h$, $i$, and $S_n$. Therefore, we
can expect $\epsilon$, $\epsilon^*$ types of representations to occur in groups of types $C_n$, $C_{nh}$, $S_n$, all of
which have a $C_n$ axis but no $\sigma_d$, $\sigma_v$, or $C_2$ elements.


13-12 Orthogonality in Irreducible Inequivalent Representations


We come now to a very important point regarding representations. We will illustrate
our arguments with the representation table (Table 13-13) for the $C_{3v}$ group. Notice
the following features of that table:


1. If we choose the $A_1$ representation, square all the numbers, and sum over all six
symmetry operations, we get 6 as a result.


2. If we do the same thing with the $A_2$ representation, we get the same result.


3. If we do the same thing for the upper left-hand elements (the 1, 1 elements) of the
E representation, we get 3 as a result.


4. If we do the same thing for each of the other positions in the E representation, the
result is 3 each time.


In general, the result of this procedure for any irreducible representation in any
group will be the order of the group divided by the dimension of the representation. That
is, if $h$ is the order of the group, $l_i$ is the dimension of representation $\Gamma_i$, and $\Gamma_i^{(j,k)}(R)$
is the number in the (j, k) position of the matrix representing symmetry operation $R$,
then the mathematical formula that corresponds to our general statement is

$$\sum_R \left| \Gamma_i^{(j,k)}(R) \right|^2 = \frac{h}{l_i} \qquad (13\text{-}2)$$

Note that the absolute square is used to accommodate the $\epsilon$, $\epsilon^*$ type representations of
cyclic groups.

We can conceive of the set of six numbers for $A_1$ as being a vector of six elements.
The six numbers of $A_2$ constitute a second vector, and E provides four more
six-dimensional vectors. If each such vector is multiplied by $\sqrt{l_i/h}$, then each vector is
normalized.

Now let us examine these vectors regarding their orthogonality. If we take the scalar
product of the unnormalized vectors $A_1$ and $A_2$, the result is zero:

$$\underbrace{(1 \;\; 1 \;\; 1 \;\; 1 \;\; 1 \;\; 1)}_{A_1} \; \underbrace{\begin{pmatrix} 1 \\ -1 \\ -1 \\ -1 \\ 1 \\ 1 \end{pmatrix}}_{A_2} = 0$$

The reader may quickly verify that the scalar product of any two different vectors from
among the set of six in the $C_{3v}$ representation table is zero. Thus these six vectors are
orthogonal. Once again, this result always holds between "representation vectors" in
irreducible inequivalent representations for any group. Combining this orthogonality
property with the normality property mentioned earlier, we have

$$\sum_R \left[ \sqrt{l_i/h}\; \Gamma_i^{(j,k)}(R) \right]^* \left[ \sqrt{l_{i'}/h}\; \Gamma_{i'}^{(j',k')}(R) \right] = \delta_{i,i'}\, \delta_{j,j'}\, \delta_{k,k'} \qquad (13\text{-}3)$$


where $\Gamma_i$ and $\Gamma_{i'}$ are understood to be irreducible and, if $i \neq i'$, inequivalent. This
relation, sometimes called "the great orthogonality theorem," is of central importance
in group theory. Its essence is captured by the statement that "irreducible inequivalent
representations are comprised of orthogonal vectors." We do not prove the theorem in
this book,⁵ but we do make considerable use of Eq. (13-3).
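The orthogonality and normalization statements can be checked directly on the six "representation vectors" of the $C_{3v}$ group. The sketch below is illustrative only: the dictionary name `vectors`, the labels such as `"E12"`, and the ordering of operations (E, $\sigma_1$, $\sigma_2$, $\sigma_3$, $C_3^+$, $C_3^-$) are choices made here, with the E matrices taken from the (x, y) basis worked out earlier.

```python
import math
from itertools import combinations

h3 = math.sqrt(3) / 2

# 2 x 2 matrices of the E representation, in the order
# E, sigma1, sigma2, sigma3, C3+, C3-.
E_MATRICES = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[1.0, 0.0], [0.0, -1.0]],
    [[-0.5, h3], [h3, 0.5]],
    [[-0.5, -h3], [-h3, 0.5]],
    [[-0.5, -h3], [h3, -0.5]],
    [[-0.5, h3], [-h3, -0.5]],
]

vectors = {
    "A1": [1, 1, 1, 1, 1, 1],
    "A2": [1, -1, -1, -1, 1, 1],
}
# Each matrix position (j, k) of E supplies one six-component vector.
for j in range(2):
    for k in range(2):
        vectors[f"E{j + 1}{k + 1}"] = [m[j][k] for m in E_MATRICES]

def dot(u, v):
    """Scalar product of two real vectors."""
    return sum(a * b for a, b in zip(u, v))

norms = {name: dot(v, v) for name, v in vectors.items()}
overlaps = {(a, b): dot(vectors[a], vectors[b])
            for a, b in combinations(vectors, 2)}

print({name: round(n, 10) for name, n in norms.items()})
print(max(abs(x) for x in overlaps.values()))
```

The squared lengths come out as h divided by the dimension (6 for $A_1$ and $A_2$, 3 for each E vector), and every scalar product between different vectors vanishes to within rounding error.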

One immediate result of the relation is that it enables us to tell when we have
completed the task of finding all the inequivalent irreducible representations of a group.
If we consider the $C_{3v}$ group, for example, we note that it is of order six, since there
are six symmetry operations. This means that each "representation vector" will have
six elements, i.e., is a vector in six-dimensional space. The maximum number of
orthogonal vectors we can have in six-dimensional space is six. Therefore, the number
of representation vectors cannot exceed the order of the group. Furthermore, since
the number of vectors provided by an n-dimensional representation is $n^2$ (e.g., E is
two-dimensional and gives four vectors), we can state that the sum of the squares of the
dimensions of the inequivalent irreducible representations of a group cannot exceed
the order of the group.

⁵ See Bishop [1] or Eyring et al. [2, Appendix VI].

In fact, it can be proved⁵ that this sum of squares of dimensions must equal the order
of the group when all such representations are included. That is,

$$\sum_i^{\substack{\text{all inequivalent} \\ \text{irreducible representations}}} l_i^2 = h \qquad (13\text{-}4)$$

Thus, the fact that the squares of the dimensions of the $A_1$, $A_2$, and E representations
for the $C_{3v}$ group add up to six, which is the order of the group, indicates that no more
irreducible representations exist (except those that are equivalent to those we already
have).


EXAMPLE 13-6 Count up all the symmetry operations you can think of for (planar)
BH3. Can you guess from this how many irreducible representations exist for this
molecule and what their dimensions are?


SOLUTION » The operations are: $E$, $C_3^+$, $C_3^-$, $\sigma_1$, $\sigma_2$, $\sigma_3$, $\sigma_h$, $S_3^+$, $S_3^-$, $C_2$, $C_2'$, $C_2''$, or 12 operations,
so the group order is 12. There must be at least one irreducible representation of dimension one, so
we cannot have three two-dimensional representations. We could have: 12 one-dimensional
representations; eight one-dimensional and one two-dimensional representations; four one-dimensional
and two two-dimensional representations. From what has been presented so far, all three of these
are possible.


13-13 Characters and Character Tables


Thus far, we have defined representations and shown how they may be generated from
basis functions. We have distinguished between reducible and irreducible representations
and have indicated that there is an unlimited number of equivalent representations
corresponding to any given two- or higher-dimensional representation. An example
of a pair of equivalent, reducible, two-dimensional representations, derived in
Section 13-11, is given in Table 13-17. Equivalent representations are related through
unitary transformations, which are a special kind of similarity transformation (see
Chapter 9), and two matrices that differ only by a similarity transformation have the same
trace, or character (Problem 9-7), which is defined as the sum of the diagonal elements
of a matrix. The matrices in Table 13-17 exemplify this fact, their characters
being respectively 2, -1, -1 for $E$, $C_3$, and $C_3^2$ in both representations. The generally
accepted symbol for the character of operation $R$ in representation $\Gamma_i$ is $\chi_i(R)$, and the
mathematical definition is

$$\chi_i(R) = \sum_j \Gamma_i^{(j,j)}(R) \qquad (13\text{-}5)$$

TABLE 13-17 » Equivalent Representations for the $C_3$ Group

$C_3$    $E$    $C_3$    $C_3^2$
$\Gamma$    $\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$    $\begin{pmatrix} -\frac{1}{2} & -\frac{\sqrt{3}}{2} \\ \frac{\sqrt{3}}{2} & -\frac{1}{2} \end{pmatrix}$    $\begin{pmatrix} -\frac{1}{2} & \frac{\sqrt{3}}{2} \\ -\frac{\sqrt{3}}{2} & -\frac{1}{2} \end{pmatrix}$
$U^{\dagger}\Gamma U$    $\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$    $\begin{pmatrix} -\frac{1}{2} - \frac{i\sqrt{3}}{2} & 0 \\ 0 & -\frac{1}{2} + \frac{i\sqrt{3}}{2} \end{pmatrix}$    $\begin{pmatrix} -\frac{1}{2} + \frac{i\sqrt{3}}{2} & 0 \\ 0 & -\frac{1}{2} - \frac{i\sqrt{3}}{2} \end{pmatrix}$

TABLE 13-18 » Characters for the $C_{3v}$ Group

$C_{3v}$    $E$    $\sigma_1$    $\sigma_2$    $\sigma_3$    $C_3^+$    $C_3^-$    Bases
$A_1$    1    1    1    1    1    1    $z$, $x^2 + y^2$, $z^2$
$A_2$    1    $-1$    $-1$    $-1$    1    1    $R_z$
$E$    2    0    0    0    $-1$    $-1$    $(x, y)$, $(R_x, R_y)$, $(x^2 - y^2, xy)$, $(xz, yz)$


In practice, it is the characters of irreducible representations that are used in most
chemical applications of group theory. This means that one needs only the character
table for a group, rather than the whole representation table. For the $C_{3v}$ group, the
character table is displayed in Table 13-18. Comparison with Table 13-13 will make
clear that the character is merely the sum of diagonal elements. (For one-dimensional
representations, the character and the representation are identical.) Since the
representation for the identity operation is always a unit matrix, the character for this operation
is always the same as the dimension of the representation. Hence, the first character in
a row tells us the dimension of the corresponding representation.
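Since a character is just a trace, the E row of the character table can be generated from the 2 x 2 matrices of the representation table. A minimal sketch, assuming the (x, y)-basis matrices for $C_{3v}$ derived earlier; the dictionary keys are plain-text stand-ins for the operation symbols.

```python
import math

h3 = math.sqrt(3) / 2

# Matrices of the E representation of C3v in the (x, y) basis.
E_REPRESENTATION = {
    "E": [[1.0, 0.0], [0.0, 1.0]],
    "sigma1": [[1.0, 0.0], [0.0, -1.0]],
    "sigma2": [[-0.5, h3], [h3, 0.5]],
    "sigma3": [[-0.5, -h3], [-h3, 0.5]],
    "C3+": [[-0.5, -h3], [h3, -0.5]],
    "C3-": [[-0.5, h3], [-h3, -0.5]],
}

def trace(m):
    """Character of a matrix: the sum of its diagonal elements."""
    return sum(m[i][i] for i in range(len(m)))

characters = {op: trace(m) for op, m in E_REPRESENTATION.items()}
print(characters)
```

The identity gives 2 (the dimension of the representation), each reflection gives 0, and each rotation gives -1.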

An immediate consequence of the orthogonality theorem for representations is that
the vectors resulting from characters are orthogonal too. This is trivially obvious for
characters of one-dimensional representations. For multidimensional representations,
the character vector is simply the sum of the representation vectors in diagonal positions.
If a given vector (say, the $A_1$ vector) is orthogonal to each of these (say, $E_{11}$ and $E_{22}$),
then it is orthogonal to their sum; that is, in terms of the vector notation of Chapter 9,
if $\mathbf{a} \cdot \mathbf{e}_{11} = 0$ and $\mathbf{a} \cdot \mathbf{e}_{22} = 0$, then $\mathbf{a} \cdot (\mathbf{e}_{11} + \mathbf{e}_{22}) = 0$.

Inspection of Table 13-18 reveals a curious thing. For any given row of characters,
all operations in the same class have the same character. There is a fairly simple reason
for this. We have indicated already that operations in the same class are operations
that can be interchanged merely by group reflections, rotations, etc., of the symmetry
elements in the group, but we have seen that such changes are mathematically effected
through similarity transformations. This means that representations for operations in
the same class are interchangeable via similarity transformations. That is, the matrix
representing, say, $\sigma_1$ (in the E representation of $C_{3v}$) can be made equal to the matrix
representing $\sigma_2$ through a similarity transformation:

$$T^{-1} \sigma_1 T = \sigma_2 \qquad (13\text{-}6)$$

(From our group Table 13-7, we can ascertain that $T$ must be the matrix representing
$\sigma_3$.) Now, the two sides of Eq. (13-6) must have the same character since they
are identical 2 x 2 matrices, but the left-hand side must have the same character as $\sigma_1$
since character is unchanged by a similarity transformation. Therefore, $\sigma_1$ and $\sigma_2$ have
the same character.

TABLE 13-19 » The Standard Short-Form Character Table for the $C_{3v}$ Group

$C_{3v}$    $E$    $3\sigma_v$    $2C_3$    Bases
$A_1$    1    1    1    $z$, $x^2 + y^2$, $z^2$
$A_2$    1    $-1$    1    $R_z$
$E$    2    0    $-1$    $(x, y)$, $(R_x, R_y)$, $(x^2 - y^2, xy)$, $(xz, yz)$
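The similarity transformation of Eq. (13-6) can be carried out explicitly with the 2 x 2 reflection matrices of the E representation. The sketch below assumes the (x, y)-basis matrices used earlier and uses the fact that a reflection is its own inverse, so that $T^{-1} = T$ when $T$ represents $\sigma_3$; the helper names are illustrative.

```python
import math

h3 = math.sqrt(3) / 2

sigma1 = [[1.0, 0.0], [0.0, -1.0]]
sigma2 = [[-0.5, h3], [h3, 0.5]]
sigma3 = [[-0.5, -h3], [-h3, 0.5]]

def matmul(a, b):
    """Multiply two matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def trace(m):
    """Sum of the diagonal elements."""
    return sum(m[i][i] for i in range(len(m)))

# Reflecting twice restores the original, so sigma3 is its own inverse.
identity_check = matmul(sigma3, sigma3)

# T^{-1} sigma1 T with T = sigma3.
transformed = matmul(sigma3, matmul(sigma1, sigma3))

print([[round(x, 6) for x in row] for row in transformed])
print(trace(sigma1), round(trace(transformed), 12))
```

The transformed matrix reproduces the matrix for $\sigma_2$, and its trace equals that of $\sigma_1$, as the argument in the text requires.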

We can take advantage of the above rule to write our character table in abbreviated 
form, illustrated for the C3, group in Table 13-19. This is the standard form for character 
tables. A collection of such tables appears in Appendix 11. 

That characters must be equal in the same class is a restriction on our character 
vectors. In Table 13-19 it is made evident that, in the C3v group, our vectors really only
have three independent variables each, one for each class. These are properly thought
of, then, as vectors in three-dimensional space (with weighting factors 1, 3, and 2 for
E, σ, and C3, respectively). There can be no more than three such vectors that are
orthogonal, and so we are left with the result that the number of inequivalent irreducible
representations in a group cannot exceed (and is in fact equal to⁶) the number of classes
in the group. This result, together with the fact that the sum of squares of dimensions of
inequivalent irreducible representations must equal the order of the group, often suffices
to tell us in advance how many representations there are and what their dimensions
are. For our C3v group, the order is six and there are three classes. Hence, we know
that there are three representations and that the squares of their dimensions sum to six.
The problem reduces to: “What three positive integers, squared, sum to six?” There is
only one answer: 1, 1, and 2. The fact that there will always be a totally symmetric 
one-dimensional representation also helps pin down the possibilities. For example, can 
one have a group of order eight and only two classes? There is no way this can happen. 
Two classes would mean two representations. If both were E type, their dimensions 
squared would indeed sum to eight. But one of them must be one-dimensional, and 
there is no way the other can square to seven. 
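
The counting argument above can be checked by brute force. The following Python sketch is an illustration added here, not part of the original development; the function name irrep_dimensions is an assumption chosen for this example. It lists every set of positive integers (one per class, with at least one equal to 1 for the totally symmetric representation) whose squares sum to the order of the group.

```python
from itertools import combinations_with_replacement
from math import isqrt


def irrep_dimensions(order, n_classes):
    """Return all non-decreasing tuples of n_classes positive integers whose
    squares sum to `order` and that contain a one-dimensional representation."""
    solutions = []
    # No dimension can exceed the integer square root of the group order.
    for dims in combinations_with_replacement(range(1, isqrt(order) + 1), n_classes):
        # Tuples are generated in non-decreasing order, so dims[0] is the smallest.
        if dims[0] == 1 and sum(d * d for d in dims) == order:
            solutions.append(dims)
    return solutions


print(irrep_dimensions(6, 3))   # order six, three classes (C3v)
print(irrep_dimensions(8, 2))   # order eight, two classes: no solution
print(irrep_dimensions(8, 5))   # order eight, five classes
```

An empty list signals that no group with that order and number of classes can exist, as argued above for order eight with two classes.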


EXAMPLE 13-7 How many classes of symmetry operation can you count for (planar)
BH3? Can you use this to select one of the possibilities from Example 13-12?

SOLUTION » The classes are E, σh, C3, C2, σv, S3. That's six classes, so there are only six
irreducible representations. There are four one-dimensional and two two-dimensional
representations.


Our collected list of conditions that the characters in a completed table must satisfy 
is as follows: 


1. There must be a one-dimensional representation having all characters equal to +1. 


2. The leading character in each row (i.e., the character for operation E) must equal
the dimension of the representation.


⁶See Bishop [1].


3. The sum of the squares of the leading characters must equal the order of the group.


4. The number of rows in the character table must equal the number of classes in 
the group. 


5. The sum of the absolute squares of the characters in a given row (each times the
weighting factor for its class if the abbreviated form is used) must equal the order of
the group. (Character vectors are normalized.) This results directly from Eq. (13-2).


6. The character vectors are orthogonal (again, using weighting factors, if appropriate). 


This is a fairly large number of restrictions, and may suffice to allow one to produce 
the character table for a group without ever actually producing representations. For 
example, consider the C4v group, which has the operations E, 2C4, C2, 2σv, and 2σd.
The group thus has order eight and five classes. There must be five representations, and 
the only way their dimensions can square to eight is if four of them are one-dimensional 
and one is two-dimensional. This already enables us to write Table 13-20. It is not 


TABLE 13-20 » Partial C4v Character Table

C4v   E   2C4   C2   2σv   2σd
A1    1    1     1    1     1
Γ1    1
Γ2    1
Γ3    1
E     2


difficult to find a way to make Γ1 orthogonal to A1. We simply place −1 in some
places to produce four products of −1 and four of +1. Three possibilities are shown in
Table 13-21. These are orthogonal not only to A1, but to each other as well, and so we


TABLE 13-21 » Partial C4v Character Table

E   2C4   C2   2σv   2σd
1    1     1   −1    −1
1   −1     1    1    −1
1   −1     1   −1     1


have found the characters for Γ1, Γ2, and Γ3. The characters for the E representation
must have squares that sum to eight and also be orthogonal to all four one-dimensional
representation vectors. One possibility is fairly obvious. Since the characters for
operations E and C2 are +1 in all the one-dimensional representations, we could take
the characters for the E representation to be

E   2C4   C2   2σv   2σd
2    0    −2    0     0

Other cases that meet the normality condition are

2   ±1    0   ±1     0
2    0    0   ±1    ±1
2   ±1    0    0    ±1


But none of these is orthogonal to all the one-dimensional sets. Therefore, the complete
character table (except for the basis functions) for the C4v group is shown in Table 13-22,
where the symbols A2, B1, B2 are consistent with symmetry or antisymmetry for C4
and σv, as described in Section 13-9.


TABLE 13-22 » Completed C4v Character Table

C4v   E   2C4   C2   2σv   2σd
A1    1    1     1    1     1
A2    1    1     1   −1    −1
B1    1   −1     1    1    −1
B2    1   −1     1   −1     1
E     2    0    −2    0     0
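
As a cross-check on the reasoning that led to Table 13-22, the Python sketch below (an illustration added here; the names check_table and rejected_e_rows are assumptions) tests the collected conditions on the completed C4v table, using the class sizes 1, 2, 1, 2, 2 as weighting factors. It also confirms that each of the alternative normalized two-dimensional rows fails to be orthogonal to at least one of the one-dimensional rows.

```python
from itertools import combinations

CLASS_SIZES = [1, 2, 1, 2, 2]           # E, 2C4, C2, 2σv, 2σd
ORDER = sum(CLASS_SIZES)                # order of C4v = 8

C4V = {
    "A1": [1,  1,  1,  1,  1],
    "A2": [1,  1,  1, -1, -1],
    "B1": [1, -1,  1,  1, -1],
    "B2": [1, -1,  1, -1,  1],
    "E":  [2,  0, -2,  0,  0],
}


def dot(a, b):
    """Class-weighted scalar product of two character vectors."""
    return sum(g * x * y for g, x, y in zip(CLASS_SIZES, a, b))


def check_table(table):
    """True if the table satisfies conditions 3 through 6 of the list above."""
    rows = list(table.values())
    dims_ok = sum(r[0] ** 2 for r in rows) == ORDER
    count_ok = len(rows) == len(CLASS_SIZES)
    norm_ok = all(dot(r, r) == ORDER for r in rows)
    orth_ok = all(dot(a, b) == 0 for a, b in combinations(rows, 2))
    return dims_ok and count_ok and norm_ok and orth_ok


def rejected_e_rows():
    """Normalized candidates of the three ± patterns that are not orthogonal
    to every one-dimensional row."""
    one_dim = [C4V[name] for name in ("A1", "A2", "B1", "B2")]
    candidates = []
    for a in (1, -1):
        for b in (1, -1):
            candidates += [[2, a, 0, b, 0], [2, 0, 0, a, b], [2, a, 0, 0, b]]
    return [c for c in candidates
            if dot(c, c) == ORDER and any(dot(c, r) != 0 for r in one_dim)]


print(check_table(C4V))
print(len(rejected_e_rows()))   # all twelve sign choices are rejected
```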


13-14 Using Characters to Resolve Reducible Representations


It was pointed out earlier that several irreducible representations can be combined into
a larger-dimensional reducible representation. Our example was

$$\begin{pmatrix} \Gamma_2 & 0 & 0 \\ 0 & \Gamma_1 & 0 \\ 0 & 0 & \Gamma_3 \end{pmatrix}
= \begin{pmatrix} A_2 & 0 & 0 \\ 0 & A_1 & 0 \\ 0 & 0 & E \end{pmatrix} = \Gamma'$$

which is a four-dimensional representation (since Γ3 is two-dimensional). A matrix
built up in this way is symbolized A2 ⊕ A1 ⊕ E. It is easy to see that the characters
of the reducible representation Γ′ are simply the sums of characters for the individual
irreducible component representations (since the diagonal elements of A2, A1, and E
all lie on the diagonal of Γ′). Thus, the characters of Γ′ are

       E   3σ   2C3
Γ′:    4    0    1


Furthermore, no matter how Γ′ is disguised by a similarity transformation, its character
vector is unchanged. Now, suppose you were given the representation Γ′, disguised
through some similarity transformation so as to be nonblock diagonal, and asked to tell
which irreducible representations were present. How could you do it? One way would
be to find the similarity transformation that would return the representation to block
diagonal form. But there is a much simpler way. We can test to see if the character vector
of Γ′ is orthogonal to the character vectors of each of our irreducible representations.
For instance, if Γ′ contained only A2 and E, its character vector would be orthogonal
to that of A1 because the character vectors of A2 and E are orthogonal to that of A1.
If A1 is present in Γ′, then the Γ′ and A1 character vectors are not orthogonal. Indeed, the
amount of A1, A2, or E present can be found by making use of the character vector
normality relation. This is illustrated for Γ′ by the following equations:


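$$\begin{aligned}
A_1:&\quad \tfrac{1}{6}\,(1\cdot 4\cdot 1 + 3\cdot 0\cdot 1 + 2\cdot 1\cdot 1) = 1 \\
A_2:&\quad \tfrac{1}{6}\,(1\cdot 4\cdot 1 + 3\cdot 0\cdot(-1) + 2\cdot 1\cdot 1) = 1 \\
E:&\quad \tfrac{1}{6}\,(1\cdot 4\cdot 2 + 3\cdot 0\cdot 0 + 2\cdot 1\cdot(-1)) = 1
\end{aligned}$$

In general, for Γ′ = c1Γ1 ⊕ c2Γ2 ⊕ ··· ⊕ cnΓn,

$$c_i = \frac{1}{h}\sum_R \chi_i(R)\,\chi'(R) \qquad (13\text{-}7)$$

where h is the order of the group. This technique for resolving a reducible representation
into its component irreducible representations is very useful in quantum chemistry, as
we shall see shortly.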
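
Relation (13-7) is easy to mechanize. The short Python sketch below is an added illustration (the function name reduce_representation is an assumption); it applies the class-weighted form of (13-7) to the character vector 4, 0, 1 of Γ′, using the C3v characters of Table 13-19.

```python
from fractions import Fraction

CLASS_SIZES = [1, 3, 2]                       # E, 3σ, 2C3
C3V = {
    "A1": [1,  1,  1],
    "A2": [1, -1,  1],
    "E":  [2,  0, -1],
}


def reduce_representation(chars, table=C3V, class_sizes=CLASS_SIZES):
    """Return {irrep name: number of times it occurs} for a character vector,
    using c_i = (1/h) * sum over classes of (class size) * chi_i * chi."""
    h = sum(class_sizes)
    return {
        name: Fraction(sum(g * x * y for g, x, y in zip(class_sizes, row, chars)), h)
        for name, row in table.items()
    }


for name, count in reduce_representation([4, 0, 1]).items():
    print(name, count)
```

Exact fractions are used so that a non-integer result, which would signal an invalid character vector, is not hidden by rounding.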


13-15 Identifying Molecular Orbital Symmetries


We stated earlier that any MO must be a basis for an irreducible representation. Given 
a set of computed MOs, how does one decide which representation each MO is a 
basis for? One does this by comparing the MOs with the characters in the character 
table. Some examples will make this clear. In Table 13-23 are extended Hückel data 
for NH3, oriented as shown in Fig. 13-13. NH3 belongs to the C3v group, having
the character table shown in Tables 13-18 and 13-19. The minimal valence basis set
of seven AOs leads to seven MOs. Notice that MOs 5 and 6 and also 2 and 3 are
degenerate. Therefore, these MOs must be bases for two-dimensional representations,
and are assigned the symbol e (lower case for MOs). The other MOs are all given the
main symbol a. There are two possibilities for these MOs: a1 or a2. These differ in
their characters for reflection, a1 being symmetric and a2 antisymmetric. If we look
at the eigenvectors for MOs 1, 4, and 7, we see that they contain the 1s AOs on each
hydrogen with equal sign and magnitude. Since reflection always interchanges two
hydrogens, these MOs are clearly all symmetric for reflection, and so we label them
all a1. Our result, then, is


MO:         1    2    3    4    5    6    7
Symmetry:   a1   e    e    a1   e    e    a1
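
The reflection test described above can be applied directly to the eigenvectors of Table 13-23. The Python sketch below is an added illustration. It assumes that the reflection plane containing H1 is the xz plane (inferred from the fact that H1 mixes only with N(2px) among the in-plane p AOs), so that this reflection changes the sign of 2py and interchanges H2 and H3. The function names are hypothetical.

```python
# AO order: N2s, N2pz, N2px, N2py, H1, H2, H3 (coefficients from Table 13-23).
MOS = {
    1: [1.2946, -0.4369, 0.0, 0.0, -0.7166, -0.7166, -0.7166],
    4: [-0.1715, -0.9628, 0.0, 0.0, 0.0656, 0.0656, 0.0656],
    5: [0.0, 0.0, 0.6498, 0.0, 0.4422, -0.2211, -0.2211],
    6: [0.0, 0.0, 0.0, 0.6498, 0.0, 0.3829, -0.3829],
    7: [0.7387, 0.0214, 0.0, 0.0, 0.1561, 0.1561, 0.1561],
}


def reflect_through_h1_plane(coeffs):
    """Reflection in the (assumed) xz plane: 2py changes sign, H2 and H3 swap."""
    s, pz, px, py, h1, h2, h3 = coeffs
    return [s, pz, px, -py, h1, h3, h2]


def reflection_character(coeffs):
    """+1 if the MO is symmetric, -1 if antisymmetric, None otherwise."""
    image = reflect_through_h1_plane(coeffs)
    if image == coeffs:
        return 1
    if image == [-c for c in coeffs]:
        return -1
    return None


for number, coeffs in MOS.items():
    print(number, reflection_character(coeffs))
```

MOs 1, 4, and 7 are symmetric, consistent with the label a1. Within the degenerate pair 5, 6, one member is symmetric and the other antisymmetric for this particular reflection, so their characters sum to zero, which is the character of E for reflection.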


TABLE 13-23 » Extended Hückel Data for NH3

MO no.   MO energy (a.u.)   MO occupancy
1            0.7494              0
2            0.1279              0
3            0.1279              0
4           −0.4964              2
5           −0.5955              2
6           −0.5955              2
7           −1.0178              2

Eigenvectors

AO\MO        1         2         3         4         5         6         7
N(2s)      1.2946    0.0000    0.0000   −0.1715    0.0000    0.0000    0.7387
N(2pz)    −0.4369    0.0000    0.0000   −0.9628    0.0000    0.0000    0.0214
N(2px)     0.0000    1.0275    0.0000    0.0000    0.6498    0.0000    0.0000
N(2py)     0.0000    0.0000   −1.0275    0.0000    0.0000    0.6498    0.0000
H1(1s)    −0.7166   −1.0399    0.0000    0.0656    0.4422    0.0000    0.1561
H2(1s)    −0.7166    0.5200    0.9006    0.0656   −0.2211    0.3829    0.1561
H3(1s)    −0.7166    0.5200   −0.9006    0.0656   −0.2211   −0.3829    0.1561


Next, consider staggered ethane, which we have earlier assigned to the D3d point
group. Orbital energy levels and sketches of the MOs appear in Fig. 13-14. The
character table for the D3d group is given in Table 13-24.

As before, we observe that certain of the orbitals have the same energies, so we assign
such doubly-degenerate MOs the main symbol e. These MOs are either symmetric or
antisymmetric for inversion and are accordingly subscripted g or u, respectively. All of
the nondegenerate MOs must have the main symbol a, since b does not appear in the
D3d table. The character table indicates that a1 and a2 differ in that they are respectively
symmetric and antisymmetric for two-fold rotations that switch the molecule end for
end. The u, g subscripts again refer to inversion. Inspection of the figures enables
us to decide which symbols are appropriate in each case. The resulting symmetry
designations are included in Fig. 13-14.


Figure 13-13 » Orientation of ammonia with respect to Cartesian axes.

Figure 13-14 » Valence MOs for staggered ethane. The energy level spacings have been altered
for convenience. The hatched areas have a negative sign.


TABLE 13-24 » Characters for the D3d Point Group

D3d   E   2C3   3C2   i   2S6   3σd
A1g   1    1     1    1    1     1
A2g   1    1    −1    1    1    −1
Eg    2   −1     0    2   −1     0
A1u   1    1     1   −1   −1    −1
A2u   1    1    −1   −1   −1     1
Eu    2   −1     0   −2    1     0


13-16 Determining in Which Molecular Orbital an Atomic Orbital Will Appear


Earlier, we noted that the 2pz AO in ammonia will contribute to a1 MOs because
2pz transforms like z, and z is listed as a basis for the a1 representation. If the
basis functions were not listed, we could have reached the same conclusion simply by
taking 2pz and putting it through all the symmetry operations to produce a representation:

$$\begin{aligned}
E\,2p_z &= +1\cdot 2p_z \\
\sigma_i\,2p_z &= +1\cdot 2p_z, \quad i = 1, 2, 3 \\
C_3^{\pm}\,2p_z &= +1\cdot 2p_z
\end{aligned}$$

The representation contains only +1, so 2pz obviously “has” a1 symmetry.

Figure 13-15 »

When we come to the 1s AOs on hydrogens in NH3, we cannot use the list of
basis functions. It includes only coordinates originating on the principal axis, and the
hydrogens are not on that axis. Here we must generate a representation. Let us try
to do this by putting one of the hydrogen atoms, H1, through the various symmetry
operations (also see Fig. 13-15):

$$\begin{aligned}
E\,H_1 &= +1\,H_1 \\
\sigma_1 H_1 &= +1\,H_1 \\
\sigma_2 H_1 &= H_3 \\
\sigma_3 H_1 &= H_2 \\
C_3^{+} H_1 &= H_2 \\
C_3^{-} H_1 &= H_3
\end{aligned}$$

Since some of these operations interchange H1 with H2 or H3, we are not achieving
a one-dimensional representation. We must take all three functions together and work
out a three-dimensional representation. Thus,


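$$E\begin{pmatrix} H_1 \\ H_2 \\ H_3 \end{pmatrix} = \begin{pmatrix} H_1 \\ H_2 \\ H_3 \end{pmatrix},
\qquad E: \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}, \qquad \chi(E) = 3$$

$$\sigma_1\begin{pmatrix} H_1 \\ H_2 \\ H_3 \end{pmatrix} = \begin{pmatrix} H_1 \\ H_3 \\ H_2 \end{pmatrix},
\qquad \sigma_1: \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & 1 & 0 \end{pmatrix}, \qquad \chi(\sigma_1) = 1$$

$$C_3^{+}\begin{pmatrix} H_1 \\ H_2 \\ H_3 \end{pmatrix} = \begin{pmatrix} H_2 \\ H_3 \\ H_1 \end{pmatrix},
\qquad C_3^{+}: \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 1 & 0 & 0 \end{pmatrix}, \qquad \chi(C_3^{+}) = 0$$

and similarly for σ2, σ3, and C3−. But we are only going to use the characters χ, and
we know that σ2 and σ3 must have the same character as σ1 (i.e., 1) and C3− must have
the same character as C3+ (i.e., 0). Our character table for the representation resulting
from the basis of three hydrogen 1s AOs, then, is

        E   3σ   2C3
Γ3H:    3    1    0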


Notice that a “one” on the diagonal of a 3 × 3 representation matrix has the effect
of keeping a hydrogen 1s AO in place. Thus, the E operation keeps all three H's
unmoved, has three “ones” on the diagonal, and has a character of 3. The σ operations
each leave but one hydrogen unmoved, have a single “one” on the diagonal, and have a
character of 1. C3 leaves no hydrogen unmoved and has a character of zero. In general,
the characters of such representations are the numbers of functions not moved by the
various operations. Thus, we could have written down the above characters for Γ3H
without figuring out the representation matrices.

It is evident that Γ3H must be reducible, since it is of higher dimension than anything
in the C3v character table. To resolve Γ3H into its irreducible components, we use the
relation (13-7):

$$\begin{aligned}
A_1:&\quad \tfrac{1}{6}\,(1\cdot 3\cdot 1 + 3\cdot 1\cdot 1 + 2\cdot 0\cdot 1) = 1 \\
A_2:&\quad \tfrac{1}{6}\,(1\cdot 3\cdot 1 + 3\cdot 1\cdot(-1) + 2\cdot 0\cdot 1) = 0 \\
E:&\quad \tfrac{1}{6}\,(1\cdot 3\cdot 2 + 3\cdot 1\cdot 0 + 2\cdot 0\cdot(-1)) = 1
\end{aligned}$$

Therefore, Γ3H = A1 ⊕ E. We conclude from this that hydrogen 1s AOs can appear in
MOs having a1 or e symmetry. Table 13-23 indicates that this is correct.
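
The shortcut of counting unmoved functions can be illustrated with a few lines of Python. This sketch is an addition for illustration only. It assumes the labeling in which the plane σi contains Hi, and it records each operation by listing which hydrogen appears in each position of the transformed column vector; the names OPERATIONS, representation_matrix, and reduce_characters are hypothetical.

```python
from fractions import Fraction

# (class label, images): after the operation, position k of the column vector
# (H1, H2, H3) holds the hydrogen with index images[k] (0-based).
OPERATIONS = {
    "E":   ("E",     (0, 1, 2)),
    "s1":  ("sigma", (0, 2, 1)),
    "s2":  ("sigma", (2, 1, 0)),
    "s3":  ("sigma", (1, 0, 2)),
    "C3+": ("C3",    (1, 2, 0)),
    "C3-": ("C3",    (2, 0, 1)),
}

C3V = {
    "A1": {"E": 1, "sigma": 1,  "C3": 1},
    "A2": {"E": 1, "sigma": -1, "C3": 1},
    "E":  {"E": 2, "sigma": 0,  "C3": -1},
}


def representation_matrix(images):
    """Permutation matrix whose row k has a 1 in column images[k]."""
    n = len(images)
    return [[1 if images[row] == col else 0 for col in range(n)] for row in range(n)]


def character(images):
    """Trace of the matrix, i.e., the number of functions left unmoved."""
    matrix = representation_matrix(images)
    return sum(matrix[k][k] for k in range(len(images)))


def reduce_characters():
    """Apply (13-7) as a sum over all six operations."""
    h = len(OPERATIONS)
    return {
        name: Fraction(sum(row[cls] * character(images)
                           for cls, images in OPERATIONS.values()), h)
        for name, row in C3V.items()
    }


print({op: character(images) for op, (cls, images) in OPERATIONS.items()})
print(reduce_characters())
```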


EXAMPLE 13-8 Given only the MO sketches of Fig. 13-14 (and no energies), is
there an easy way to select the MOs that are degenerate?

SOLUTION » Nondegenerate MOs must be either symmetric or antisymmetric for every symmetry
operation of the molecule. Failure to obey this criterion indicates a degenerate MO. For
instance, the two highest energy MOs in the figure are symmetric or antisymmetric for every
reflection, rotation, improper rotation, or inversion, hence are nondegenerate. The next two MOs,
however, are unsymmetrical for some of the operations. Most easily seen is their unsymmetrical
behavior for rotation by 120° about the C-C axis. These MOs must be degenerate.


13-17 Generating Symmetry Orbitals

Consider the lowest-energy extended Hückel MO for NH3. It is

$$\phi_7 = 0.7387\,N_{2s} + 0.0214\,N_{2p_z} + 0.1561\,1s_1 + 0.1561\,1s_2 + 0.1561\,1s_3$$

We see that the hydrogen AOs all have the same sign and magnitude in this MO.
In fact, we noted above that this must happen in all a1 MOs of NH3 for reasons of
symmetry. If an MO is to be symmetric for all the reflection and rotation operations
of the C3v group, there is no other combination that will be adequate. When we
have several equivalent atoms, and symmetry forces their AOs to appear in MOs in
certain combinations, we refer to those combinations as symmetry orbitals. Thus,
$\phi_a^H = N(1s_1 + 1s_2 + 1s_3)$ is the a1 symmetry orbital for the hydrogens in ammonia
(N is a normalizing constant).

One can use the character table to generate symmetry orbitals. This is done by
picking any one of the AOs, say 1s1, and operating on it with each symmetry operation
times the character for each operation in the representation of interest. The sum of
all these operations is an unnormalized symmetry orbital. Thus, for the a1 representation
in NH3,

$$\begin{aligned}
\phi_a^H &= E\cdot 1\cdot 1s_1 + \sigma_1\cdot 1\cdot 1s_1 + \sigma_2\cdot 1\cdot 1s_1 + \sigma_3\cdot 1\cdot 1s_1 + C_3^{+}\cdot 1\cdot 1s_1 + C_3^{-}\cdot 1\cdot 1s_1 \\
&= 1s_1 + 1s_1 + 1s_3 + 1s_2 + 1s_2 + 1s_3 \\
&= 2(1s_1 + 1s_2 + 1s_3)
\end{aligned}$$

If we normalize (ignoring overlap between 1s AOs on different centers), we
obtain

$$\phi_a^H = \frac{1}{\sqrt{3}}\,(1s_1 + 1s_2 + 1s_3)$$

To generate symmetry orbitals of e symmetry is a little more involved. First we pick
a 1s AO and do just as before, using now the characters for e rather than a1:

$$\begin{aligned}
\phi_e^H &= E\cdot 2\cdot 1s_1 + 0\cdot(\text{all reflections}) + (-1)\cdot C_3^{+}\,1s_1 + (-1)\cdot C_3^{-}\,1s_1 \\
&= 2\cdot 1s_1 - 1s_2 - 1s_3
\end{aligned}$$

Normalization yields

$$\phi_e^H = \frac{1}{\sqrt{6}}\,(2\cdot 1s_1 - 1s_2 - 1s_3)$$


Because e symmetry is manifested by doubly-degenerate MOs, we need to find a mate
for $\phi_e^H$. We can try to do this by repeating the above procedure except operating on
1s2 instead of 1s1. This yields

$$\psi_e^H = \frac{1}{\sqrt{6}}\,(2\cdot 1s_2 - 1s_1 - 1s_3)$$

But this function is not orthogonal to $\phi_e^H$. (The overlap is −1/2.) To achieve orthogonality,
we resort to Schmidt orthogonalization (Chapter 6):

$$\psi_e^{H\prime} = \psi_e^H - S\,\phi_e^H$$

where S is the overlap between $\psi_e^H$ and $\phi_e^H$. Upon expansion and renormalization, this gives

$$\psi_e^{H\prime} = \frac{1}{\sqrt{2}}\,(1s_2 - 1s_3)$$

The data in Table 13-23 show that the e-type MOs do contain the hydrogen 1s AOs in
just the manner prescribed by symmetry. For example, MO 5 has hydrogen 1s coefficients
0.4422, −0.2211, −0.2211, a combination similar to that in $\phi_e^H$. The degenerate
mate, MO 6, has hydrogen coefficients of 0.0000, 0.3829, −0.3829, similar to $\psi_e^{H\prime}$.
Chemists who have had some experience in these matters are likely to prefer thinking
in terms of symmetry orbitals. Thus, in thinking of ammonia, they are likely to take
as a minimal valence basis set the functions: N2s, N2pz, N2px, N2py, (1/√3)(1s1 + 1s2 +
1s3), (1/√6)(2·1s1 − 1s2 − 1s3), (1/√2)(1s2 − 1s3). (We continue to ignore overlap between
1s AOs.) Since they know that it is not possible for bases of different symmetries
to mix (it would produce an MO of mixed symmetry), they know at once that they
can have a1 MOs from mixtures of N2s, N2pz, and (1/√3)(1s1 + 1s2 + 1s3) and e-type
MOs from the remaining functions. In effect, they have used symmetry to partition
their functions into two subsets that do not interact with each other. This means that,
in the MO calculation, the hamiltonian matrix will have no mixing elements between
members of different subsets. This is indicated schematically in Fig. 13-16. They also
know in advance that there will be three MOs of a1 symmetry (since only three basis
functions have that symmetry) and four of e symmetry (two degenerate pairs). It is
interesting to see how strongly symmetry controls the nature of NH3 MOs.
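
The recipe of this section (operate with every symmetry operation, weight by the character, sum, normalize, and Schmidt orthogonalize the mate) can be sketched in Python as follows. This is an added illustration, not the authors' program. It ignores overlap between 1s AOs on different centers, as the text does, and it assumes the labeling in which the plane σi contains Hi; all function names are hypothetical.

```python
import math

# (class label, images): the operation carries 1s_(k+1) into the AO with
# 0-based index images[k].
OPERATIONS = [
    ("E",     (0, 1, 2)),
    ("sigma", (0, 2, 1)),   # σ1
    ("sigma", (2, 1, 0)),   # σ2
    ("sigma", (1, 0, 2)),   # σ3
    ("C3",    (1, 2, 0)),   # C3+
    ("C3",    (2, 0, 1)),   # C3-
]
CHARACTERS = {
    "a1": {"E": 1, "sigma": 1, "C3": 1},
    "e":  {"E": 2, "sigma": 0, "C3": -1},
}


def project(irrep, start):
    """Unnormalized symmetry orbital generated from the AO with index `start`."""
    coeffs = [0, 0, 0]
    for cls, images in OPERATIONS:
        coeffs[images[start]] += CHARACTERS[irrep][cls]
    return coeffs


def overlap(u, v):
    return sum(a * b for a, b in zip(u, v))


def normalize(v):
    norm = math.sqrt(overlap(v, v))
    return [x / norm for x in v]


def schmidt(v, u):
    """Remove from v its component along the normalized vector u, then renormalize."""
    s = overlap(v, u)
    return normalize([a - s * b for a, b in zip(v, u)])


phi_a = normalize(project("a1", 0))
phi_e = normalize(project("e", 0))
psi_e = normalize(project("e", 1))
psi_e_prime = schmidt(psi_e, phi_e)

print(project("a1", 0), project("e", 0), project("e", 1))
print(overlap(phi_e, psi_e))
print(psi_e_prime)
```

Normalizing the hydrogen coefficients of MO 5 in Table 13-23 (0.4422, −0.2211, −0.2211) gives the same 2 : −1 : −1 pattern as phi_e, and those of MO 6 match psi_e_prime.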

Because symmetry orbitals depend on symmetry and not on finer details of molecular
structure, they recur again and again in molecules of similar symmetry. For instance,
the a1 and e combinations of hydrogen coefficients discussed above for ammonia will be
found for hydrogen AOs in staggered or eclipsed ethane (see the drawings in Fig. 13-14,
for instance) and for carbon AOs in MOs for cyclopropenyl (Chapter 8). Because symmetry
requirements transcend the differences between various approximate methods
for solving the Schrödinger equation, we expect these symmetry patterns to appear
in MOs for, say, ammonia, at extended Hückel, CNDO, INDO, MINDO, or ab initio
levels of computation.

Figure 13-16 » Partitioned hamiltonian matrix. [Rows and columns are labeled by the basis
functions, grouped by symmetry as a1 and e; the (a1, a1) block is marked nonzero and the
(a1, e) blocks are marked zero.]


13-18 Hybrid Orbitals and Localized Orbitals


The concept of a hybridized orbital is often encountered in the literature, especially 
in introductory discussions of bonding theory. While this concept is not essential to 
MO theory, it is used enough to justify a brief discussion. 

In the previous section, we showed how one could transform from a basis set of 
STOs to a basis set of symmetry orbitals. Since these two sets are related through 
a unitary transformation, they are equivalent and must lead to the same MOs when 
we do a linear variation calculation. However, there are an infinite number of unitary 
transformations available, and so the set of symmetry orbitals is only one of an infinite 
number of possible equivalent bases. Of course, this set has the unique advantage of 
being a set of bases for representations of the symmetry group, which makes it easy 
to work with. Another set of equivalent basis functions are the hybrid orbitals. These 
have the distinction of being the functions that are concentrated along the directions of 
bonds in the system. Consider, for example, methane, which was discussed in detail in 
Chapter 10. The minimal basis set of valence STOs on carbon can be transformed to 
form four tetrahedrally directed hybrids: 


$$\phi_1 = \tfrac{1}{2}(s + p_x - p_y + p_z), \qquad \phi_2 = \tfrac{1}{2}(s - p_x + p_y + p_z)$$

$$\phi_3 = \tfrac{1}{2}(s - p_x - p_y - p_z), \qquad \phi_4 = \tfrac{1}{2}(s + p_x + p_y - p_z)$$

One of these hybrids, φ4, is shown in Fig. 13-17 and can be seen to point toward one
of the hydrogen atoms. Because the square of each hybrid consists of one part s AO
to three parts p AO, these are called sp³ hybrids (pronounced s-p-three). The reason
for focusing on sp³ hybrids in this case is that they have physical appeal since they
point along the C-H bonds and therefore seem to have a more natural relation to the
electron-pair bond approach of G. N. Lewis. This is deceptive, however, because the
set of four carbon sp³ orbitals is completely equivalent to the set of four carbon STOs.
The sum of squares of the hybrids is spherically symmetric just as is the sum of squares 
of STOs. Thus, even though each hybrid is directed toward a hydrogen, the electron 
density due to all four occupied hybrids is spherically symmetric. Furthermore, after 
the linear variation is performed, the MOs that are produced contain mixtures of hybrids 
to give us the exact same delocalized MOs produced from STOs. No single MO 
consists of just one hybrid and one hydrogen 1s AO, and therefore no single MO can 
be identified with one C-H bond. We conclude then that hybrid orbitals are one of an 
infinite number of choices of basis, that they have an appealing appearance because of 
their concentration in bond regions, but that no concentration of charge in the molecule 
results as a consequence of using hybrids rather than STOs. (Some concentration of 
charge in the bonds does result from overlap between basis functions on carbon and 
those on hydrogens, but this occurs to exactly the same degree for the various equivalent 
basis sets.)

Figure 13-17 » A sketch of the hybrid φ4 = ½(s + px + py − pz). The direction of the hybrid is
coincident with that of a vector having the same x, y, z dependence as φ4 (the vector x + y − z).
The other three hybrids are identical except that they point toward the other three H atoms.
(The coordinate system here is rotated with respect to that used in Chapter 10.)
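
That the four sp³ hybrids are related to the s, px, py, pz STOs by a unitary (here real orthogonal) transformation can be verified with exact arithmetic. The Python sketch below is an added illustration with hypothetical names. The first check confirms that the hybrids are orthonormal; the second confirms that the cross terms cancel in the sum of squares of the hybrids, leaving s² + px² + py² + pz².

```python
from fractions import Fraction

HALF = Fraction(1, 2)
# Rows: φ1..φ4; columns: coefficients of s, px, py, pz.
SIGNS = [(1, 1, -1, 1), (1, -1, 1, 1), (1, -1, -1, -1), (1, 1, 1, -1)]
HYBRIDS = [[HALF * c for c in row] for row in SIGNS]


def gram(rows):
    """Matrix of scalar products between all pairs of rows."""
    return [[sum(a * b for a, b in zip(r1, r2)) for r2 in rows] for r1 in rows]


def transpose(matrix):
    return [list(column) for column in zip(*matrix)]


IDENTITY = [[Fraction(int(i == j)) for j in range(4)] for i in range(4)]

# U U^T = 1: the hybrids are orthonormal combinations of orthonormal AOs.
print(gram(HYBRIDS) == IDENTITY)
# U^T U = 1: entry (a, b) is the coefficient of AO_a * AO_b in the sum of the
# squared hybrids, so only the diagonal terms s^2, px^2, py^2, pz^2 survive.
print(gram(transpose(HYBRIDS)) == IDENTITY)
# Each hybrid is one part s to three parts p.
print([(row[0] ** 2, sum(c ** 2 for c in row[1:])) for row in HYBRIDS])
```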


Another example is planar CH3 (D3h). As before, we can mix our minimal valence
STOs on carbon to produce hybrids pointing toward the hydrogens. If one hydrogen is
on the +y axis, the hybrids are

$$\begin{aligned}
\psi_1 &= \frac{1}{\sqrt{3}}\,s + \frac{\sqrt{2}}{\sqrt{3}}\,p_y \\
\psi_2 &= \frac{1}{\sqrt{3}}\,s + \frac{1}{\sqrt{2}}\,p_x - \frac{1}{\sqrt{6}}\,p_y \\
\psi_3 &= \frac{1}{\sqrt{3}}\,s - \frac{1}{\sqrt{2}}\,p_x - \frac{1}{\sqrt{6}}\,p_y
\end{aligned}$$

Each of these hybrids, when squared, is one part s to two parts p and is called an
sp² hybrid. The coefficients for the p AOs are determined from simple vector considerations:
One merely calculates how x and y vectors must be combined to produce
resultant vectors pointing toward the corners of an equilateral triangle. The resulting
hybridized basis set for CH3 is ψ1, ψ2, ψ3, plus the 2pz STO on carbon and a 1s STO on
each hydrogen. As before, the sum of the squares of ψ1, ψ2, ψ3, and 2pz is spherically
symmetric.

We turn now to localized orbitals. We have been emphasizing that one can subject
basis sets to unitary transformations without making any physical difference. A similar 
rule applies for filled molecular orbitals in a determinantal wavefunction. These too 
can be subjected to unitary transformations without affecting the total energy or total 
electronic distribution for the system. (We have encountered this fact before. See, 
for instance, Appendix 7.) Thus, we have the capability of altering the appearance of 
the individual orbitals in the wavefunction without affecting the wavefunction itself. 
Chemists tend to think of the electrons in molecules as being paired in bond regions, 
lone pairs, and inner shells, but MOs are delocalized and do not reflect this viewpoint. 
But by carrying out unitary transformations, we can attempt to produce orbitals that are 
more localized without sacrificing any of the properties of the overall wavefunction. For
methane, we could mix our four delocalized occupied MOs together to try to produce 
four new orbitals, each one concentrated in a different C-H bond region. One can do 
this, but it is important to realize that these localized orbitals are not eigenfunctions 
of an energy operator, for they have been produced by mixing eigenfunctions having 
different energies. Furthermore, the localization is never complete in any system of 
physical interest. Each localized orbital always contributes at least slightly to charge 
buildup in regions outside that of its primary localization. For instance, a localized 
C-H1 orbital in methane will have small “residual” components at H2, H3, and H4.
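
The invariance of the total electronic distribution under a unitary mixing of occupied orbitals can be demonstrated with a toy model. The Python sketch below is an added illustration only: the two doubly occupied MOs and their coefficients over three orthonormal basis functions are invented for the purpose (they are not methane MOs), and the function names are hypothetical. The density matrix built from the mixed orbitals agrees with that built from the original ones, even though the individual orbitals have changed.

```python
import math

# Two orthonormal, doubly occupied MOs in a basis of three orthonormal
# functions (toy coefficients, assumed for illustration).
MO_A = [1 / math.sqrt(3), 1 / math.sqrt(3), 1 / math.sqrt(3)]
MO_B = [1 / math.sqrt(2), 0.0, -1 / math.sqrt(2)]


def density_matrix(mos, occupancy=2):
    """P[m][n] = occupancy * (sum over occupied MOs of c_m * c_n)."""
    size = len(mos[0])
    return [[occupancy * sum(mo[m] * mo[n] for mo in mos) for n in range(size)]
            for m in range(size)]


def rotate_pair(mo_a, mo_b, theta):
    """Mix two occupied MOs with a 2 x 2 rotation, which is a real unitary matrix."""
    c, s = math.cos(theta), math.sin(theta)
    new_a = [c * a + s * b for a, b in zip(mo_a, mo_b)]
    new_b = [-s * a + c * b for a, b in zip(mo_a, mo_b)]
    return new_a, new_b


def max_difference(p, q):
    return max(abs(x - y) for row_p, row_q in zip(p, q) for x, y in zip(row_p, row_q))


NEW_A, NEW_B = rotate_pair(MO_A, MO_B, math.pi / 4)
P_ORIGINAL = density_matrix([MO_A, MO_B])
P_ROTATED = density_matrix([NEW_A, NEW_B])

print(NEW_A, NEW_B)                              # the orbitals themselves change
print(max_difference(P_ORIGINAL, P_ROTATED))     # the density matrix does not
```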

What we have been discussing in this section are various kinds of equivalent orbitals. 
At the level of basis sets, we have indicated that a minimal valence basis set of STOs 
is equivalent to a set of symmetry orbitals and also to a set of hybrid orbitals (as well 
as an infinite number of other possibilities). At the level of molecular orbitals we 
have indicated that the set of occupied delocalized MOs is equivalent to an infinity of 
transformed sets, some of which will tend to be localized in regions chemists associate 
with bonds, lone pairs, or inner shells. One's choice among the possibilities for basis 
is a matter of taste. However, at the MO level, the delocalized MOs have two features 
that are sometimes advantageous. The first is that their energies are eigenvalues for 
some energy operator (the Fock operator in SCF theory). These are related in a simple 
way to ionization energies and electron affinities via Koopmans' theorem. Hence, delo- 
calized MOs are more appropriate when considering photoelectron spectra, etc. The 
second advantage of delocalized MOs is that they display in a clear way the symmetry 
requirements on the system because they are bases for representations. Hence, these 
MOs are the most appropriate to use when one is using MO phase relations to infer 
the nature of certain intra- or intermolecular interactions. (See Chapter 14 for exam- 
ples.) When delocalized MOs are mixed to form localized orbitals, these energy and 
symmetry features become partially disguised. 


13-19 Symmetry and Integration 


Throughout this book, the usefulness of symmetry to determine whether an integral 
vanishes has been emphasized. It should come as no surprise, therefore, that the formal 
mathematics of symmetry (group theory) is also useful for this purpose.

The basic idea we have used all along is that, if an integrand is antisymmetric for any 
symmetry operation, it must have equal positive and negative regions, which cancel on 
integration. If there is no symmetry operation for which the integrand is antisymmetric, 
then the integral need not vanish. (It still might vanish, but not because of symmetry.) 


The group theoretical equivalent of this is as follows. Suppose that we have an integral
over the integrand f:

$$\int f\,dv = ?$$


The function f is identified as being related to some symmetry point group. (Examples
are given shortly.) We want to know what representation f is a basis for. If f produces
a representation containing A1, then f has some totally symmetric character and the
integral need not vanish. But if f is devoid of A1 character, the integral vanishes by
symmetry since all other representations are antisymmetric for at least one operation.
Our problem, therefore, is to decide which irreducible representations are present in
the representation that is produced by the integrand f.

In quantum chemistry, the integrand of interest is often a product of wavefunctions
(or orbitals) and operators. For example, the hamiltonian matrix H contains integrals
of the form

$$H_{ij} = \int \psi_i^{*}\,\hat{H}\,\psi_j\,dv$$

We know that Ĥ is invariant for any symmetry operation of the group, and so Ĥ has
A1 symmetry. ψi and ψj are assigned symmetries by comparing their behaviors under
various operations with the group character table, as illustrated earlier. Thus, it is fairly
easy to ascertain the symmetries of the various parts of the integrand. The problem is
to determine the symmetry of the product ψi*Ĥψj.

To develop a rule for products, let us consider the simplest case: the one-dimensional
representations. Suppose that ψ1 and ψ2 are bases for one-dimensional representations
Γ1 and Γ2. Then, for some symmetry operation R,

$$R\psi_1 = \chi_1(R)\,\psi_1, \qquad R\psi_2 = \chi_2(R)\,\psi_2$$

where χ is a character (1, −1, ε, or ε*). If we operate on the product ψ1ψ2 with R, we
obtain⁷

$$R\psi_1\psi_2 = (R\psi_1)(R\psi_2) = \chi_1(R)\psi_1\,\chi_2(R)\psi_2 = \chi_1(R)\chi_2(R)\,\psi_1\psi_2$$

That is, the characters for the product ψ1ψ2 are equal to the products of the characters
for ψ1 and ψ2. We have demonstrated the rule for one-dimensional representations,
but it can be proved for higher-dimensional cases as well. In group theory, the product
of two functions, like ψ1ψ2, is referred to as a direct product to distinguish it from a
product of symmetry operations, like σ3C3+. The symbol for a direct product is ⊗.
For the C3v group, the characters for some direct products of bases for irreducible
representations are shown in Table 13-25. The direct product x² has as characters
the product of characters of E times itself. These characters (4, 0, 1) do not agree
with any of the irreducible representation character sets, and so E ⊗ E is reducible.
We can tell, in fact, that E ⊗ E is four-dimensional from the leading character. To
resolve E ⊗ E, we employ the formula (13-7), which gives E ⊗ E = A1 ⊕ A2 ⊕ E, and
fits the observation that E ⊗ E is four-dimensional. The other direct products listed
in Table 13-25 (Rzx, Rz², zRzx, etc.) all give character sets indicative of irreducible
representations. We see that A2 ⊗ E = E, A2 ⊗ A2 = A1, A2 ⊗ A1 ⊗ E = E.


⁷That Rψ1ψ2 = (Rψ1)(Rψ2) is not always obvious to the student, but it should be evident that operating on (say,
reflecting) the function ψ1ψ2 gives the same result as reflecting ψ1 and ψ2 separately and then taking the product.


TABLE 13-25 » Characters for Direct Products of Bases for Irreducible Representations of C3v

C3v             E   3σ   2C3
A1              1    1    1    z                   x² + y², z²
A2              1   −1    1    Rz
E               2    0   −1    (x, y), (Rx, Ry)    (x² − y², xy), (xz, yz)
E ⊗ E           4    0    1    x², y², xy, yx
A2 ⊗ E          2    0   −1    Rzx, Rzy
A2 ⊗ A2         1    1    1    Rz²
A1 ⊗ A2 ⊗ E     2    0   −1    zRzx
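
The rule that characters of a direct product are products of characters is simple to apply by machine. The Python sketch below is an added illustration with hypothetical function names; it forms direct-product characters class by class from the C3v characters and resolves them with the class-weighted form of (13-7).

```python
from fractions import Fraction

CLASS_SIZES = (1, 3, 2)                       # E, 3σ, 2C3
C3V = {
    "A1": (1,  1,  1),
    "A2": (1, -1,  1),
    "E":  (2,  0, -1),
}


def direct_product(*names):
    """Characters of the direct product of the named irreducible representations."""
    chars = [1] * len(CLASS_SIZES)
    for name in names:
        chars = [c * x for c, x in zip(chars, C3V[name])]
    return chars


def decompose(chars):
    """Number of times each irreducible representation occurs, from (13-7)."""
    h = sum(CLASS_SIZES)
    return {
        name: Fraction(sum(g * x * y for g, x, y in zip(CLASS_SIZES, row, chars)), h)
        for name, row in C3V.items()
    }


for factors in (("E", "E"), ("A2", "E"), ("A2", "A2"), ("A1", "A2", "E")):
    chars = direct_product(*factors)
    print(" x ".join(factors), chars, decompose(chars))
```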

There is a general rule that is illustrated by these examples: A direct product of
bases for two irreducible representations contains A1 character if and only if the two
irreducible representations are the same. That is, if fi is a basis for Γi and fj is a basis
for Γj and fi fj is a basis for Γij, where

$$\Gamma_{ij} = \Gamma_i \otimes \Gamma_j = c_1 A_1 \oplus c_2 A_2 \oplus \cdots \oplus c_i \Gamma_i \oplus c_j \Gamma_j \oplus \cdots$$

then c1 ≠ 0 if and only if Γi = Γj.

Another rule is that, if Γi is A1, then Γij = Γj; that is, multiplying a function f2
by a totally symmetric function f1 gives a product with the symmetry of f2.

Now we are in a position to decide whether the integral of a product of functions and 
operators will vanish. For our examples, we will continue to use orbitals, operators, 
and coordinates from the ammonia molecule. Some of these quantities, segregated 
according to symmetry, are given in Table 13-26. 


TABLE 13-26 » Operators and Orbitals for Ammonia Classified by Symmetry

a1                                      e
N2s                                     N2px
N2pz                                    N2py
(1/√3)(1s1 + 1s2 + 1s3)                 (1/√6)(2·1s1 − 1s2 − 1s3)
φ1, φ4, φ7 (MOs; see Table 13-23)       (1/√2)(1s2 − 1s3)
Ĥ                                       φ2, φ3, φ5, φ6 (MOs)


EXAMPLE 13-9 Indicate whether each of the following integrals must vanish due
to symmetry.

1. ∫ N2pz Ĥ N2px dv

2. ∫ N2py Ĥ (1/√2)(1s2 − 1s3) dv

SOLUTION »

1. ∫ N2pz Ĥ N2px dv: The symmetries of the three functions in the integrand are respectively
A1, A1, E. The direct product has symmetry E. There is no A1. The integral vanishes.

2. ∫ N2py Ĥ (1/√2)(1s2 − 1s3) dv: The symmetries are E, A1, E. The direct product is therefore
E ⊗ E ⊗ A1 = A1 ⊕ A2 ⊕ E. Since A1 is present, the integral need not vanish.


These two examples are related to the block diagonalization of the matrix H,
discussed in a previous section. The zero blocks in that matrix correspond to integrals
between functions of different symmetry. Since Ĥ is of A1 symmetry, it has no influence
on the symmetry of the integrand. If ψi and ψj have different symmetries, their
direct product cannot contain A1 symmetry and the integral over ψiĤψj must vanish.
The reader can now understand how the computational procedure guarantees MOs of
“pure” symmetry (i.e., bases of irreducible representations). If two basis functions ψi
and ψj differ in symmetry, there will be a zero value for Hij. A zero Hij means that
mixing ψi and ψj will produce no energy lowering. Hence, the variation procedure
will not mix these functions together in the same MO, so the MO will not be of mixed
symmetry.
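
Deciding whether an integral must vanish requires only the A1 coefficient of the direct product, which by (13-7) is the class-weighted sum of the product characters divided by h. The Python sketch below is an added illustration for C3v with hypothetical function names; the symmetry labels passed in are those of the factors in the integrand.

```python
CLASS_SIZES = (1, 3, 2)                       # E, 3σ, 2C3
C3V = {
    "A1": (1,  1,  1),
    "A2": (1, -1,  1),
    "E":  (2,  0, -1),
}


def number_of_a1(*symmetries):
    """How many times A1 occurs in the direct product of the given symmetries."""
    total = 0
    for k, size in enumerate(CLASS_SIZES):
        product = 1
        for label in symmetries:
            product *= C3V[label][k]
        total += size * product
    return total // sum(CLASS_SIZES)


def must_vanish(*symmetries):
    """True if the integral over the product vanishes by symmetry."""
    return number_of_a1(*symmetries) == 0


print(must_vanish("A1", "A1", "E"))   # first integral of Example 13-9
print(must_vanish("E", "A1", "E"))    # second integral of Example 13-9
print(must_vanish("A1", "A2"))        # functions of different symmetry
```

A result of False means only that symmetry does not force the integral to be zero; as noted above, it still might vanish for other reasons.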


EXAMPLE 13-10 Indicate whether each of the following integrals must vanish due
to symmetry.


1. ∫ φ1 x φ4 dv
2. ∫ φ3 y φ5 dv


SOLUTION »


1. ∫ φ1 x φ4 dv: Integrals of this sort are involved in calculating spectral intensities. The
symmetries are A1 ⊗ E ⊗ A1 = E and the integral vanishes. This means a transition between states
corresponding to an electron going from φ1 to φ4 (or φ4 to φ1) is forbidden for x-polarized light
and an oriented molecule (see Section 12-9).


2. ∫ φ3 y φ5 dv: The symmetry here is E ⊗ E ⊗ E, which gives characters 8, 0, −1. This resolves
into A1 ⊕ A2 ⊕ 3E and the integral need not vanish. Corresponding transitions are “y allowed.”


◀
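The bookkeeping in Examples 13-9 and 13-10 can be checked mechanically. The following is a minimal Python sketch, not part of the original text: it multiplies C3v characters class by class and then counts how often each irreducible representation appears, using the usual reduction formula (sum over classes of class size × character × irreducible character, divided by the group order). The function names and the class labels "E", "C3", and "sigma_v" are illustrative assumptions. Classes are identified by name rather than by position, so the result does not depend on the order in which a particular character table lists them.

```python
from fractions import Fraction

# Number of operations in each class of C3v (group order = 6).
CLASS_SIZES = {"E": 1, "C3": 2, "sigma_v": 3}

# Characters of the three irreducible representations of C3v.
IRREPS = {
    "A1": {"E": 1, "C3": 1, "sigma_v": 1},
    "A2": {"E": 1, "C3": 1, "sigma_v": -1},
    "E": {"E": 2, "C3": -1, "sigma_v": 0},
}


def direct_product(*names):
    """Characters of the direct product of the named irreducible representations."""
    chars = {cls: 1 for cls in CLASS_SIZES}
    for name in names:
        for cls in CLASS_SIZES:
            chars[cls] *= IRREPS[name][cls]
    return chars


def reduce_characters(chars):
    """Number of times each irreducible representation occurs in `chars`."""
    order = sum(CLASS_SIZES.values())
    counts = {}
    for name, irrep in IRREPS.items():
        total = sum(CLASS_SIZES[c] * chars[c] * irrep[c] for c in CLASS_SIZES)
        n = Fraction(total, order)
        if n.denominator != 1:
            raise ValueError("character set is not valid for C3v")
        counts[name] = int(n)
    return counts


def must_vanish(*names):
    """True if the integrand contains no totally symmetric (A1) component."""
    return reduce_characters(direct_product(*names))["A1"] == 0


if __name__ == "__main__":
    print(must_vanish("A1", "A1", "E"))  # Example 13-9, integral 1
    print(must_vanish("E", "A1", "E"))   # Example 13-9, integral 2
    print(reduce_characters(direct_product("E", "E", "E")))  # Example 13-10, integral 2
```

Because all C3v characters are real, no complex conjugation is needed in the reduction formula; a group with complex characters would require it.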


EXAMPLE 13-11 In Chapter 12, we indicated that the π* ← π transition in ethene
is dipole-allowed and polarized parallel to the molecular axis. Verify this from the
character table for ethene, after determining the group symbol.


SOLUTION » Referring to Fig. 13-7, ethene yields the answers “no, no, no, yes” (“no,” so choose
any one; we arbitrarily take C2 along the C–C axis), “no, yes” (go to D branch), “yes,” so D2h. Our
choice of principal axis puts the C–C bond vertical. Using that, we can identify the π MO as B2u
and the π* MO as B3g. The product of characters for B2u B3g = 1 1 −1 −1 −1 −1 1 1, which
can be seen to have B1u symmetry. So ∫ π*(x, y, or z)π dv ≠ 0 only if x, y, or z also has B1u
symmetry. z does, so the transition is allowed and is z-polarized (i.e., it is a parallel transition). ◀
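The character multiplication in Example 13-11 can be reproduced with a short Python sketch. It is an illustration rather than part of the original text. It assumes the standard D2h character table with operations ordered E, C2(z), C2(y), C2(x), i, σ(xy), σ(xz), σ(yz), in which x, y, and z transform as B3u, B2u, and B1u; the function names are hypothetical. Since every irreducible representation of D2h is one-dimensional, any product of them is again a single irreducible representation.

```python
# Standard D2h characters; operation order:
# E, C2(z), C2(y), C2(x), i, sigma(xy), sigma(xz), sigma(yz)
D2H = {
    "Ag":  [1,  1,  1,  1,  1,  1,  1,  1],
    "B1g": [1,  1, -1, -1,  1,  1, -1, -1],
    "B2g": [1, -1,  1, -1,  1, -1,  1, -1],
    "B3g": [1, -1, -1,  1,  1, -1, -1,  1],
    "Au":  [1,  1,  1,  1, -1, -1, -1, -1],
    "B1u": [1,  1, -1, -1, -1, -1,  1,  1],
    "B2u": [1, -1,  1, -1, -1,  1, -1,  1],
    "B3u": [1, -1, -1,  1, -1,  1,  1, -1],
}

# Symmetry species of the Cartesian coordinates in D2h.
COORDS = {"x": "B3u", "y": "B2u", "z": "B1u"}


def product(*names):
    """Label of the direct product of one-dimensional D2h representations."""
    chars = [1] * 8
    for name in names:
        chars = [a * b for a, b in zip(chars, D2H[name])]
    for label, row in D2H.items():
        if row == chars:
            return label
    raise ValueError("product not found in table")


def allowed_polarizations(initial, final):
    """Axes for which the integrand final * coordinate * initial is totally symmetric."""
    return [axis for axis, irrep in COORDS.items()
            if product(final, irrep, initial) == "Ag"]


if __name__ == "__main__":
    print(product("B2u", "B3g"))                 # symmetry of the pi* x pi product
    print(allowed_polarizations("B2u", "B3g"))   # allowed polarization directions
```

The labels B2u and B3g for the π and π* MOs depend on the axis choice made in the example (C–C bond along z); a different axis convention permutes the B labels but not the physical conclusion.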


In this chapter we have seen how formal group theory can be used to characterize 
MO symmetries, construct symmetry orbitals, and indicate whether integrals vanish by 
symmetry. It is true that one can perform MO calculations and get correct results without
explicitly considering symmetry or group theory, since the computational procedures 
satisfy symmetry considerations automatically. But group theory allows a much deeper 
understanding of the constraints that symmetry places on a problem and often leads to 
significant shortcuts in computation. 

A notable feature of group theory is its hierarchy of concepts. At the lowest level are 
the symmetry operations and the basis functions they operate on. At the intermediate 
level are the representations for the group, produced from the basis functions. At the 
highest level are the characters, produced from the representations. The characters 
provide the "handles" that we actually work with, but our interest is often focused on 
the basis functions to which they are related. This tends to lend an air of unreality to 
the use of group theory. An aim of this chapter has been to avoid this feeling of unre- 
ality by dispensing with formal proofs, and instead illustrating relationships through 
investigation of examples. Further insight should come from solving the problems at 
the end of this chapter. 


13-19.A Problems 


13-1. Do the following operations constitute a group? “come 90° to port” (P), “come
90° to starboard” (S), “steady as she goes” (E).


13-2. The text indicates that every element in the group has an inverse if E appears in 
each column of the multiplication table. But E also appears once in each row. 
What does this mean? 


13-3. Consider the group of four operations of the drill soldier (Section 13-2). 


a) To which symmetry point group is this set of four operations isomorphic (i.e., 
which group has the same product relationship)? 

b) Based on the mathematical definition of class and Table 13-1, how many 
classes are there in this group? 

c) Based on your physical intuition about kinds of operation, how many classes 
would you have anticipated for this group? If there is a discrepancy between 
(b) and (c), try to explain it. 


13-4. The C3v (ammonia) group is of order six. This is the same as the number of
ways one can place three hydrogens at the three corners of an equilateral triangle
(3·2·1). When we consider the C4v group, we have 24 ways we can place four
hydrogens at the corners of a square (4·3·2·1). But the C4v group only has
order eight. Explain.
13-5. For each of the molecules (IV)–(VII),


[Structural drawings of molecules (IV)–(VII); (IV) is CH2Cl2 and (V) is B2H6.]


a) list the symmetry elements, 

b) calculate the number of symmetry operations for each element, 

c) obtain the order of the group, 

d) determine the group symmetry symbol, 

e) check your results for (a-c) against the appropriate character table in 
Appendix 11. 


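13-6. a) Demonstrate that U is a unitary matrix.
b) Demonstrate that U†AU is diagonal.


$$U = \begin{pmatrix} 1/\sqrt{2} & 1/\sqrt{2} \\ 1/\sqrt{2} & -1/\sqrt{2} \end{pmatrix}, \qquad A = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$$
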
13-7. Assign the following molecules to point groups, look up their character tables, 
and indicate in each case whether one could expect doubly degenerate MOs. 


a) C6H6 

b) CH2Cl2 

c) B2H6 (see Problem 13-5)
d) Staggered C2H6 

e) Staggered CH3CCl3 


13-8. Consider the planar molecule CO3^2− (VIII). The oxygen atoms are at the corners
of an equilateral triangle.


[Structure (VIII): planar CO3^2−.]


a) What is the point group of this molecule?
b) Using the appropriate character table, assign a symmetry symbol to each of
MOs (IX)–(XII).


[Sketches of MOs (IX)–(XII).]
13-9. Table P13-9 gives the eigenvalues and eigenvectors resulting from an extended
Hückel calculation of the allene molecule (XIII). The molecule is aligned
as shown with respect to Cartesian axes. Ascertain the point group for this
molecule. Using the character table for this group, assign a symmetry symbol
to each MO. Is a transition from the highest occupied MO level to the lowest
unoccupied level allowed by symmetry for any direction of polarization?


[Structure (XIII): allene, drawn with the C=C=C axis along x and the y and z axes perpendicular to it.]


13-10. Consider the water molecule, oriented as shown in Fig. P13-10 with the y axis
perpendicular to the molecular plane and the z axis bisecting the H–O–H angle.


a) Figure out as many nonredundant symmetry operations for this molecule 
as you can, and set up their multiplication table. Ascertain that you have a 
group of operations by checking closure, etc. 

b) Use the functions z, Rz, x, and y as bases to set up a character table for 
this group. 


TABLE P13-9 » Extended Hückel Molecular Orbitals for Allene


                    C1                              C2
MO    Energy
no.   (a.u.)        2s      2pz     2px     2py     2s      2pz     2px
 1     1.7868       0       0       1.58    0      −1.23    0       0.58
 2     1.4916       1.52    0       0       0      −1.01    0       0.57
 3     0.4927      −0.54    0       0       0      −0.51    0      −0.75
 4     0.3681       0       0       0.67    0       0.36    0       0.83
 5     0.3231       0       0       0      −0.34    0       0       0
 6     0.3231       0       0.34    0       0       0      −0.11    0
 7    −0.2619       0       0       0       0.85    0       0       0
 8    −0.2619       0       0.85    0       0       0      −0.78    0
 9    −0.4326       0       0.56    0       0       0       0.67    0
10    −0.4326       0       0       0      −0.56    0       0       0
11    −0.4881       0       0      −0.49    0      −0.08    0       0.41
12    −0.5558       0      −0.16    0       0       0      −0.04    0
13    −0.5558       0       0       0       0.16    0       0       0
14    −0.6221       0.43    0       0       0      −0.14    0      −0.32
15    −0.8102       0       0      −0.13    0      −0.44    0      −0.02
16    −0.9363       0.47    0       0       0       0.36    0      −0.02
Figure P13-10 » Relation of the water molecule to Cartesian axes.


c) Use the resulting table to find out the symmetries of all MOs that can contain 
1s AOs on the hydrogens. 

d) Produce the symmetry combinations of these AOs that can appear in the 
MOs of water. 


13-11. Consider the trans-chlorobromotetraammine cobalt(III) ion (XIV).


a) Find the appropriate symmetry elements for this molecule and set up their 
group multiplication table. (Ignore the hydrogens on the ammonias.) Check 
the multiplication table to be sure all requirements for a mathematical group 
are satisfied. 


Atomic Orbital Coefficients C5 
2py hg h4 2s 2pz 2Px 2p, he h7 
0 0.30 0.30 1.23 0 0.58 0 —0.30  —0.30 
0 0.24 0.24 —1.01 0 —0.57 0 0.24 0.24 
0 0.61 0.61 —0.51 0 0.75 0 0.61 0.61 
0 —0.58 —0.58 —0.36 0 0.83 0 0.58 0.58 
1.27  —0.88 0.88 0 0 0 0.11 0 0 
0 0 0 0 —1.27 0 0 0.88 —0.88 
0.01  —0.21 0.21 0 0 0 —0.78 0 0 
0 0 0 0 0.01 0 0 —0.21 0.21 
0 0 0 0 —0.10 0 0 —0.16 0.16 
0.10 0.16 —0.16 0 0 0 —0.68 0 0 
0 0.17 0.17 0.08 0 0.41 0 —0.17 —0.17 
0 0 0 0 —0.48 0 0 —0.44 0.44 
—0.48  —0.44 0.44 0 0 0 —0.04 0 0 
0 —0.24 —0.24 —0.14 0 0.32 0 —0.24 —0.24 
0 —0.18 —0.18 0.44 0 —0.02 0 0.18 0.18 
0 0.07 0.07 0.36 0 0.02 0 0.07 0.07 
b) Use the following as bases for representations: z, Rz, x, y, x² − y², xy.
Make sure you get all the inequivalent irreducible representations allowed
in the group by checking that Σi [χi(E)]² = h.

c) Set up the character table. Now ascertain which symmetry orbitals contain 
2s orbitals of nitrogen. Give the symmetry combinations of those AOs that 
appear in these symmetry orbitals. 


13-12. Use the relationships that must exist among characters to complete the following 
tables. Include proper symbols for the representations, but do not include bases. 


(a)   E   C2   σv   σv′

(b)   E   2C3   3C2   σh   2S3   3σv


13-13. The D5 group has four classes of operation and has order 10. How many
inequivalent irreducible representations are there and what are their dimensions? 


13-14. Consider the structure shown in Fig. P13-14. 


Figure P13-14 » Square pyramid.


a) Figure out the symmetry elements and operations for this molecule. 

b) What is the group order and number of classes? 

c) How many inequivalent irreducible representations are there and what are 
their dimensions? 

d) Ascertain the group symbol and compare your answers with the character 
table in Appendix 11. 


13-15. A group has the following representations: A1, A2, B1, B2, E1, E2. What is the
group order and how many classes are there? 


13-16. Find the matrices that transform s and p STOs into sp² and sp³ hybrids (Section
13-18). Demonstrate that these are unitary matrices.


13-17. It has been argued (Section 13-18) that sp² hybrid orbitals are appropriate basis
functions for CH3 (D3h). Could one use a basis set of sp³ hybrid orbitals for
this system?
13-18. In each of the following cases, resolve the given character set. If these were
characters of integrands, would the integral vanish by symmetry? For example,


C3v: 4 0 1     A1 ⊕ A2 ⊕ E     No.


a) C4v: 5 −1 1 1 −3
b) D2h: 3 −1 −1 3 −1 3 3 −1
c) D3h: 8 2 0 0 0 0


13-19. Referring to the data in Problem 13-9, which integrals below must vanish by
symmetry?


a) f óexéiodv 

b) f óeyóio dv 

c) f éix$o dv 

d) f ésxéia dv 

e) f dozóa dv 

13-20. Consider the possible electronic excitations of staggered ethane from its occupied
1eg MO to the various empty MOs of symmetry a1g, a2u, eg, and eu. Which
of these are symmetry allowed and how are they polarized?


13-21. Referring to the data in Problem 12-24, which transitions from MO 4 can be
induced by y-polarized light? (Note: the molecular y axis is coincident with 
the symmetry z axis.) 


13-22. The order of a group for a given object is equal to the number of equivalent
locations for any nonspecial point in space about the object. Without referring 
to tables, predict the order of the group for 


a) a square (both faces identical). 
b) a cube (all faces identical). 


Multiple Choice Questions 


(Answer the following questions without referring to text or tables.) 


1. Generate the symmetry operations for water, and choose the true statement from the 
set below. 


a) H2O has two C3 axes.

b) H2O belongs to a group of order three.

c) H2O belongs to a group having exactly three classes.
d) H2O can have only nondegenerate MOs.

e) None of the above statements is true. 


2. Which one of the following statements is true? 


a) All reflections in a group must belong to the same class. 
b) All linear molecules must have an inversion center. 
c) An Sg axis has exactly two nonredundant operations. 
d) Use of a basis set of hybrid orbitals built from a minimal basis set of STOs leads
to a different variationally minimized energy than does use of the minimal basis 
set of STOs without hybridization. 

e) None of the above is a true statement. 


3. Which one of the following is not a symmetry element for eclipsed ethane? 


a) i 

b) S3 
c) Cy 
d) σv
e) σh


4. Which of the following nondegenerate MOs is (are) possible for the CO2 molecule? 


[Sketches of four candidate MOs, numbered 1–4.]

a) 1 only. 

b) All four are possible. 

c) 1, 2, and 3 only. 

d) 2 and 3 only. 

e) 1 and 3 only. 


5. The presence of an S4 axis guarantees the presence of 


a) a point of inversion. 

b) a o; reflection plane. 

c) a C4 axis.

d) a C5 axis. 

e) None of the above is correct. 


6. A molecule having only the E symmetry operation can have


a) only nondegenerate MOs. 

b) MOs of any degeneracy. 

c) an infinitely degenerate level. 
d) only doubly-degenerate MOs. 
e) None of the above is correct. 


7. An MO that is symmetric or antisymmetric for every symmetry operation of a 
molecule 


a) must be degenerate. 

b) cannot be variationally altered. 

c) must be the lowest- or highest-energy MO. 
d) must be nondegenerate. 

e) None of the above is correct. 
8. The character for a reducible representation of C4 on a square planar molecule
equals 


a) 0 
b) 1 
c) 2 
d) —2 
e) 4 


9. Consider the planar molecule-ion NOCI; ] 


The number of reflection planes, σ, three-fold rotation axes, C3, two-fold rotation
axes, C2, and inversion centers, i, that are symmetry elements for this molecule-ion are


σ C3 C2 i
a 1 0 1 0 
a) 2 0 1 0 
b 2 0 1 1 
c) 2 1 2 O 


d) None of the above is correct. 
Chapter 14


Qualitative Molecular Orbital Theory 


14-1 The Need for a Qualitative Theory


Ab initio and semiempirical computational methods have proved extremely useful. But 
also needed is a simple conceptual scheme that enables one to predict the broad out- 
lines of a calculation in advance, or else to rationalize a computed result in a fairly 
simple way. Chemistry requires conceptual schemes, simple enough to carry around 
in one's head, with which new information can be evaluated and related to other infor- 
mation. Such a theory has developed alongside the mathematical methods described 
in earlier chapters. We shall refer to it as qualitative molecular orbital theory (QMOT). 
In this chapter we describe selected aspects of this many-faceted subject and illustrate 
QMOT applications to questions of molecular shape and conformation, and reaction 
stereochemistry. 


14-2 Hierarchy in Molecular Structure
and in Molecular Orbitals 


We seek a simple qualitative approach to the question, “How does the total energy
of a system change as the nuclei move with respect to each other?” This ques-
tion is very broad, encompassing the phenomena of molecular structure and chemical 
reactivities. 

It is useful to distinguish three kinds of process that can occur as nuclei move. One
of these is the process in which two nuclei move closer together or farther apart, with 
their separation being somewhere around a bond length, either at the outset or the 
conclusion of the motion (or both). This process includes the breaking or forming of 
bonds and also the stretching or compressing of bonds. It also includes the forcing 
together of two species that will not bond (e.g., He with He). Let us refer to this as 
a nearest-neighbor interaction, even though the two nuclei need not be bonded in the 
usual chemical sense. The second process is the changing of the bond angle between 
two nuclei bonded to a third. The changing of the H-O-H angle in water is an example. 
A necessary consequence of such a change in angle is a change in distance between 
the two moving nuclei (here H-H). In geometries normally of interest, however, this 
distance is somewhat greater than a typical bond length throughout the entire process. 
We refer to this process as bond-angle change. The third process is the rotation of one 
part of a system with respect to the other about some axis (usually a single bond in the 
system). An example is rotation of one methyl group in ethane with respect to the other. 
We refer to this process as a torsional angle change, or an internal rotation. Such a
change will produce changes in distances between nuclei located on opposite ends of 
the torsional axis, but these distances typically remain several times as great as a bond 
length throughout the entire process. 

Chemists have long recognized that the energies associated with these three kinds of 
change fall into a loose hierarchy, with nearest-neighbor interactions having the great- 
est effect on energy, bond-angle changes having a smaller effect, and torsional angle 
changes the least. Indeed, for this reason spectral transitions corresponding to stretch- 
ing, bending, and torsional modes are found in different regions of the electromagnetic 
spectrum. 

Most nearest-neighbor interactions are a consequence of the connectedness, or 
topology, of a molecule. This aspect of molecular structure, sometimes called the first- 
order, or primary structure, is the first aspect one considers when establishing structure, 
and it is the first aspect that chemists became aware of, historically. The second-order 
aspect of structure concerns the bond angles. Once those are at least roughly known, 
one can go on to consider the third-order, or tertiary structure resulting from torsional 
energetics. The last aspect is usually referred to as the conformation of the system. 

It is advantageous to discuss MOs from a similar viewpoint. If we wish to guess 
the nature of the MOs of a system (i.e., where they have their nodes) and the MO 
energy order, we first consider the topology of the system. (Recall that this is the 
only thing that the simple Hückel method considers.) Elementary arguments lead to a 
fair approximation of the appearance and relative energies of the MOs. Next, we can 
consider bending the system, bending the MOs along with it. By judging whether MO 
energies will rise or fall in this process, we shall show that one can often predict whether 
the molecule will be more stable in the linear or bent form. Finally, when we know 
the second-order structure, we can imagine the various conformational possibilities, 
allowing the MOs to be carried along with the nuclei. Again, by judging how the MO 
energies respond, it is possible to make predictions as to which conformation is most 
stable. 

In order to formulate rules for QMOT, we will return to the H2+ molecule ion and the
H2 molecule. Then, using insights gained there, we shall consider more complicated
systems. 


14-3 H2+ Revisited


In Chapter 7 we used the linear variation method to solve the minimal basis H2+ problem.
However, symmetry conditions alone suffice to force the solutions to be


$$\psi_{\sigma_g} = \frac{1}{\sqrt{2(1+S)}}\,(1s_a + 1s_b), \qquad E_g(\mathrm{el}) = \frac{H_{aa} + H_{ab}}{1+S} \tag{14-1}$$

$$\psi_{\sigma_u} = \frac{1}{\sqrt{2(1-S)}}\,(1s_a - 1s_b), \qquad E_u(\mathrm{el}) = \frac{H_{aa} - H_{ab}}{1-S} \tag{14-2}$$


where Haa and Hab are negative energies, and S is the (positive) overlap integral
between 1s AOs on nuclei a and b. The energies Eg(el) and Eu(el) are upper bounds
for the electronic energy of the σg and σu states. The total energies are obtained by
adding the internuclear repulsion energy 1/R a.u., where R is the internuclear distance
in atomic units.
Figure 14-1 » Steps in compiling total energies for minimal basis LCAO-MO calculation on H2+
at R = 2 a.u.


It is instructive to consider the total energies, Eg(tot) and Eu(tot), as the result of an
artificial step-by-step procedure as indicated in Fig. 14-1. We begin with a 1s AO on
nucleus a at E = −1/2 a.u. If we now bring proton b up to a distance of 2 a.u., but do
not allow the 1s AO to respond, we get an energy lowering due to the increased nuclear
attraction now felt by the electron in 1sa. At R = 2 a.u., this additional attraction lowers
the energy by 0.472 a.u., giving Haa = −0.972 a.u. Let us refer to this as the frozen AO in
molecule energy. (It may be thought of as the energy to first order due to a “perturbation”
caused by the approach of proton b, although this is hardly a small perturbation.) As
our next step, we include Hab. This term equals −0.699 a.u., and it splits the energy
evenly around Haa, as shown in the figure. The resulting energies are the expectation
values for the electronic energies of the unnormalized (due to omission of overlap)
functions 2^(−1/2)(1sa ± 1sb). Two things have happened in this step: The electronic
charge has become delocalized over both centers, and it has changed in amount so that
we no longer have one electron in each MO. Because the overlap term S (equal to 0.586
at R = 2 a.u.) has not yet come into the calculation, we have here the energies due to
1 + S, or 1.586 electron in the σg MO, and 1 − S, or 0.414 electron in the σu MO.

Before proceeding to the next step, let us examine the two energies at this point to
see how they compare for equal amounts of charge. The 0.414 electronic charge in σu is
associated with an energy of −0.273 a.u., whereas the 1.586 charge in σg corresponds
to −1.671 a.u. It is clear that the σg MO is inherently of lower energy (relative to E = 0)
per unit charge, due to the detailed nature of kinetic and nuclear-electronic energies.
(We will not concern ourselves with these detailed aspects, however.)

Our next step is to normalize the charges by dividing by 1 ± S. This essentially adds
0.586 electron to σu and subtracts 0.586 from σg. From the figure, we see that this lowers
the energy of σu and raises that of σg. This is reasonable: Adding more charge to an
MO of negative energy should make its energy contribution more negative; removal of
charge should make it less negative. A very important fact, though, is that σg rises more
than σu goes down. This simply reflects our observation in the preceding paragraph
that σg charge “has lower energy per unit charge,” so removal of it “costs more” than
is gained by putting it into σu. A useful summary of the effect of renormalization is:
Renormalization tends to cancel the energy level changes due to nuclear motion. The
higher an energy level is, the less effective is this cancellation. Therefore, net energy
changes (due to nuclear motion) tend to be greatest for the antibonding member of a
bonding-antibonding pair of levels, and, in general, greater for a higher-energy MO
(bonding or antibonding) than for a lower-energy MO.

Figure 14-2 » Total and electronic energies for bonding and antibonding states of H2+ compared
to appropriate reference energies.


The final step is to add the internuclear repulsion energy of 1/2 a.u. to each level. It
is important to notice that the final total energy levels are related to the initial H atom
energies in almost the identical way that Eg(el) and Eu(el) are related to the “atom-in-molecule”
energy Haa (see Fig. 14-2) because the energy lowering due to the original
1sa electron being attracted by proton b (−0.472 a.u. at R = 2 a.u.) is fairly close in magnitude
to the repulsion between the protons (+0.5 a.u.). We expect this near cancellation
to hold as long as R is large enough that proton b is “outside” the charge cloud due to the
1sa electron. In other words, the total energies should lie above or below the separated-atom
energy in much the same way that the electronic energies lie above or below the
“atom-in-molecule” energy, if the internuclear separation is not too small. [We know
that, as R approaches zero, the internuclear repulsion approaches infinity, whereas the
exact values of Eg(el) and Eu(el), respectively, approach the united atom He+ energies
of −2 a.u. (1s) and −0.5 a.u. (2pσ), and the cancellation ultimately breaks down.]
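The step-by-step construction described for Fig. 14-1 can be retraced numerically. The Python sketch below is an illustration, not part of the original text; it uses only the values quoted in this section for R = 2 a.u. (isolated-atom energy −0.5, extra nuclear attraction −0.472, Hab = −0.699, S = 0.586), and the function and key names are hypothetical.

```python
def h2plus_energy_steps():
    """Retrace the minimal-basis H2+ energy steps at R = 2 a.u. (all values in a.u.)."""
    R = 2.0
    E_1s = -0.5           # 1s AO on the isolated H atom
    attraction = -0.472   # extra attraction of the frozen 1s_a electron to proton b
    H_ab = -0.699         # interaction term quoted in the text
    S = 0.586             # overlap integral at R = 2 a.u.

    # Step 1: frozen "atom-in-molecule" energy.
    H_aa = E_1s + attraction

    # Step 2: split evenly around H_aa (unnormalized; charges 1+S and 1-S).
    unnormalized_g = H_aa + H_ab
    unnormalized_u = H_aa - H_ab

    # Step 3: renormalize to one electron per MO, Eqs. (14-1) and (14-2).
    E_g_el = unnormalized_g / (1 + S)
    E_u_el = unnormalized_u / (1 - S)

    # Step 4: add the internuclear repulsion 1/R.
    V_nn = 1.0 / R
    return {
        "H_aa": H_aa,
        "unnormalized_g": unnormalized_g,
        "unnormalized_u": unnormalized_u,
        "E_g_el": E_g_el,
        "E_u_el": E_u_el,
        "E_g_tot": E_g_el + V_nn,
        "E_u_tot": E_u_el + V_nn,
    }


if __name__ == "__main__":
    for name, value in h2plus_energy_steps().items():
        print(f"{name:>15s} = {value: .4f} a.u.")
```

With these inputs the antibonding level ends up farther above Haa than the bonding level lies below it, and renormalization raises σg by more than it lowers σu, as the text argues.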
From this discussion of H2+, the following ideas emerge:


1. The energy of an MO is lowered by bonding interactions, raised by antibonding 
interactions, relative to the appropriate reference energy. 


2. Antibonding interactions are inherently more destabilizing than bonding interactions 
are stabilizing. 


3. A parallel exists between electronic energy change and total energy change if we 
are careful about reference energies and internuclear distances. 


But H2+ is atypical of the systems we will be interested in treating by QMOT. We
will usually be considering neutral, many-electron systems, and H2+ is a charged, one-
electron system. Therefore, we turn next to the molecule H2 to consider how well the
above ideas carry over to a more typical system.


EXAMPLE 14-1 The simplest MO treatment of H2 would follow an HMO-like
procedure, and assume that two electrons in the 1σg MO should give a total energy
for H2 of 2E(tot, H2+), or 2(−0.554 a.u.) = −1.108 a.u. The correct total energy
for H2 is −1.1745 a.u. (Table 7-4), so the agreement is better than one might expect.
Is this because our procedure is physically correct?


SOLUTION » There are several errors in the procedure. Most obvious is that E(tot) for H2+
includes the repulsion between two protons, so our value for H2 contains double that nuclear
repulsion. But both molecules have a pair of protons. Why, then, does the result agree so well with
the exact value? For one thing, doubling E(tot) for H2+ fails to include interelectronic repulsion,
so we fail to include any interelectronic repulsion but we include too much internuclear repulsion.
In addition, Re for H2 is about 3/4 that for H2+, so Vnn in H2 is really about 4/3 Vnn in H2+. Also,
Vne in these two species differs, both because the protons are not separated by the same distance
and because the electron cloud is swollen by the presence of interelectronic repulsion. Because
the electronic potential energy changes, the kinetic energy changes too. We are observing here
the result of partial cancellation of several errors. ◀


14-4 H2: Comparisons with H2+


We first consider the ramifications of the neutrality and nonpolarity of H2. Imagine that
we have a ground-state hydrogen atom Ha and we allow another similar atom, Hb, to
approach it. At values of R in excess of, say, 2 a.u., we expect the perturbation felt by
Ha to be much smaller than was the case in our H2+ discussion. This is because, where
before we had approach by a charged particle, here we have approach by a neutral atom.
The attraction between the electron on Ha and the proton on Hb is counterbalanced
by repulsion between the electron on Ha and that on Hb. This means that the frozen
atom-in-molecule energy for H2 is fairly close to the energy of the isolated atom.
This simplifies our qualitative treatment for neutral molecules since it means we can
meaningfully compare molecular electronic energies directly with unperturbed atomic
electronic energies.

We now come to a consideration of the orbital energies of H2. Here we will find
that there are important differences between a one-electron system like H2+ and multi-
electronic systems. Consider the lowest (1σg) MO of H2. According to the description
of ab initio theory in Chapter 11, the “energy of” this MO at some internuclear separation
R [call it E_1σg(R)] is equal to the energy of an electron in this MO (i.e., the
orbital energy is equal to the one-electron energy). This energy results from kinetic
and nuclear attraction components and from repulsion for the other electron. But where
is this other electron? If we are considering the ground state of H2, it is in the 1σg
MO also. That gives us an energy we might call E_1σg(R, 1σg). However, we might
instead be considering an excited state of H2, perhaps with the configuration 1σg1σu.
Depending on whether we choose the symmetric or antisymmetric combination for the
spatial part of the wavefunction, we shall be considering an excited singlet or triplet
state. In each case, repulsion for the 1σg electron will be different. Thus, we already
have three orbital energies for this lowest MO, and we could continue getting more by
considering other excited states of H2. Clearly, the orbital energies in a multielectronic
molecule are dependent on the state being considered.

Another important feature of one-electron energies in multielectronic systems is that
they do not add up to the total electronic energy. That is, 2E_1σg(R, 1σg) is not equal
to the electronic energy of H2 at R. As pointed out in Chapter 11, this is because
E_1σg(R, 1σg) includes the interelectronic repulsion, so 2E_1σg(R, 1σg) counts the
interelectronic repulsion twice.
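The double counting can be written out explicitly for the closed-shell ground state of H2. The short derivation below is a sketch in assumed notation, not taken from this section: h denotes the kinetic plus nuclear-attraction energy of one electron in the 1σg MO, and J the Coulomb repulsion between the two 1σg electrons, both evaluated at the same R.

```latex
\begin{align}
  E_{1\sigma_g}(R, 1\sigma_g) &= h + J
    && \text{(orbital energy: one electron, repelled by the other)} \\
  E_{\mathrm{el}}(R) &= 2h + J
    && \text{(two electrons, one repulsion)} \\
  2E_{1\sigma_g}(R, 1\sigma_g) &= 2h + 2J = E_{\mathrm{el}}(R) + J
    && \text{(repulsion counted twice)}
\end{align}
```

The sum of the two orbital energies therefore lies above the electronic energy by exactly one extra repulsion term J.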

One might imagine that, since MO energies in multielectronic systems are state- 
dependent and cannot be simply summed to give the electronic energy, it is hopeless 
to use them as a basis for explanation or prediction. However, this would be too 
pessimistic. It turns out that the qualitative features of MO energies are not all that 
sensitive to change of state (see Fig. 14-3) and, besides, we are usually concerned with 
the ground state or with excited states wherein most of the electrons remain in orbitals 
occupied in the ground state. Also, the extra measure of interelectronic repulsion that is 
Figure 14-3 » Orbital energies as function of internuclear separation for He2. The two curves for
€1o, are qualitatively similar, although far from identical. (From Yarkony and Schaefer [1].) 
included in the sum of one-electron energies can sometimes be expected to compensate
roughly for the as yet unincluded internuclear repulsion energy. This rough equality 
requires that our system be neutral (so that the total number of repelling negative charges 
is equal to the total amount of positive charge) and nonpolar (so that the loci of charge 
are similar). We expect the equality to be quite good at large internuclear separation, 
where the electrons of each atom repel those of others as though they were centered 
on the nuclei, and to become progressively poorer as the atoms get closer together and 
undergo interpenetration of charge clouds. If we accept this rough equality, then we 
are permitted to approximate the change in total energy of the system (when the nuclei 
move), as equal to the change in the sum of one-electron energies. 


14-5 Rules for Qualitative Molecular Orbital Theory


The considerations discussed in the two preceding sections provide a rather loose jus- 
tification for the following QMOT rules related to total energy changes when nuclei 
are moved. 


1. An increase in overlap population between two AOs tends to lower the energy of an 
MO; a decrease tends to raise it. 


2. The effects of a given amount of orbital overlap population change on orbital energy 
are much more pronounced for higher-energy MOs. A corollary is: The destabi- 
lizing effect of an antibonding relation between two AOs tends to be greater than 
the stabilizing effect of a bonding relation, other factors (AO identities, distances) 
being equal. 


3. The sum of changes of one-electron energies should approximately equal the total 
energy change if we are treating a neutral, nonpolar system and if the nuclei that 
are moving with respect to each other are separated by a distance of a normal bond 
length or more. 


Rule (2) and its corollary are due to the different effect of renormalization on orbitals 
of different energy. Note that rule three refers only to energy changes. We do not expect 
the sum of one-electron energies to equal the total energy. This becomes obvious in 
the limit of infinite separation of atoms (e.g., in CO2) in which there is no internuclear 
repulsion, but interelectronic repulsion within each atom is still being counted twice. 

These rules are normally applied only to MOs made from valence-shell AOs. Inner- 
shell electrons are only very weakly perturbed in normal nuclear motions. The resulting 
small changes, while detectable and useful for analytical purposes, are inconsequential 
compared with valence electron effects. 


14-6 Application of QMOT Rules to Homonuclear
Diatomic Molecules


If we apply our rules from the preceding section to the motion of a pair of iden-
tical nuclei, each carrying a 1s AO, we obtain the orbital energy versus R curve
shown in Fig. 14-4. The σg MO energy is predicted to drop as overlap population
increases, the σu to rise as out-of-phase overlap increases, and σu rises faster than
σg drops. In QMOT, we argue from this figure that one-, two-, or three-electron
homonuclear diatomic molecules should be stable (H2⁺, H2, H2⁻, He2⁺) and that
four-electron molecules (He2) should be unstable. Since a similar figure holds for
molecules in which the valence orbital is 2s, 3s, etc., we might also expect systems
like Li2⁺, Li2, Li2⁻, Be2⁺, Na2⁺, Na2, Na2⁻, Mg2⁺ to exist, but not Be2 or Mg2. This set
of predictions is a reasonable starting point, and in many cases agrees with observation
(see e.g., Table 7-2). By the time we get to the Na-Mg series, however, the energy
difference between 3s and 3p AOs is so small that it is unrealistic to treat these as
“pure” 3s cases. Consequently, we must put less faith in simple predictions for those
molecules. The comparison between our QMOT guess about these molecules and
experimental or ab initio observation is shown in Table 14-1.


¹For comments on this point, see Ruedenberg [2].


Figure 14-4 » Energy versus internuclear separation R for two nuclei with 1s AOs, derived from
QMOT rules.

The reader may have noticed that we have violated one of our QMOT conditions by 
considering nonneutral systems. One is presumably less safe in including such systems, 
although several of them have been studied and found to fit the QMOT prediction. 
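
The electron-counting argument for the s-only diatomics can be written out as a short sketch. It assumes one valence s AO per atom, so that only a bonding σg and an antibonding σu level are available, and it reduces the QMOT argument to the sign of the net number of bonding electrons. The function names are hypothetical, and the electron counts in the dictionary are simple valence counts supplied for illustration.

```python
# QMOT stability guess for homonuclear diatomics built from one valence s AO
# per atom. sigma_g (bonding) fills first, then sigma_u (antibonding).

def net_bonding_electrons(n_valence: int) -> int:
    """Bonding minus antibonding electrons for 0 to 4 valence electrons."""
    if not 0 <= n_valence <= 4:
        raise ValueError("an s-only diatomic holds 0 to 4 valence electrons")
    n_bonding = min(n_valence, 2)
    n_antibonding = n_valence - n_bonding
    return n_bonding - n_antibonding


def qmot_predicts_stable(n_valence: int) -> bool:
    """Stable when bonding electrons outnumber antibonding electrons."""
    return net_bonding_electrons(n_valence) > 0


# Valence electron counts (illustrative bookkeeping only).
EXAMPLES = {"H2+": 1, "H2": 2, "H2-": 3, "He2+": 3, "He2": 4, "Li2": 2, "Be2": 4}

if __name__ == "__main__":
    for name, n in EXAMPLES.items():
        verdict = "stable" if qmot_predicts_stable(n) else "not stable"
        print(f"{name:5s} {n} valence electron(s): {verdict}")
```

Because σu rises faster than σg drops, a net count of zero corresponds to a repulsive curve rather than a neutral one, which is why the four-electron case is classed as not stable.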


TABLE 14-1 » Stability of Some Homonuclear Diatomics as Predicted by QMOT and as
Observed from Experiment or ab initio Calculation


                        Experimental                            Experimental
Molecule   QMOT         or ab initio     Molecule   QMOT         or ab initio
H2⁺        Stable       Stable           Be2⁺       Stable       Stable
H2         Stable       Stable           Be2        Not Stable   Not Stable
H2⁻        Stable       Stableᵃ          Na2⁺       Stable       Stable
He2⁺       Stable       Stable           Na2        Stable       Stable
He2        Not Stable   Not Stable       Na2⁻       Stable       Ms
Li2⁺       Stable       Stable           Mg2⁺       Stable       ?
Li2        Stable       Stable           Mg2        Not Stable   Stable (?)
Li2⁻       Stable       Stable


ᵃThis molecule-ion is unstable with respect to losing an electron and forming the neutral molecule at its lowest
energy, but it is stable with respect to dissociation into a neutral atom and a negative ion in their ground states.

Apparently, if one is concerned only with the question of the presence or absence of
a valley in the total energy curve (and not with relative depths of valleys or relative
Re values), deviation from neutrality by one electron is not too damaging (as we saw
for H2⁺). However, for doubly positive diatomics, like He2²⁺, the internuclear repulsion
dominates so that there is no stable species.

One can add p-type AOs to the two nuclei and expand the orbital energy plot, as 
illustrated in Fig. 14-5. Filling in the orbital levels with electrons leads to the prediction 
that B2 is singly bonded, C2 is doubly bonded, N2 triply bonded, O2 doubly bonded,
F2 singly bonded, and Ne2 not bonded. These systems, with their singly charged
relatives, have been described in Chapter 7, and we will say no more about them here 
except that the agreement between QMOT rules and observations is quite respectable 
(see Fig. 14-6). 
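
The bond multiplicities just quoted follow from simple bookkeeping, sketched below. The level list and its ordering are an assumption made for illustration (the relative order of the two bonding 2p-derived levels does not affect these particular counts), and the names are hypothetical.

```python
# Fill homonuclear diatomic valence MOs and count net bonds.
# Each entry: (label, capacity, +1 for bonding or -1 for antibonding).
LEVELS = [
    ("sigma_g(2s)", 2, +1),
    ("sigma_u(2s)", 2, -1),
    ("pi_u(2p)", 4, +1),
    ("sigma_g(2p)", 2, +1),
    ("pi_g(2p)", 4, -1),
    ("sigma_u(2p)", 2, -1),
]


def bond_order(n_valence: int) -> float:
    """Half the difference between bonding and antibonding electron counts."""
    if not 0 <= n_valence <= 16:
        raise ValueError("this level scheme holds 0 to 16 valence electrons")
    remaining = n_valence
    net = 0
    for _label, capacity, sign in LEVELS:
        placed = min(remaining, capacity)
        net += sign * placed
        remaining -= placed
    return net / 2


# Valence electron counts for the neutral second-row diatomics.
VALENCE = {"B2": 6, "C2": 8, "N2": 10, "O2": 12, "F2": 14, "Ne2": 16}

if __name__ == "__main__":
    for name, n in VALENCE.items():
        print(f"{name:4s} bond order {bond_order(n):.1f}")
```

Odd electron counts give half-integer values, which is how the singly charged relatives fit into the same scheme.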

An important distinction exists between the energy versus R curves drawn in 
Fig. 14-5 and those in Fig. 7-17. The latter set was constructed by sketching orbital 
energies for the united-atom and separated-atom limits, and linking these energies 
together using symmetry agreement and the noncrossing rule. The former set was con- 
structed by sketching orbital energies for the separated atoms, and then using QMOT 
rules to decide which curves go up, and which go down in energy as R decreases. 


Figure 14-5 » Qualitative sketches of homonuclear diatomic MO energies as a function of R based
on QMOT rules.


Figure 14-6 » De versus aufbau sequence for homonuclear diatomic molecules and ions. The
tendency for systems having more net bonding electrons to have a greater De is adhered to fairly well
(see Table 7-2 for data). [Plot: De (eV) for the species H2⁺ through Ne2 in aufbau order, labeled by
number of net bonding electrons.]


When we wish to emphasize this distinction, we will refer to these as two-sided and 
one-sided correlation diagrams, respectively. 

According to the simple QMOT viewpoint, we should expect a bonding MO to be 
lower in energy than the AOs that comprise it. Likewise, an anti-bonding MO should 
be higher in energy. We can test this by comparing first ionization energies (IEs) for 
molecules against those for the constituent atoms. (Recall that Koopmans' theorem 
equates orbital energy to ionization energy.) If the highest occupied MO (HOMO) of 
the molecule is bonding, the IE for the molecule should be greater than that for the 
atom. If it is antibonding, the IE of the molecule should be smaller. Strictly speaking, 
the HOMO of the molecule may consist of mixtures of AOs on each atom. In that 
case, we should compute a “valence state ionization energy” for the atom. Also, for
heteronuclear molecules AB, we need to know what percentage of the HOMO to identify
with each atom in order to obtain a suitable average atomic IE to compare with the IE 
of the molecule. Even if we ignore these corrections, however, and simply compare 
experimental IEs, the anticipated behavior is shown nicely by diatomic molecules, as 
indicated in Table 14-2. 
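
The comparison described above is easy to express as a check. The sketch below uses rounded, commonly tabulated first ionization energies supplied here for illustration; they are not copied from Table 14-2, and the HOMO assignments are the usual ones from the diatomic MO filling order. The function name is hypothetical.

```python
# (molecular IE in eV, atomic IE in eV, HOMO character).
# Rounded literature values supplied for illustration only.
DATA = {
    "H2": (15.4, 13.6, "bonding"),
    "N2": (15.6, 14.5, "bonding"),
    "O2": (12.1, 13.6, "antibonding"),
    "F2": (15.7, 17.4, "antibonding"),
}


def consistent_with_qmot(ie_molecule: float, ie_atom: float, homo: str) -> bool:
    """Bonding HOMO: molecule harder to ionize than atom; antibonding: easier."""
    if homo == "bonding":
        return ie_molecule > ie_atom
    if homo == "antibonding":
        return ie_molecule < ie_atom
    raise ValueError("homo must be 'bonding' or 'antibonding'")


if __name__ == "__main__":
    for name, (ie_mol, ie_atom, homo) in DATA.items():
        ok = consistent_with_qmot(ie_mol, ie_atom, homo)
        print(f"{name}: {homo} HOMO, IE {ie_mol} vs {ie_atom} eV, consistent={ok}")
```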

There is much similarity between the QMOT rules and the assumptions inherent in 
the extended Hückel method described in Chapter 10. There, also, a bonding interaction 
is equated with energy lowering and an antibonding interaction with an energy rise. 
Furthermore, the sum of orbital energies is assumed to change in parallel with the 
total energy change, even though the internuclear repulsion is not included. We noted 
that this assumption limits the range of validity for EHMO calculations to relative 
nuclear motions at distances that are on the order of a bond length or greater. The 
EHMO method is, in essence, the numerical equivalent to the qualitative MO approach, 
and such calculations can serve as a guide for developing qualitative explanations in 
complicated situations or for producing numbers that enable one to compare QMOT 
with experimental or ab initio results.


14-7 Shapes of Polyatomic Molecules: Walsh Diagrams


In this section we will describe how the rules and concepts of QMOT enable one to
rationalize and predict molecular shapes. The earliest systematic treatment of this problem was
given by Walsh,² whose approach has been extended by others, particularly Gimarc [6].³

We begin by considering the symmetric triatomic class of molecules HAH, where 
A is any atom. Such molecules can be linear or bent. Walsh's approach predicts which 
are linear, which are bent, and sometimes which of two bent molecules is more bent. 

We approach the problem in the following way. First we sketch the valence MOs 
for the generalized linear molecule HAH, deciding which is lowest, second lowest, etc. 
in energy. Then we imagine bending the molecule and argue whether each MO should 
go up or down in energy on the basis of our QMOT rules. This produces a chart of 
orbital energies versus bond angle—a one-sided correlation diagram. Finally, we use 
this diagram to argue that HAH will be linear or bent, depending on how many valence 
electrons HAH has and on how they are distributed among the MOs. 

The first problem is to sketch the MOs for linear HAH and decide their energy order. 
Probably the simplest way to do this is through use of symmetry and perturbation
arguments. We know that the linear molecule belongs to the D∞h point group, possessing
a center of inversion and a reflection plane through the central atom and perpendicular
to the HAH axis. This means that the two hydrogen 1s AOs (1s1 and 1s2) will appear in
MOs in the symmetry combinations φg = 1s1 + 1s2, φu = 1s1 − 1s2, where g and u stand
for gerade and ungerade, respectively (see Chapters 7 and 13 for a background
discussion). Recognizing this, we can next consider which MOs will result from interactions
between the valence AOs on atom A and the symmetry orbitals φg and φu. A perturbation-
type diagram for this appears in Fig. 14-7. We have assumed that only the valence
s and p AOs on A are involved in bonding. Extension to include d AOs is possible. 

On the right side of Fig. 14-7, the symmetry orbitals φg and φu are shown to be
slightly split. This reflects the greater stability of the in-phase, or bonding, combination.
However, the splitting is slight because the hydrogen atoms are quite far apart (so that
atom A can fit between them). These two levels sandwich the separated-atom limit of
−1/2 a.u. or −13.6 eV.

The AOs of atom A are sketched on the left. Their energies are arranged so
that the 2p energies are about the same as the φg, φu symmetry orbital energies on
the right. (For example, AO energies used for nitrogen in EHMO calculations are:
2s ≈ −25 eV, 2p ≈ −13 eV.) For the linear molecule, we can label the s and p AOs on
atom A as σ or π and g or u.

To generate the MO energy-level pattern from the interactions between these AOs
on A and φg, φu, we use the following rules from perturbation theory (see Chapter 12):


1. Interactions occur only between orbitals of identical symmetry. 


2. Interactions lead to larger splittings if the interacting orbitals are closer in energy 
(overlap considerations being equal). 
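
Both rules can be illustrated with a two-level model. The sketch below assumes an orthonormal pair of orbitals with energies e_a and e_b and a fixed interaction element that is set to zero when the symmetry labels differ. The numbers (a coupling of −2 eV and levels near −25 and −13 eV) are illustrative assumptions, and the function names are hypothetical.

```python
import math


def two_level_energies(e_a: float, e_b: float, coupling: float):
    """Eigenvalues of the 2x2 matrix [[e_a, V], [V, e_b]] (overlap neglected)."""
    mean = 0.5 * (e_a + e_b)
    root = math.hypot(0.5 * (e_a - e_b), coupling)
    return mean - root, mean + root


def level_shift(e_a: float, sym_a: str, e_b: float, sym_b: str,
                coupling: float = -2.0) -> float:
    """How far the lower level is pushed down by the interaction (>= 0)."""
    v = coupling if sym_a == sym_b else 0.0   # rule 1: symmetry must match
    lower, _upper = two_level_energies(e_a, e_b, v)
    return min(e_a, e_b) - lower


if __name__ == "__main__":
    # Rule 2: same coupling, different energy separations.
    print("degenerate pair:   ", round(level_shift(-13.0, "sg", -13.0, "sg"), 3))
    print("12 eV apart:       ", round(level_shift(-25.0, "sg", -13.0, "sg"), 3))
    # Rule 1: different symmetry, no interaction at all.
    print("different symmetry:", round(level_shift(-13.0, "pu", -13.0, "sg"), 3))
```

With the same coupling, the degenerate pair is shifted by the full magnitude of the coupling, whereas the pair separated by 12 eV is shifted by only a few tenths of an eV.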


The resulting energy levels appear in the central column of Fig. 14-7. 
We need sketches of the MOs whose energy level pattern we have just approximated.
We can guess the qualitative appearance of these by recalling that, when two orbitals
interact to give splitting, the lower energy corresponds to a bonding interaction, the
higher energy to antibonding. Thus, for example, Fig. 14-7 indicates that the 1σg MO
is a bonding combination of the 2s AO on A and the σg symmetry orbital, whereas 2σg
is the antibonding combination. The πu MOs are simply the pπ AOs on A, since there
is nothing of the same symmetry for them to interact with. The six MOs for the HAH
molecule are sketched in Fig. 14-8a.


²See Walsh [4] and the papers immediately following. See also Mulliken [5].
³For a critical review of the theoretical validity of Walsh's method, see Buenker and Peyerimhoff [7].


Figure 14-7 » Orbital energies of MOs formed by interaction of σu and σg symmetry orbitals on
H1 ... H2 with valence s and p AOs on central atom A. The π AOs and MOs are degenerate but are
sketched as slightly split.

Before proceeding, notice that the lowest two MOs are A-H bonding, the next two
are nonbonding, the highest two are antibonding. This is an example of the way in
which topological, or nearest-neighbor, interactions govern the gross features of energy
ordering. In fact, we could have generated this same set of MOs and energy order by
simply sketching all of the MOs we could think of that were symmetric or antisymmetric
for relevant symmetry operations and then putting the bonding ones lowest (with s lower
than p), nonbonding next, and antibonding highest (again recognizing that greater s
character should yield lower energy). (Note that the nonbonding MOs are not symmetric
or antisymmetric for arbitrary rotations about the C∞ axis. Hence, they must form a
basis for a representation of dimension greater than one. Hence, they are degenerate.
See Chapter 13 for detailed discussion.) Electrons in the two lowest MOs produce A-H
bonding. Electrons in the next two MOs have little effect on A-H bonding and, in fact,
constitute what a chemist normally thinks of as lone pairs. Electrons in the two highest
MOs tend to weaken the A-H bonds. We normally do not worry about questions of
shape for systems where these highest two MOs are filled because such an HAH system
is not even bonded (i.e., we do not worry about second-order structure if there is no
stable first-order structure). Consequently, these two highest MOs will be omitted in
many energy-versus-angle diagrams, although there are certain cases where they can
be useful (e.g., in singly excited configurations) (see Problem 14-5).


Figure 14-8 » The Walsh-type correlation diagram for HAH: (a) linear D∞h; (b) bent C2v. The
cross-hatched parts of MOs have opposite sign from open parts.


EXAMPLE 14-2 Referring to Fig. 14.8, give the QMOT rationale for the MOs in
the lower half of the figure having energies below those in the upper half. What is
the rationale for the lowest (a1g) MO being at lower energy than the second-lowest
(a2u) MO, and why are these both below the next (eu) MOs?


SOLUTION » The lowest-energy half of the set are all C-H bonding MOs, the other half all
being C-H antibonding. Because there are so many C-H bonds, their nature determines this
separation. Within the C-H bonding set, the lowest-energy pair differ from the others in having 2s
AOs on carbon, which are sufficiently lower in energy than 2p AOs to give them lower energy. The
a1g MO is C-C bonding, hence has lower energy than the a2u MO, which is C-C antibonding.


We now consider how the MO energies change upon bending the molecule. As
1σg is bent, the two hydrogen AOs move closer together. Because they have the same
phase, this leads to an overlap increase, but it occurs over a fairly long distance, being
a second-nearest neighbor interaction. Therefore, there is an energy lowering, but it
is not very large. Bending 1σu leads to two changes. First, the 1s AOs move away
from the axis of maximum concentration of the 2p AO. This causes a substantial loss
of overlap and a substantial increase in energy. Second, the 1s AOs move closer to
each other. They disagree in phase, and this also tends to increase the energy, although
it is a relatively small effect because it occurs between second-nearest neighbors. In
the linear molecule, the 1πux and 1πuy MOs contain no contribution from hydrogen 1s
because the hydrogens are in a nodal plane in each case. If we imagine that, in bending
the molecule, we keep the hydrogens in the xz plane, then we see that the hydrogens
are remaining in the nodal plane for 1πuy but have moved away from the nodal plane
of 1πux. Once this happens, the 1s AOs are no longer forbidden by symmetry from
contributing to the πux MO, and a “growing in” of 1s AOs occurs, leading to an MO like
the 2a1 MO drawn in Fig. 14-8. This behavior is more complicated than we observed
for the lower-energy MOs because it involves more than a mere distortion of an existing
MO. With a little experience, this additional complication is easily predicted (or, one
can do an EHMO calculation and “peek” at the answer by sketching out the MOs
contained in the output; see Problem 14-6). The effect on the energy of this 2a1 orbital
is quite large because several things happen, all of which are energy lowering:


1. For a given amount of 1s AO, the bonding overlap with 2p increases with bending. 


2. For a given amount of 1s AO, the bonding overlap between the two hydrogens 
increases with bending. 


3. Since the amount of 1s AO present is not constant, but increases with bending (from 
zero in the linear configuration) the rate of energy lowering due to (1) and (2) is 
further augmented. 


Also, this is a fairly high-energy MO, and QMOT rule (2) tells us to expect such MOs
to respond more dramatically to overlap changes. Finally, the overlap of a 1s AO with
a 2p AO on another nucleus varies as cos θ, where θ is zero when the 2p AO points
directly at the 1s AO. This means that the rate of change of overlap with angle is much
greater around θ = 90° than at θ = 0° (see Problem 14-8). This is another reason for
thinking that 1πux → 2a1 will drop in energy much faster than 1σu → 1b2 will rise.
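
The angular argument can be checked with a few lines of arithmetic. The sketch below takes the relative 1s-2p overlap to be cos θ, as stated above, and compares the overlap lost when a hydrogen moves a small angle away from the 2p axis (the 1σu case, starting at θ = 0°) with the overlap gained when it moves the same angle out of the nodal plane (the 1πux case, starting at θ = 90°). The 10° displacement and the function names are illustrative assumptions.

```python
import math


def relative_overlap(theta_deg: float) -> float:
    """1s-2p overlap relative to its maximum, taken to vary as cos(theta)."""
    return math.cos(math.radians(theta_deg))


def overlap_lost_from_axis(delta_deg: float) -> float:
    """Overlap lost when the 1s AO moves delta away from the 2p axis (theta = 0)."""
    return relative_overlap(0.0) - relative_overlap(delta_deg)


def overlap_gained_from_node(delta_deg: float) -> float:
    """Overlap gained when the 1s AO moves delta out of the nodal plane (theta = 90)."""
    return relative_overlap(90.0 - delta_deg) - relative_overlap(90.0)


if __name__ == "__main__":
    delta = 10.0  # displacement of each hydrogen, in degrees (illustrative)
    lost = overlap_lost_from_axis(delta)
    gained = overlap_gained_from_node(delta)
    print(f"lost from axis:   {lost:.4f}")
    print(f"gained from node: {gained:.4f}")
    print(f"ratio:            {gained / lost:.1f}")
```

For small displacements the loss is second order in the angle and the gain is first order, which is the content of the statement about rates of change at 0° and 90°.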

The other π MO, 1πuy, undergoes no changes in overlap since the hydrogen atoms
remain in the nodal plane throughout the bending process. Therefore, QMOT arguments 
predict no energy change for this MO. 

The highest two MOs change in energy in ways that should be obvious to the reader, 
based on the above examples. Since these are the highest-energy MOs, they should 
show further enhanced sensitivity to overlap changes. 

The MOs for the bent form are labeled in accordance with the symmetry notation for
the C2v point group, with a and b meaning symmetric and antisymmetric, respectively,
for rotation about the two-fold axis and 1 and 2 being analogous symbols for reflection
in the plane containing the C2 axis and perpendicular to the molecular plane. The lowest
valence MO of each symmetry type is numbered “1” despite the fact that lower-energy
inner-shell orbitals exist.

Because of the very qualitative nature of the arguments leading to Fig. 14-8, no effort 
is made to attach a numerical scale, either for energy or angle. 

We are now in a position to see how predictions based on our Walsh-type correlation
diagram compare with experimental data. For molecules having only one or two valence
electrons, we expect the preferred shape in the ground state to be bent. Examples are H3⁺
and LiH2⁺, both of which have been shown by experiment and/or accurate calculation
to be bent. Molecules with three or four valence electrons should be linear, since
the 1σu → 1b2 energy rise is much greater than the energy change for the lower MO.
Examples are BeH2⁺, BeH2, and BH2⁺, which are indeed linear. Addition of one or two
more electrons now brings the 1πux → 2a1 MO into play, and we have already argued that
the energy change of this MO should be considerably greater than that of 1σu → 1b2.
Basically we have here a competition between a filled MO that favors the linear form
and a higher, partially or completely filled MO favoring the bent form. We therefore
might reasonably expect molecules in which 2a1 is singly occupied to be bent, and
molecules in which it is doubly occupied to be more bent. Occupancy of the 1πuy → 1b1
MO should have no effect on angle. Thus, that a molecule like BH2 (1a1)²(1b2)²(2a1)
has an equilibrium bond angle of 131°, whereas SiH2 (1a1)²(1b2)²(2a1)² has an angle
of 97°, NH2 (1a1)²(1b2)²(2a1)²(1b1) has 103°, and H2O (1a1)²(1b2)²(2a1)²(1b1)² has
105° is in pleasing accord with these simple ideas.
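
The reasoning of this paragraph amounts to a lookup on the ground-state occupancy of the four lowest valence MOs, which the following sketch makes explicit. It encodes only the qualitative conclusions stated above (it does not compute energies), the function names are hypothetical, and the valence electron counts in the example list are simple bookkeeping supplied for illustration.

```python
# Filling order of the bent-molecule labels for HAH, lowest first.
HAH_LEVELS = ("1a1", "1b2", "2a1", "1b1")


def configuration(n_valence: int) -> dict:
    """Ground-state occupancy of the four lowest valence MOs (1 to 8 electrons)."""
    if not 1 <= n_valence <= 8:
        raise ValueError("this scheme covers 1 to 8 valence electrons")
    occupancy = {}
    remaining = n_valence
    for label in HAH_LEVELS:
        occupancy[label] = min(2, remaining)
        remaining -= occupancy[label]
    return occupancy


def predicted_shape(n_valence: int) -> str:
    """Qualitative Walsh prediction for the HAH ground state."""
    occ = configuration(n_valence)
    if occ["2a1"] == 2:
        return "more bent"   # 2a1 doubly occupied; 1b1 has no angular preference
    if occ["2a1"] == 1:
        return "bent"        # singly occupied 2a1 outweighs filled 1b2
    if occ["1b2"] > 0:
        return "linear"      # 1b2 rise dominates the small 1a1 drop
    return "bent"            # only 1a1 occupied


EXAMPLES = [("H3+", 2), ("BeH2", 4), ("BH2", 5), ("NH2", 7), ("H2O", 8)]

if __name__ == "__main__":
    for name, n in EXAMPLES:
        print(f"{name:5s} {n} valence electrons: {predicted_shape(n)}")
```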

Changes of angle upon electronic excitation also agree well with the correlation
diagram. The triplet state of SiH2 resulting from the 2a1 → 1b1 excitation has a wider angle
(124°) than does the ground state (97°). The excited singlet corresponding to the same
excitation has a comparable angle (126°). NH2, when excited from ...(2a1)²(1b1) to
...(2a1)(1b1)², opens from 103° to 144°. The isoelectronic PH2, under similar
excitation, opens from 92° to 123°. There are other examples to support the validity of the
HAH diagram, but these suffice to illustrate that this qualitative approach has
considerable generality and utility. Note again that singly charged cations appear to fit QMOT
predictions despite the fact that there is less theoretical basis for success here.


EXAMPLE 14-3 SiH2 and H2O have rather similar H-A-H angles of 97° and 105°,
respectively. How would you expect these angles to compare for SiH2⁺ and H2O⁺?


SOLUTION » SiH2 loses an electron from the 2a1 MO, and should open up to an angle in the
130° range. H2O loses an electron from the 1b1 MO, so its angle should not be greatly affected.
(The value for H2O⁺ is 110.5°.)


Walsh-type correlation diagrams have been constructed and discussed for many 
systems, among them AH3, HAB, HAAH, BAAB, H2AAH2, B2AAB2, H3AAH3.⁴ It
is not appropriate that these all be described here. We will briefly discuss two more 
cases that bring in some additional features. 

Molecules with HAB configuration lack the high symmetry of HAH, and this means
that the MOs are not as highly symmetry-determined as in HAH. Probably the simplest
way to arrive at sketches for linear HAB MOs is to start with AB MOs (similar to
A2 MOs) and add the 1s AO of H in bonding and antibonding modes to form linear
MOs. The results for the seven lowest-energy MOs (all A-H bonding or nonbonding)
are seen at the left side of Fig. 14-9. (Diatomic MOs were discussed in Chapter 7.)


⁴See Gimarc [6].

We now imagine the hydrogen atom to move away from the AB axis, as shown, and 
use our QMOT rules to decide whether the energies should rise or fall. As before, we 
expect overlap changes between H and A (nearest neighbors) to have a greater effect 
on energy than those between H and B. 

The bent molecule has only one symmetry element, namely a reflection plane
containing the nuclei. An a′ MO is symmetric under this reflection; an a″ MO is antisymmetric.


Figure 14-9 » The Walsh-type diagram for the HAB system: (a) linear C∞v; (b) bent Cs. (After
Gimarc [6].) Reprinted with permission from Accounts Chem. Res. 7, 384 (1974). Copyright by the
American Chemical Society.


This paucity of symmetry types means that many of the correlation lines in the diagram
refer to the same symmetry. Hence, it is not too surprising that some of our correlations
run into conflict with the noncrossing rule. In Fig. 14-9, dashed lines are drawn from 3σ
to 4a′ and from 1πy to 3a′. These dashed lines connect the MO drawings in the manner
expected if we ignore the noncrossing rule and simply bend the MOs along with the
molecule. They are, as it were, intended correlations. But these lines are associated with
the same symmetry a′ and hence cannot cross. Instead we have an avoided crossing, as
3σ switches course and connects with 3a′ and 1πy goes to 4a′ (solid lines in Fig. 14-9).
Avoided crossings are not uncommon in quantum chemistry, and they occur in curves
referring to state energy as well as orbital energy. A generalized sketch exemplifying
the idea is shown in Fig. 14-10. The actual energy change (as a function of bond length,
angle change, or whatever process is occurring) may show an intermediate maximum
or minimum as a result of the avoided crossing. (The dashed lines are energies we
predict by “forgetting” to allow the two functions of the same symmetry to be mixed
in the variational procedure. The error involved in this is small if the two functions
are of dissimilar energy. As they grow closer in energy, the error grows worse, and the
deviation between solid and dashed lines gets bigger as the dashed lines converge.)
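
A minimal model of an avoided crossing is sketched below. It assumes two intended (dashed-line) energies that vary linearly with a geometrical coordinate x and cross at x = 0.5, mixed by a constant coupling between functions of the same symmetry. The linear forms, the coupling value, and the function names are illustrative assumptions rather than anything taken from Fig. 14-10.

```python
import math


def intended_energies(x: float):
    """Dashed-line energies: one rising, one falling, crossing at x = 0.5."""
    return x, 1.0 - x


def actual_energies(x: float, coupling: float = 0.2):
    """Solid-line energies after the two same-symmetry functions are mixed."""
    e1, e2 = intended_energies(x)
    mean = 0.5 * (e1 + e2)
    root = math.hypot(0.5 * (e1 - e2), coupling)
    return mean - root, mean + root


def deviation(x: float, coupling: float = 0.2) -> float:
    """Distance between the lower solid curve and the lower dashed curve."""
    lower_actual, _ = actual_energies(x, coupling)
    return min(intended_energies(x)) - lower_actual


if __name__ == "__main__":
    for x in (0.0, 0.25, 0.5, 0.75, 1.0):
        lo, hi = actual_energies(x)
        print(f"x={x:.2f}  lower={lo:+.3f}  upper={hi:+.3f}  deviation={deviation(x):.3f}")
```

In this model the two solid curves never approach closer than twice the coupling, and the deviation from the dashed lines is largest at the intended crossing point and small far from it.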

A classic example of such an intermediate maximum is seen in an excited state
of H2, illustrated in Fig. 14-11. Such maxima are important in understanding
high-energy processes because they provide a means for some molecules to exist in bound
vibrational states even while unstable with respect to dissociation products. Such states
are called metastable states.

The HAB Walsh diagram of Fig. 14-9 rationalizes the fact that the ten-valence
electron HCN is linear in the ground state (due to 1πy → 4a′) but bent in the first excited
state. Such molecules as HNO, HNF, and HOCl, with 12-14 electrons, are bent in both
ground and excited states because there is always at least one electron in the 5a′ level.

The final system we shall consider is the H3AAH3 system. We will show how
QMOT can be used to understand why diborane (B2H6) has a bridged structure whereas
ethane (C2H6) does not, why ethane prefers to be staggered (D3d) rather than eclipsed
(D3h), and what geometry changes we might expect if ethane is forced into the eclipsed
conformation.


Figure 14-10 » Intended (dashed lines) and actual correlations between orbitals or states having
the same symmetry. [Axes: state or orbital energy versus geometrical change.]


Figure 14-11 » The four lowest states of H2. Note the curve for the first excited ¹Σg⁺ state. This
shape results from an avoided crossing. (From Sharp [8].)


Gimarc's diagram relating MOs for A2H6 in D3d staggered and D2h bridged shapes
is shown in Fig. 14-12. The MOs on the left are all H-A bonding and are arranged
pretty much in the order one would expect on the basis of the A-A bonding. The lowest
two are composed mainly of valence s AOs on atoms A. Next come the π bonding
combinations, then the pσ bond, followed finally by the π antibonds. The 1eu π bonding
levels lie below the 2a1g pσ bonding level because the former have greater ability to
overlap with the hydrogens.

There are only two MOs that show much energy change as the molecule goes from
D3d to D2h geometry. These are the 2a1g → 2ag and the 1eg → 1b2g. In the former
case, the energy drops because two hydrogens have moved from positions off axis of
one p lobe into positions off axis of two p lobes, thereby increasing the total amount
of overlap. The energy in the latter case rises very markedly because the change in
geometry places all six hydrogens into nodal planes, greatly reducing the overlap.
The prediction is that a 10- or 12-valence electron A2H6 system should favor a bridged
D2h geometry over D3d but a 14-valence electron system should prefer D3d over D2h.
Diborane (12 valence electrons) and ethane (14) have structures consistent with this.


Figure 14-12 » Walsh-type correlation diagram for D3d → D2h shapes of H3AAH3. (After Gimarc
[6].) Reprinted with permission from Accounts Chem. Res. 7, 384 (1974). Copyright by the American
Chemical Society.


EXAMPLE 14-4 What structure should be expected for B2H6⁻?


SOLUTION » Assuming that the extra electron occupies the 1b2g MO, one might expect that
B2H6⁻ would have the D3d structure of ethane. It depends on whether or not one electron in the
1b2g MO suffices to overcome the effects of the doubly occupied MOs. (Experiments and ab initio
calculations indicate that B2H6⁻ has the same shape as ethane.)


In structural problems such as this, one must be careful that, when comparing two
possible molecular shapes, one is not overlooking other possibilities that might be even
more stable. For instance, even though the diagram in Fig. 14-12 indicates that ethane
should prefer D3d geometry to D2h, it says nothing about D3d relative to D3h, the
eclipsed form. For this we must construct another diagram, shown in Fig. 14-13. Here
the principal energy changes occur in the doubly degenerate e-type MOs. Recall that
degenerate MOs need not be symmetric or antisymmetric for all symmetry operations
of the molecular point group. This results, in this case, in the e-type MOs being much
more unbalanced, or lopsided, than the nondegenerate a-type MOs. As a result, the
changes in overlap population upon rotation are much bigger for e- than for a-type
MOs. The higher-energy eg → e″ set of MOs dominates the lower-energy eu → e′ set
for two reasons: it is at higher energy and hence more sensitive to overlap change,
and the coefficients on the hydrogens are bigger due to the central nodal plane, which
reduces the size of the MO in the A-A bond, forcing it to be larger elsewhere. In sum,
a 14-valence electron A2H6 molecule prefers the staggered (D3d) form because the
long-range H···H antibonding in 1e″ dominates the long-range bonding in 1e′. We
can describe this as “nonbonded repulsion” between hydrogens at opposite ends of the
molecule.


Figure 14-13 » Walsh-type diagram for D3d → D3h A2H6: (a) D3d (staggered); (b) D3h (eclipsed).
Note that the e-type MOs for the two forms do not turn into each other by rotating about the C-C
bond. This is really a two-sided correlation diagram, the high symmetry of each form determining
the MOs. Then QMOT rules are used to decide how similar MOs on the two sides should relate in
energy. (See Lowe [9].)

The energy changes in Fig. 14-12 are much larger than those in Fig. 14-13. In the 
former, we are charting energy changes associated with changes in bond angle and even 
molecular topology. Overlap changes are large and occur between nearest neighbors. 
In Fig. 14-13 we are charting energy changes associated with internal rotation. Here 
the overlap changes occur between third-nearest neighbors and are very small. The
order in which the possibilities have been examined (D3d versus D2h, followed by D3d
versus D3h) is thus sensible in that we are considering the grosser energy changes first.

One can go even further and guess the qualitative changes in C-C, C-H distances,
and C-C-H angle if ethane is forced into the eclipsed conformation. We argue that the
1eg → 1e″ pair of MOs suffer the greatest overlap change, losing population between
the vicinal hydrogens. We must renormalize the MO to compensate for this loss, just
as we had to in H2⁺, discussed earlier. To renormalize, the MO 1e″ is multiplied by a
factor slightly greater than unity. This magnifies the π antibond between the carbons
and the bonding between carbon and hydrogens. Thus, we expect eclipsed ethane to
have a slightly lengthened C-C bond, slightly shorter C-H bonds, and a larger C-C-H
angle (the latter presumably mainly due to the increased vicinal repulsion brought
about by overlap changes in the original rotation). Ab initio calculations⁵ support these
predictions.

We have indicated that higher-energy MOs tend to undergo more pronounced energy
changes upon overlap change due to distortion of the nuclear frame. This fact has led
to a shortcut method for guessing the results of full orbital correlation diagrams of the
sort we have already discussed. One merely considers what the energy behavior will
be for the highest occupied MO (HOMO) and bases the prediction entirely on that MO,
ignoring all the others. Fukui⁶ was the first to draw attention to the special importance
of the HOMO. He also noted that, in certain reactions in which the molecule in question
acted as an electron acceptor, the lowest unfilled MO (LUMO) of the molecule (before it
has accepted the electrons) is the important one. These two MOs are called the frontier
MOs. It sometimes happens that the second-highest-energy occupied MO undergoes a
much bigger overlap change than the HOMO does, and therefore dominates the process.
Recognition of this fact has led to a special name for the MOs just below the HOMO and
just above the LUMO. They are called subjacent and superjacent MOs (see Fig. 14-14).


⁵Stevens [10] finds that the C-C distance increases by 0.01 Å, the C-H distance decreases by 0.001 Å, and the
C-C-H angle opens by 0.3°.
⁶See Fujimoto and Fukui [11].


Figure 14-14 » HOMO and LUMO frontier MOs and subjacent and superjacent MOs.

An example of the frontier MO approach is provided by reconsidering the staggered 
versus eclipsed conformation for ethane. We expect the highest-energy, lopsided MO 
to be the π antibonding e_g–e″ degenerate pair. Qualitative molecular orbital theory
rules lead us to expect this pair to have higher energy in the eclipsed form. Therefore, 
we expect ethane to be staggered. 

The same approach can be used to predict the conformation of dimethylacetylene, 
H3C-C≡C-CH3. The HOMO is again a degenerate pair of π-type MOs. (Generally
speaking, one assumes that occupied π MOs are higher in energy than occupied σ MOs,
and this is often true. Even when it is not, however, the π-type MOs often tend to be
more lopsided and hence to dominate because their overlap changes are greater.) The 
two degenerate HOMOs are delocalized over the entire molecule and can be expected 
to have the following characteristics: 


1. They will be orthogonal to each other. 


2. They will be bonding in the central C≡C region, helping to establish multiple bond
character there. 


3. They will be antibonding in the C-C single bond regions, thereby cancelling out 
double-bond character from a lower set of π-type MOs.


4. They will be C-H bonding. 


[Use of rules (2)-(4) often suffices to establish the qualitative nature of HOMOs of 
hydrocarbons.] The results of all these conditions are the MOs sketched in Fig. 14-15. 
Observe that the end-to-end hydrogen overlap is most positive in the eclipsed confor- 
mation. This leads to the prediction that this molecule is more stable in the eclipsed 
conformation. Since the hydrogens are so far apart, the overlap change is expected to 
be very small. Ab initio calculations indicate that dimethylacetylene is more stable in 
the eclipsed form and that it has a barrier of less than 0.02 kcal/mole. 

As another example of frontier orbital usage, consider the methyl rotation barrier 
in propene, H3C-CH=CH2. Here the HOMO should be π-bonding in the double
bond, antibonding in the single bond, and C-H bonding in the methyl group. This 
MO is sketched in Fig. 14-16 for the two possible conformations. The end-to-end 
antibonding in this MO is greatest for case (b), and so conformation (a) is favored. 
Indeed, it has been observed that, in general, a threefold rotor attached to a double bond 
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Figure 14-15 » Degenerate HOMOs for dimethylacetylene, (a) eclipsed, (b) staggered, lead to the 
prediction that this molecule should prefer the eclipsed conformation. 


(a) (b) 


Figure 14-16 » HOMO of propene in two conformations. Methyl group (a) eclipses and (b) stag- 
gers the double bond. The energy is lower in (a). 


prefers to eclipse the double bond. A few examples are acetaldehyde (H3C-CH=O), 
N-methylformaldimine (H3C-N=CH2), nitrosomethane (H3C-N=O), and vinyl 
silane (H3Si-CH=CH2).

Notice that the HOMO of Fig. 14-16 is qualitatively similar to one of the 1e_g–1e″
HOMOs of ethane. The QMOT frontier orbital argument for the stability of staggered 
ethane is basically the same as that for the stability of form (a) in propene. Observe that 
this MO also resembles the HOMO of 1,3-butadiene, and would lead to the prediction 
that the trans form of this molecule is more stable than the cis. This is, in fact, observed 
to be the case. 

It is important to bear in mind that the frontier orbital approach is an approximation 
to an approximation. It is not always easy to know when one is on safe ground. Of 
the examples mentioned here, ethane and dimethylacetylene are safest because the 
overlap changes are small. Hence, the perturbation is slight, and our assumption that 
the MOs of the two forms are essentially identical, except for AO overlap changes, is 
quite accurate. Also, symmetry is high, and so we know that σ MO overlap changes
will be smaller than lopsided π MO overlap changes. Propene and butadiene are risky.
Here the whole molecular framework is lopsided, so overlap changes are large in σ as
well as in π MOs. Indeed, if one performs EHMO calculations on these molecules,
one finds that σ-type MO energies change much more than does the π HOMO energy.
This comes about because of the fairly close approach by some of the hydrogens in
these molecules. Much of this σ energy change cancels out among the several σ MOs.
In propene, the cancellation is so complete that the π HOMO energy change is almost
the same as the total EHMO energy change. In butadiene, however, the π HOMO
accounts for only about one-third of the total energy change. Even though the frontier 
orbital method has an astonishing range of qualitative usefulness (we shall see more 
applications shortly), it is clear that caution is needed. 


EXAMPLE 14-5 How should the bond lengths and angles of propene change if the
methyl group rotates from the stable conformation (Fig. 14-16a) to the unstable
one (Fig. 14-16b)?


SOLUTION » The HOMO for form (b) has a bit more negative overlap, which requires a slightly
larger normalizing coefficient. This increases the influence of the HOMO, so we expect a slight
shortening of the two out-of-plane C-H bonds and the C=C bond, a lengthening of the C-C single
bond, and a slight opening of the H-C-H angle for the out-of-plane C-H bonds. No changes
are predicted for the in-plane C-H bond lengths or angles as a result of HOMO renormalization.
(Other MOs are also being renormalized, so other changes will result, but the
HOMO-induced changes should dominate.)
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It has been found possible to extend and amplify QMOT procedures so that they apply 
to chemical reactions. One of the most striking examples of this was application 
to unimolecular cyclization of an open conjugated molecule (e.g., cis-1,3-butadiene, 
closing to cyclobutene). This type of reaction is called an electrocyclic reaction. The 
details of the electrocyclic closure of cis-1,3-butadiene are indicated in Fig. 14-17. 

If we imagine that we can keep track of the terminal hydrogens in butadiene (perhaps 
by deuterium substitution as indicated in the figure) then we can distinguish between 
two products. One of them is produced if the two terminal methylene groups have 
rotated in the same sense, either both clockwise or both counterclockwise, to put the 
two inside atoms of the reactant (here D atoms) on opposite sides of the plane of the 
four carbon atoms in the product. This is called a conrotatory (con′-ro-ta′-tory)
closure. The other mode rotates the methylenes in opposite directions (disrotatory) to 
give a product wherein the inside atoms appear on the same side of the C4 plane. 

A priori, we do not know whether the reaction follows either of these two paths. 
Figure 14-17 depicts processes where both methylene groups rotate by equal amounts


[Figure 14-17 panels: disrotatory closure and conrotatory closure of terminally deuterated cis-1,3-butadiene.]


Figure 14-17 » Two idealized modes of electrocyclic closure of cis-1,3-butadiene. 
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as the reaction proceeds. This is an extreme case of what is known as a concerted 
process. The two processes occur together, or in concert. The opposite extreme is a 
nonconcerted, or stepwise process, wherein one methylene group would rotate all the 
way (90°) and only after this was completed would the other group begin to rotate.
This process would lead to an intermediate having a plane of symmetry (ignoring the
difference between D and H), which means that the second methylene group would be
equally likely to rotate either way, giving a 50-50 mixture of the two products pictured
in Fig. 14-17. 

One can make a case for the reaction having some substantial degree of concertedness 
(by which we mean that the second methylene should be partly rotated before the first 
one is finished rotating). The reaction involves destruction of a four-center conjugated 
π system and formation of an isolated π bond and a new C-C σ bond. Energy is lost in
the dissolution of the old bonds, and gained in formation of the new ones. Therefore, we 
expect the lowest-energy path between reactants and products to correspond to a reaction 
coordinate wherein the new bonds start to form before the old ones are completely 
broken. But the new σ bond cannot form to any significant extent until both methylene
groups have undergone some rotation. Thus, concertedness in breaking old bonds 
and forming new ones is aided by some concertedness in methylene group rotations. 
(Note that concertedness does not necessarily imply absence of an intermediate. If the 
reaction surface had a local minimum at a point at which both methylenes were rotated 
by 45°, it would not affect the argument at all.) 

Because they knew that many electrocyclic reactions are observed to be stereospecific 
(i.e., give ~100% of one product or the other in a reaction like that in Fig. 14-17), 
Woodward and Hoffmann [12] sought an explanation of a qualitative MO nature. They 
used frontier orbitals and argued how their energies would change with a con- or 
disrotatory motion, due to changes in overlap. For butadiene in its ground state, the 
HOMO is the familiar π MO shown in the center of Fig. 14-18. The figure indicates that
the interaction between p-π AOs on terminal carbons is favorable for bonding in the
region of the incipient σ bond only in the conrotatory case. Therefore, the prediction
is that, for concerted electrocyclic closure, butadiene in the ground state should prefer 
to go by a conrotatory path. When the reaction is carried out by heating butadiene 
(thermal reaction), which means that the reactant is virtually all in the ground electronic 
state, the product is indeed purely that expected from conrotatory closure. 

One can also carry out electrocyclic reactions photochemically. The excited butadiene
now has an electron in a π MO that was empty in the ground state. This MO was
the lowest unoccupied MO (LUMO) of ground-state butadiene, pictured in Fig. 14-19. 
One can see that the step to the next-higher MO of butadiene has just introduced one 
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Figure 14-18 » The HOMO of ground state cis-1,3-butadiene as it undergoes concerted closure 
by either mode. 
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Figure 14-19 » The HOMO of the first excited state of cis-1,3-butadiene as it undergoes closure 
by either mode. 


more node, reversing the phase relation between terminal π AOs, and reversing the
predicted path from con- to disrotatory. Experimentally, the photochemical reaction
is observed to give purely the product corresponding to disrotatory closure. (It is
not always obvious which empty MO becomes occupied in a given photochemical 
experiment. One assumes that the LUMO of the ground state is the one to use, but 
there is some risk here.) 
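
The end-to-end phase argument can be checked numerically. What follows is a minimal sketch, assuming the simple Hückel closed-form coefficients for an unbranched chain of n carbons, c_jk ∝ sin(jkπ/(n + 1)). The function names are illustrative and do not come from the text. The sketch reports the closure mode for which the terminal p-π lobes of the chosen frontier MO overlap positively in the region of the incipient σ bond.

```python
import math


def huckel_chain_mo(n_atoms, k):
    """Coefficients of the k-th Hückel pi MO (k = 1 is lowest) of a linear chain."""
    norm = math.sqrt(2.0 / (n_atoms + 1))
    return [norm * math.sin(j * k * math.pi / (n_atoms + 1))
            for j in range(1, n_atoms + 1)]


def closure_mode(n_atoms, excited=False):
    """Predict the closure mode from the end-to-end phase of the frontier MO.

    Assumes a neutral chain with one pi electron per carbon (n_atoms even).
    For the first excited state the ground-state LUMO is used, which the
    text flags as an assumption that carries some risk.
    """
    homo = n_atoms // 2                 # index of the highest doubly occupied MO
    k = homo + 1 if excited else homo   # ground-state LUMO for the excited case
    c = huckel_chain_mo(n_atoms, k)
    same_phase = c[0] * c[-1] > 0       # terminal lobes on one face have equal sign
    # Equal signs overlap positively under disrotation, opposite signs under conrotation.
    return "disrotatory" if same_phase else "conrotatory"


print("butadiene, thermal:      ", closure_mode(4))
print("butadiene, photochemical:", closure_mode(4, excited=True))
```

For four carbons the sketch reproduces the conrotatory thermal path and the disrotatory photochemical path described above.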


EXAMPLE 14-6 Is the LUMO ← HOMO transition dipole-allowed for cis-1,3-butadiene?


SOLUTION » The HOMO is antisymmetric for reflection through the symmetry plane that
bisects the molecule, and the LUMO is symmetric for this reflection. This is the only symmetry
reflection plane where the MOs have opposite symmetry, so the transition is dipole-allowed (and is
polarized from one side of the molecule towards the other). The group theory approach for this C2v
molecule is that the HOMO has a2 symmetry, the LUMO has b1 symmetry, their product has b2
symmetry, and, since x also has b2 symmetry, the transition is allowed and is x-polarized (where
x is collinear with the central C-C bond).
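
The group-theory step in this solution can be reproduced by multiplying characters. The sketch below assumes the standard C2v character table with operations ordered (E, C2, σv, σv′); the function names are illustrative. Which of x and y carries b1 and which carries b2 depends on how the axes are chosen, so the code only tests whether the product matches some Cartesian component. In C2v the three components span a1, b1, and b2, so only an a2 product is dipole-forbidden.

```python
# Characters under (E, C2, sigma_v, sigma_v') for the C2v irreducible representations.
C2V_CHARACTERS = {
    "a1": (1, 1, 1, 1),
    "a2": (1, 1, -1, -1),
    "b1": (1, -1, 1, -1),
    "b2": (1, -1, -1, 1),
}


def direct_product(irrep_1, irrep_2):
    """Return the C2v irrep whose characters equal the product of the two inputs."""
    product = tuple(x * y for x, y in
                    zip(C2V_CHARACTERS[irrep_1], C2V_CHARACTERS[irrep_2]))
    for name, characters in C2V_CHARACTERS.items():
        if characters == product:
            return name
    raise ValueError("product is not a C2v irrep")


def dipole_allowed(irrep_initial, irrep_final):
    """True if the product symmetry matches that of x, y, or z (a1, b1, or b2)."""
    return direct_product(irrep_initial, irrep_final) in {"a1", "b1", "b2"}


print(direct_product("a2", "b1"), dipole_allowed("a2", "b1"))
```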


One might worry about the fact that we are looking at only a part of one MO, thereby 
ignoring a great deal of change in other MOs and other parts of the molecule. However, 
much of this other change, while large, is expected to be about the same for either of the 
two paths being compared. The large overlap changes between p-π AOs on terminal
and inner carbon atoms, for instance, are about the same for either mode of rotation. 
This approach, then, is focused first on the frontier orbitals, which are guessed as being 
most likely to dominate the energy change, and second on those changes in the frontier 
orbitals that will differ in the two paths. 

This method is trivially extendable to longer systems. Hexatriene closes to cyclo- 
hexadiene in just the manner predicted by the frontier orbitals. The only significant 
change in going from butadiene to hexatriene is that we go from four to six π electrons.
This means that the HOMO for hexatriene has one more node than that for butadiene
(or, the HOMO for a 2n π-electron system is like the LUMO for a (2n − 2) π-electron
system insofar as end-to-end phase relations are concerned). The net effect is that the
predictions for hexatriene are just the reverse of those for butadiene. That is, hexatriene 
closes thermally by the disrotatory mode and photochemically by the conrotatory mode. 
The general rule, called a Woodward-Hoffmann rule, is this: the thermal electrocyclic 
reactions of a k π-electron system will be disrotatory for k = 4q + 2, conrotatory for
k = 4q (q = 0, 1, 2, ...); in the first excited state these relationships are reversed.⁷
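
The rule lends itself to a one-line test on the electron count. The following is a small sketch of that bookkeeping; the function name and its arguments are illustrative, and the sketch covers only the ground state and the first excited state named in the rule.

```python
def electrocyclic_mode(k, excited=False):
    """Closure mode for a concerted electrocyclic reaction of a k pi-electron system.

    k       -- number of pi electrons (a positive even integer)
    excited -- False for the thermal (ground-state) reaction,
               True for the first excited state
    """
    if k <= 0 or k % 2 != 0:
        raise ValueError("k must be a positive even number of pi electrons")
    thermal = "conrotatory" if k % 4 == 0 else "disrotatory"
    if not excited:
        return thermal
    # In the first excited state the relationships are reversed.
    return "disrotatory" if thermal == "conrotatory" else "conrotatory"


for k in (4, 6, 8):
    print(k, electrocyclic_mode(k), electrocyclic_mode(k, excited=True))
```

Butadiene (k = 4) and hexatriene (k = 6) give the thermal and photochemical modes quoted in the text.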

It is possible to treat electrocyclic reactions in another way, namely, via a two-sided 
correlation diagram approach. This was first worked out by Longuet-Higgins and 
Abrahamson [14]. Only orbitals (occupied and unoccupied) that are involved in bonds 
being made or broken during the course of the reaction are included in the diagram. For 
butadiene, these are the four π MOs already familiar from simple Hückel theory. For
cyclobutene, they are the two π MOs associated with the isolated 2-center π bond and
the two σ MOs associated with the new C-C σ bond. These orbitals and their energies
are shown in Fig. 14-20. 

In cyclobutene, the σ and σ* MOs are assumed to be more widely split than the π
and π* because the pσ AOs overlap more strongly. Also, the σ MO is assumed lower
than π1 of butadiene. However, these details are not essential. All we have to be certain
of is that we have correctly divided the occupied from the unoccupied MOs on the two 
sides. The dashed line in Fig. 14-20 separates these sets. 

Next we must decide which symmetry elements are preserved throughout the ideal- 
ized reactions we wish to treat. Let us consider first the reactants and products. These 
have C2v symmetry, that is, a two-fold rotational axis, C2, and two reflection planes σ1




(a) (b) 


Figure 14-20 » MOs associated with bonds being broken or formed in the electrocyclic closure of 
(a) cis-1,3-butadiene to (b) cyclobutene. 


7See Woodward and Hoffmann [13, p. 45]. 
Figure 14-21 » Sketches illustrating that the conrotatory mode (b) preserves the C2 axis while the
disrotatory mode (a) preserves the reflection plane σ1.


and σ2 containing the C2 axis (see Fig. 14-21). A conrotatory twist preserves C2, but,
during the intermediate stages between reactant and product, σ1 and σ2 are lost as symmetry
operations. A disrotatory twist preserves σ1 but destroys C2 and σ2. Therefore,
when we connect energy levels together for the disrotatory mode, we must connect
levels of the same symmetry for σ1, but for the conrotatory mode, they must agree
in symmetry for C2. The σ2 plane applies to neither mode and is therefore ignored.
The symmetries for each MO are easily determined from examination of the sketches 
in Fig. 14-20, and are given in Table 14-3. These assignments lead to two different 
correlation diagrams, one for each mode. It is conventional to arrange these as shown 
in Fig. 14-22. 


TABLE 14-3 » Symmetries for C2v MOsᵃ


Butadiene              Cyclobutene
MO     σ1    C2        MO     σ1    C2
π1     S     A         σ      S     S
π2     A     S         π      S     A
π3     S     A         π*     A     S
π4     A     S         σ*     A     A


ᵃS is symmetric; A antisymmetric.
Figure 14-22 » A pair of two-sided correlation diagrams (one for each mode) for the electrocyclic
reactions of cis-1,3-butadiene: (a) cyclobutene; (b) butadiene; (c) cyclobutene. 


There is curve crossing in these diagrams, but it is always lines of different symmetry 
that cross, and so no violation of the noncrossing rule occurs. 
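
The construction of the two orbital correlation diagrams can be sketched in a few lines. The example below assumes the symmetry labels of Table 14-3 and the energy ordering described in the text (σ below π below π* below σ* for cyclobutene); the data layout and function name are illustrative. Each reactant level, taken in order of increasing energy, is joined to the lowest unused product level having the same symmetry under the preserved element.

```python
# (name, symmetry under sigma1, symmetry under C2), listed in order of increasing energy.
BUTADIENE = [("pi1", "S", "A"), ("pi2", "A", "S"), ("pi3", "S", "A"), ("pi4", "A", "S")]
CYCLOBUTENE = [("sigma", "S", "S"), ("pi", "S", "A"), ("pi*", "A", "S"), ("sigma*", "A", "A")]

# The disrotatory mode preserves sigma1; the conrotatory mode preserves C2.
PRESERVED_COLUMN = {"dis": 1, "con": 2}


def correlate(reactant, product, mode):
    """Connect levels of like symmetry, lowest to lowest, for the given mode."""
    column = PRESERVED_COLUMN[mode]
    unused = list(product)
    lines = {}
    for level in reactant:
        for candidate in unused:
            if candidate[column] == level[column]:
                lines[level[0]] = candidate[0]
                unused.remove(candidate)
                break
    return lines


print("con:", correlate(BUTADIENE, CYCLOBUTENE, "con"))
print("dis:", correlate(BUTADIENE, CYCLOBUTENE, "dis"))
```

Because lines of the same symmetry are connected in energy order, no two of them cross; only lines of different symmetry can.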

If we are considering a thermal reaction, the lowest two π MOs of butadiene are
occupied. These correlate with the lowest two MOs of cyclobutene if the conrotatory
mode is followed, and the thermal conversion of cis-butadiene to cyclobutene by a
conrotatory closure is said to be symmetry allowed. The other mode correlates π2 with
an empty cyclobutene MO (π*). Taking this route moves the reactant toward doubly
excited cyclobutene. (Even though we might anticipate de-excitation somewhere along 
the way, the energy required in early stages would still be much higher than would 
be needed for the symmetry-allowed mode.) This is said to be a symmetry-forbidden
reaction.


If we now imagine photo-excitation of cis-butadiene to have generated a state associated
with the configuration π1²π2π3, and trace the fate of this species for the two
modes of reaction, we note that the disrotatory route leads to cyclobutene in the configuration
σ²ππ* while the conrotatory mode gives π²σσ*. Both of these are excited,
but the former corresponds to the lowest excited configuration (π → π*) while the
second corresponds to a very high-energy excitation (σ → σ*). Therefore, the former
is "allowed" (since it goes from lowest excited reactant to lowest excited product) and 
the latter is "forbidden." 

The two-sided correlation diagrams of Fig. 14-22 thus lead to the same predictions 
as the frontier orbital maximization of overlap approach. The difference between 
these approaches is as follows: The frontier-orbital approach requires sketching the 
HOMO and then judging overlap changes upon nuclear motion using QMOT reasoning. 
The two-sided correlation diagram approach requires sketching all the MOs (occupied 
and unoccupied) of both reactant and product involved in bonds breaking or forming, 
ordering the corresponding energy levels, and finding symmetry elements preserved 
throughout the reaction. Once all this is done, the levels are connected by correlation 
lines without reliance on QMOT reasoning. Some qualitative reasoning enters in the 
ordering of energy levels (levels with more nodes have higher energy), but the two-sided 
correlation diagram technique is the more rigorous method of the two and tends to be 
preferred whenever the problem has enough symmetry to make it feasible. Reliance 
on frontier orbitals is more common for processes of lower symmetry. 

Concern is sometimes expressed about the apparent restrictions resulting from use 
of symmetry in correlation diagram arguments. One can imagine the butadiene cycliza- 
tion occurring with less-than-perfect concertedness, the two methylene groups rotating 
by different amounts as the reaction proceeds. But that would destroy all symmetry 
elements. Will our symmetry-based arguments still pertain to such an imperfectly con- 
certed reaction coordinate? Again, if we label certain sites by substituting deuteriums 
for hydrogens as shown in Fig. 14-17, the symmetry will be destroyed. Do our pre- 
dictions still apply? One can answer these questions affirmatively by reasoning in the 
following way. If we had a collection of nuclei and electrons, and we could move 
the nuclei about in arbitrary ways and study the ground-state energy changes, expe- 
rience tells us that the energy would be found to change in a smooth and continuous 
way. We can think of the energy as a hypersurface, with hyperdimensional “hills,” 
“valleys,” and “passes.” Now, in a few very special nuclear configurations, identical 
nuclei would be interrelated by symmetry operations, and we would be able to make 
deductions on group-theoretical grounds. Such deductions would only strictly apply to 
those symmetric configurations, but they would serve as indicators of what the energy 
is like in nearby regions of configuration space. Thus, the correlation diagram indi- 
cates that a perfectly concerted thermal electrocyclic reaction of butadiene will require 
much less energy to go conrotatory as opposed to disrotatory. The inference that a 
less-perfectly concerted reaction will have a similar preference is merely an assump- 
tion that it is easier to pass through the mountains in the vicinity of a low pass than 
a high one. Experience also leads us to expect that substituting for H a D (or even a 
CH3) will have little effect on the MOs, even though, strictly speaking, symmetry is 
lost. In essence, we work with an ideal model and use chemical sense to extend the 
results to less ideal situations, just as we do when, in applying the ideal gas equation 
of state to real gases, we avoid the high-pressure, low-temperature conditions under 
which we know the oversimplifications in the ideal gas model will lead to significant 
error. 
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It is possible to combine information on orbital symmetries and energies to arrive 
at state symmetries and energies. Then one can construct a correlation diagram for 
states. We now demonstrate this for the dis- and conrotatory reactions just considered. 

Each orbital occupation scheme is associated with a net symmetry for any given sym- 
metry operation. Character tables could be used to assign these symmetries, but this is 
not necessary. All we need to use is the fact that, in multiplying functions together, symmetries
follow the rules: S × S = S, A × A = S, S × A = A. Thus, any doubly occupied
MO in a configuration will contribute symmetrically to the final result. To ascertain 
the net symmetry, then, we focus on the partly filled MOs. The symmetries for C2 and
σ1 of ground and some excited configurations of butadiene and cyclobutene are listed
in Table 14-4. Included are the cyclobutene configurations that result from intended 
correlations of various butadiene configurations. (For instance, π1²π2² butadiene has an


TABLE 14-4 » Symmetries and Intended Correlations of Some Configurations
of Butadiene and Cyclobutene


                  Symmetryᵃ        Cyclobutene "intended" configuration
Configuration     σ1     C2        Con         Dis

Butadiene
π1²π2²            S      S         σ²π²        σ²π*²
π1²π2π3           A      A         π²σσ*       σ²ππ*
π1²π2π4           S      S         π²σπ*       σ²π*σ*
π1π2²π3           S      S         πσ²σ*       σππ*²
π1π2²π4           A      A         πσ²π*       σπ*²σ*
π1²π3²            S      S         π²σ*²       σ²π²

Cyclobutene
σ²π²              S      S
σ²ππ*             A      A
σπ²π*             A      S
σ²πσ*             A      S
σπ²σ*             A      A
σ²π*²             S      S
σ²π*σ*            S      A


ᵃS is symmetric; A antisymmetric.


8 Actually, we shall be looking at simple products of MOs, or configurations. Each configuration is associated 
with one or more states and gives the proper symmetry for these states as well as an approximate average energy 
of all the associated states. Hence, the treatment described here gives a sort of average state correlation diagram. 
It might be more accurately called a configuration correlation diagram. 
intended correlation with σ²π*² cyclobutene if the disrotatory mode is followed. This
is inferred from the orbital correlation diagram, Fig. 14-22.) 
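
The bookkeeping behind Table 14-4 can be sketched as follows. The example assumes the orbital symmetries of Table 14-3 and the orbital correlations obtained by connecting levels of like symmetry for each mode; the dictionary layout and function names are illustrative. A configuration is written as a mapping from MO name to electron count.

```python
# (sigma1, C2) symmetry of each MO, as in Table 14-3.
MO_SYMMETRY = {
    "pi1": ("S", "A"), "pi2": ("A", "S"), "pi3": ("S", "A"), "pi4": ("A", "S"),
    "sigma": ("S", "S"), "pi": ("S", "A"), "pi*": ("A", "S"), "sigma*": ("A", "A"),
}

# Butadiene MO -> cyclobutene MO for each mode (levels of like symmetry connected).
ORBITAL_CORRELATION = {
    "con": {"pi1": "pi", "pi2": "sigma", "pi3": "sigma*", "pi4": "pi*"},
    "dis": {"pi1": "sigma", "pi2": "pi*", "pi3": "pi", "pi4": "sigma*"},
}


def multiply(a, b):
    """S x S = S, A x A = S, S x A = A."""
    return "S" if a == b else "A"


def configuration_symmetry(occupation):
    """Net (sigma1, C2) symmetry of a configuration given as {MO: electrons}."""
    sigma1, c2 = "S", "S"
    for mo, electrons in occupation.items():
        for _ in range(electrons):      # a doubly occupied MO contributes S overall
            sigma1 = multiply(sigma1, MO_SYMMETRY[mo][0])
            c2 = multiply(c2, MO_SYMMETRY[mo][1])
    return sigma1, c2


def intended_configuration(occupation, mode):
    """Cyclobutene configuration reached by following each orbital's correlation line."""
    return {ORBITAL_CORRELATION[mode][mo]: n for mo, n in occupation.items()}


ground = {"pi1": 2, "pi2": 2}
print(configuration_symmetry(ground))          # net symmetry of the ground configuration
print(intended_configuration(ground, "dis"))   # doubly excited cyclobutene configuration
```

An intended correlation necessarily conserves the symmetry for the element that the mode preserves (C2 for conrotatory, σ1 for disrotatory), which provides a quick consistency check on the table entries.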

Assuming that the energies of states associated with these configurations fall into 
groups roughly given by sums of orbital energies, we obtain the two-sided diagram 
shown in Fig. 14-23. Only a few of the configurations are interconnected, to keep 
the diagram simple. Note that the ground-state configuration of butadiene correlates 
directly with the ground state of cyclobutene for conrotatory closure, but has an intended 
correlation with a doubly excited configuration in the disrotatory mode. This intended 
correlation would violate the noncrossing rule by crossing another line of S symmetry, 
so that the actual curve turns around and joins onto the ground state level for cyclobutene. 
The effect of the intended correlation with a high-energy state is to produce a significant 
barrier to reaction. The figure shows that, for the first excited configuration, the high- 
energy barrier occurs for the opposite mode of reaction. State correlation diagrams 
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Figure 14-23 » A state or configuration correlation diagram for the electrocyclic closure of cis- 
1,3-butadiene. 
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thus convert a “symmetry-forbidden” orbital correlation diagram into a high-activation- 
energy barrier: the conclusions are the same using either diagram. 

Another kind of reaction that is formally closely related to the electrocyclic reaction 
is the cycloaddition reaction, exemplified by the Diels-Alder reaction between ethylene
and butadiene to give cyclohexene (I). Such reactions are classified in terms of the 
number of centers between the points of connection. Thus, the Diels-Alder reaction
is a [4 + 2] cycloaddition reaction. One can conceive of several distinct geometrical
possibilities for a concerted mechanism for such a reaction. The two new σ bonds can
be envisioned as being formed on the same face (suprafacial) (II) or opposite faces
(antarafacial) (III) of each of the two reactants. The various possibilities are illustrated
in Fig. 14-24. Qualitative MO theory is used to judge which process is energetically 
most favorable. One has a choice between the two-sided correlation diagram and the 
frontier orbital approach. We demonstrate the latter⁹ since it is simpler. Both methods
lead to the same conclusion. In the course of this reaction, electrons become shared 
between the π systems of butadiene and ethylene. This is accomplished, to a rough
approximation, by interaction between the HOMO of butadiene and the LUMO of 
ethylene and also between the LUMO of butadiene and the HOMO of ethylene. Let us 
consider the former interaction. The MOs are shown in Fig. 14-25 and the overlapping 
regions are indicated for the four geometric possibilities. Inspection of the sketches 
indicates that the two MOs have positive overlap in the regions of both incipient σ
bonds only for the [4s + 2s] and [4a + 2a] modes. Therefore, the prediction is that these
modes proceed with less activation energy and are favored. Now let us turn to the other 
pair of MOs, namely the LUMO of butadiene (IV) and the HOMO of ethylene (V). 
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9See Hoffmann and Woodward [15]. 
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[4s + 2s] [4a + 2a] 
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[4a + 2s] [4s + 2a] 


Figure 14-24 » Four suprafacial-antarafacial combinations possible for the Diels-Alder 2 + 4 
cycloaddition reaction. 


[4a + 2s] [4s + 2a] 


Figure 14-25 » Overlaps between HOMO of butadiene and LUMO of ethylene resulting from four 
interactive modes pictured in Fig. 14-24. The geometries for the four modes would all differ. These
drawings are highly stylized.


Note that, for each MO, the end-to-end phase relationship is reversed from what it was 
before. Two symmetry reversals leave us with no net change in the intermolecular phase 
relations. It is easy to see, therefore, that these MOs also favor the [s, s] and [a, a] modes.
In cycloaddition reactions of this sort, one need analyze only one HOMO-LUMO pair 
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in order to arrive at a prediction. Extension to longer molecules or to photochemical 
cycloadditions proceeds by the same kinds of arguments presented for the electrocyclic 
reactions. 

Another type of reaction to which qualitative MO theory has been applied is the 
sigmatropic shift reaction, where a hydrogen migrates from one carbon to another and 
simultaneously a shift in the double bond system occurs. An example is given in 
Fig. 14-26. 

The usual treatment of this reaction¹⁰ involves examining the HOMO for the system
at some intermediate stage in the reaction where the hydrogen has lost much of its 
bonding to its original site and is trying to bond onto its new site. At this stage, the 
HOMO of the molecule becomes like that of the nonbonding MO of an odd alternant 
hydrocarbon (Fig. 14-27) with a slightly bound hydrogen on one end. The suprafacial 
mode is favored in this particular case because the hydrogen can maintain positive 
overlap simultaneously with its old and new sites—the new bond can form as the old 
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Figure 14-26 » The two possible distinct products resulting from a shift of a hydrogen from position 
1 to position 5 in a substituted 1,3-pentadiene. Groups A, B, C, D are deuterium atoms, methyl groups, 
etc., enabling us to distinguish the products. 


(b) 


Figure 14-27 » Phase relations in the HOMO for (a) suprafacial and (b) antarafacial [1, 5] 
sigmatropic shifts. 


10See Woodward and Hoffmann [16]. 
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bond breaks. This is not possible for the antarafacial [1, 5] shift. However, the [1, 7] 
shift prefers the antarafacial mode. 

Many other types of chemical reaction have been rationalized using qualitative MO 
theory. The association of SN2 reactions with Walden inversion (i.e., the adding group
attacks the opposite side of an atom from the leaving group) is rationalized by arguing 
that an approaching nucleophile will donate electrons into the LUMO of the substrate. 
The LUMO for CH3Cl is shown in Fig. 14-28. A successful encounter between CH3Cl
and a base results in a bond between the base and the carbon atom, so the HOMO of 
the base needs to overlap the p AO of carbon in the LUMO of Fig. 14-28a. Attack at 
the position marked “1” in the figure is unfavorable because any base MO would be 
near a nodal surface, yielding poor overlap with the LUMO. Therefore, attack at site 
2 is favored. As the previously empty LUMO of CH3Cl becomes partially occupied,
we expect a loss of bonding between C and Cl. Also, negative overlap between the 
forming C-base bond and the three “backside” hydrogens should encourage the latter 
to migrate away from the attacked side, as indicated in Fig. 14-28b. 

The tendency of a high-energy, occupied MO of the base to couple strongly with 
the LUMO of a molecule like CH3Cl depends partly on the energy agreement between
these MOs. If they are nearly isoenergetic, they mix much more easily and give a 
bonded combination of much lower energy. Molecules where the HOMO is high tend to 
be polarizable bases. A high-energy HOMO means that the electrons are not very well 
bound and will easily shift about to take advantage of perturbations. Such bases react 
readily with molecules having a low-energy LUMO (Fig. 14-29a). This corresponds to a
"soft-base-soft-acid" interaction in the approach of Pearson [17]. When the HOMO and 
LUMO are in substantial energy disagreement, orbital overlap becomes less important 
as a controlling mechanism, and simple electrostatic interactions may dominate. This 


(a) 


Figure 14-28 » (a) The LUMO of CH3Cl. (b) Positive overlap between HOMO of base B and 
LUMO of CH3Cl increases antibonding between C and Cl and also repels H atoms from their original 
positions. 
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Lewis acid Complex Lewis base 


(a) 


(b) 


Figure 14-29 » LUMO-HOMO interactions and splitting for (a) nearly degenerate levels, (b) well- 
separated levels. 


is a "hard-acid-hard-base" situation. Thus, we expect QMOT rules to apply to soft-soft, 
rather than hard-hard interactions. 
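
The dependence of the HOMO-LUMO interaction on energy agreement can be illustrated with a two-level model. The sketch below is only a rough illustration: it assumes a 2 × 2 secular problem with overlap neglected (a simplification not made in the QMOT arguments of this chapter), and the energies and coupling are arbitrary numbers chosen for the demonstration. The function and parameter names are illustrative.

```python
import math


def two_level_energies(e_homo, e_lumo, coupling):
    """Eigenvalues of [[e_homo, coupling], [coupling, e_lumo]] with overlap neglected."""
    mean = 0.5 * (e_homo + e_lumo)
    half_gap = 0.5 * (e_lumo - e_homo)
    root = math.hypot(half_gap, coupling)
    return mean - root, mean + root


def stabilization(e_homo, e_lumo, coupling):
    """Energy by which the lower combination drops below the lower starting level."""
    lower, _ = two_level_energies(e_homo, e_lumo, coupling)
    return min(e_homo, e_lumo) - lower


# Same coupling, increasing separation between base HOMO and acid LUMO (arbitrary units).
for gap in (0.0, 1.0, 5.0, 10.0):
    print(gap, round(stabilization(-10.0, -10.0 + gap, -1.0), 3))
```

With equal coupling, the stabilization is largest for degenerate levels and falls off steadily as the levels move apart, which is the trend sketched in Fig. 14-29.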

The reader may, by this time, begin to appreciate the very wide scope of QMOT 
and the large number of variations of a common theme that have been used. In this 
chapter we have given only a few representative examples. We have not described 
all the variations or all types of application. For fuller treatment, the reader should 
consult specialized books on this subject, some of which have been referred to in this 
chapter [18]. 


14-9.A Problems 


14-1. Calculate to first order the electronic energy of a hydrogen atom in its 1s state 
and in the presence of an additional proton at a distance of 2 a.u. What is the 
total energy to first order? Repeat for distances of 1 and 3 a.u. (See Appendix 3.) 


14-2. Evaluate and graph the effects of dividing $H_{aa} \pm H_{ab}$ by $1 \pm S$ for each of the 
following cases: $H_{aa}$ = 0, −5, −10, +10, −20. In each case, let $H_{ab}$ = −5, 
S = 0.5. Does the QMOT rule that antibonding interactions are more 
destabilizing than bonding interactions are stabilizing apply in all cases? Is the 
bonding level always the lower of the two? Does the QMOT expectation appear 
to be better followed by very low-energy levels, or by higher-energy levels? 


14-3. Table P14-3 is a list of electron affinities (in electron volts) of certain molecules 
and atoms. Can you rationalize the molecular values relative to the atomic values 
using QMOT ideas? 


TABLE P14-3 » Atomic and Molecular Electron Affinities (in electron volts) 


H (0.75)         Cl (3.61)     C2 (3.4)       O2 (0.45)     Cl2 (2.38) 
C (1.26)         Br (3.36)     CN (3.82)      F2 (3.08)     Br2 (2.6) 
N (0.0 ± 0.2)    I (3.06)      N2 (−1.6)      S2 (1.67)     I2 (2.55) 
O (1.46)         S (2.08)      CO (< −1.8)    SO (1.13)     ICl (1.43) 
F (3.40)         H2 (~−2)      CS (0.21)      FCl (1.5)     IBr (2.6) 


14-4. For some time, it was uncertain whether the ground states of CH2 and NH2+ 
are singlets, (2a1)^2, or triplets, (2a1)(1b1). The ground-state geometries of these 
systems have HAH angles of 134° (CH2) and 140-150° (NH2+). Based on other 
data described in the text for HAH systems, would you say these angles are more 
consistent with a singlet or a triplet ground state? Assuming that the first excited 
state is the other multiplicity, should the first excited state be more or less bent 
than the ground state? 


14-5. Based on Fig. 14-8, what should happen to the geometry of H2O upon 3a1 ← 1b1 
excitation? 


14-6. Carry out EHMO calculations for CH2 at HCH angles of 180°, 150°, 120°, 
and 90°. (Use a constant C-H bond distance of about 1 Å in all cases.) From 
an examination of the MO coefficients, sketch and assign symmetry symbols 
to each MO. Plot the energies versus angle. Critically discuss your computed 
results compared to Fig. 14-8. If there are differences, try to rationalize them. 
(Remember that any EHMO orbital energy change can be analyzed in terms of 
Mulliken population changes, as discussed in Chapter 10.) 


14-7. Using the EHMO energy formula [Eq. (10-25)], analyze the contributions to 
the orbital energy change between 180° and 120° that you calculated for the 
1b2 MO in Problem 14-6. What percentage of the energy change comes from 
loss of overlap between 1s and 2p AOs? From antibonding between hydrogens? 
Compare this with the discussion in the text. 


14-8. If overlap between an s AO and a p AO goes as cos θ (for constant and finite 
R) (Fig. P14-8), what is the mathematical expression for the rate of change of 
overlap with angle? Calculate the effect on overlap of a 30° shift, starting from 
θ = 0. Calculate the effect of a 30° shift from θ = 90°. 


Figure P14-8 » 


14-9. Use symmetry to help establish an energy level pattern and MO sketches for 
planar AH3 (equilateral triangular). Use QMOT rules to produce the correlation 
diagram for planar versus pyramidal AH3. Based on your reasoning, which of 
the following should be planar? BH3, CH3+, BeH3−, NH3, PH3, H3O+, CH3−. 
Can you think of any other shapes that might be examined as possibilities for 
AH3 systems? 


14-10. Use an EHMO program to generate MOs for CO2 at 180°, 150°, 120°, and 90°. 
Sketch the MOs, characterize their symmetries, and construct an orbital energy 
correlation diagram for this molecule. Indicate what causes each MO energy 
to rise or fall. Based on your figure, would you expect the following molecules 
to be linear or bent? BeCl2, C3, CO2, N3−, NO2, O3, F2O. 


14-11. Should a $\pi_u \leftarrow \pi_g$ electronic transition for ozone cause the O-O-O angle to 
increase or decrease according to Walsh-type arguments? Sketch the MOs and 
indicate your reasoning. [The notation $\pi_g$ and $\pi_u$ refers to the MOs in the 
linear molecule. For the equilibrium bent structure, these MOs are: 
$\pi_g \rightarrow a_2, b_2$; $\pi_u \rightarrow a_1, b_1$.] 


14-12. Consider the electrocyclic reaction wherein the allyl anion closes to form a 
cyclopropyl anion (VIII). (The negative charge in the cyclopropyl anion 
may be thought of as resulting from double occupancy of a p-π AO on the 
singly protonated carbon.) Sketch the orbitals being formed or destroyed in 
this process. Determine the orbital symmetries for the symmetry operations 
conserved in conrotatory and disrotatory modes of closure. Set up an orbital 
correlation diagram and decide which mode is more likely for thermal and 
photochemical reactions. 


14-13. Construct an orbital correlation diagram for the “broadside” 2 + 2 cycloaddition 
of two acetylenes to form cyclobutadiene (IX). Is the reaction likely to proceed 
through an intermediate of square planar geometry? Assuming this geometry, 
would reaction be easier thermally or photochemically? 


14-14. For a system having no symmetry elements (except E), what is the result of the 
noncrossing rule? 


14-15. a) Construct an orbital correlation diagram for the 2 + 4 cycloaddition 
(Diels-Alder) reaction discussed in the text. 
b) Construct a state correlation diagram for this reaction. 


14-16. Based on the orbital relations discussed in the text and extensions of these 
relations to other cases, formulate generalized verbal rules (Woodward-Hoffmann 
rules) for cycloadditions and sigmatropic shift reactions. 


Chapter 15 


Molecular Orbital Theory 
of Periodic Systems 


15-1 Introduction 


A structure is periodic in space, and hence has a periodic potential, if a subunit of 
the structure can be found that generates the entire structure when it is repeated over 
and over while traversing one or more spatial coordinates. Thus, a regular polymer 
can be "generated" mentally by translating a unit cell along an axis, sometimes with 
accompanying rotations. A crystalline solid results from such translations along three 
coordinates. In an analogous sense, some molecules are periodic. Benzene is an 
example since it can be generated by rotating a C-H unit in 60° increments about a 
point which ultimately becomes the molecular center. 

We have already seen (Chapters 8 and 13) that the orbital energies, degeneracies, 
and coefficients in benzene are to a great extent determined by symmetry. But these 
symmetry constraints can also be viewed as resulting from the cyclic "periodicity" of 
benzene, and this finite cyclic periodicity is, in essential ways, like the extended infinite 
periodicity of, say, regular polyacetylene or of graphite. It is reasonable, therefore, that 
some of the aspects of the energies and orbitals in these extended periodic structures 
are closely related to those in cyclic periodic systems. 

The concepts commonly used to describe periodic polymers, surfaces, or solids are 
closely related to those we have already developed for molecules, but their terminology 
and depiction are not familiar to most chemists. Our goal in this chapter is to develop 
an understanding of these concepts by progressing from more familiar cyclic systems, 
like benzene, to polymers, and to show how qualitative MO concepts apply to periodic 
systems in general. 


15-2 The Free Particle in One Dimension 


We will first examine the particle moving parallel to the x coordinate with no variation 
in potential energy. (Let V = 0.) This system is periodic because every segment of x 
is the same as every other, and it provides a convenient starting point for discussion of 
periodic systems in general. 

The quantum-mechanical solutions for this system are discussed in Chapter 2. The 
relevant points are: 


1. All non-negative energies are possible because there are no boundary conditions 
(beyond the conditions applying to well-behaved functions). 


2. Except for E = 0, all energy levels are doubly degenerate. 


3. The pair of linearly independent wavefunctions associated with each doubly degenerate 
energy E can be combined in an infinite number of ways to produce different 
resultant wavefunction pairs. Two of these pairs are especially convenient. These 
are 


$$\psi_{+} = \exp(+i\sqrt{2mE}\,x/\hbar)$$ (15-1a) 
$$\psi_{-} = \exp(-i\sqrt{2mE}\,x/\hbar)$$ (15-1b) 
and 
$$\psi_{\sin} = \sin(\sqrt{2mE}\,x/\hbar)$$ (15-2a) 
$$\psi_{\cos} = \cos(\sqrt{2mE}\,x/\hbar)$$ (15-2b) 


4. The exponential forms [Eqs. (15-1a, b)] are also eigenfunctions for the momentum 
operator, $\hat{p}_x = (\hbar/i)\,d/dx$. Thus $\psi_{+}$ corresponds to a particle moving parallel to 
the x axis with momentum $+\sqrt{2mE}$ (i.e., toward x = +∞), and $\psi_{-}$ corresponds 
to motion toward x = −∞. We can associate the double degeneracy of the energies 
with the fact that there is no difference in the potential felt by the particle, regardless 
of whether it moves from left to right or right to left: The system has two equivalent 
directions. 


5. The real forms [Eqs. (15-2a, b)] give oscillating particle distributions (except when 
E = 0). In a given pair, the sine puts maximum particle density where the cosine 
puts its minimum, and vice versa. As E increases, the frequency of nodes increases. 
These wavefunctions are not "pure" momentum states because they are not 
eigenfunctions of $\hat{p}_x$ (except when E = 0). They are formed by mixing of the exponential 
forms. However, these functions are sometimes more convenient to work with, 
especially pictorially, and they are just as good as the exponential forms as long as 
we are concerned with energy rather than momentum. 
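
A short worked check of points 4 and 5 may help. This is only a sketch, and it abbreviates $\sqrt{2mE}/\hbar$ as k for compactness. The real forms are equal-weight mixtures of the two exponentials. Applying the momentum operator to the sine converts it into the cosine instead of returning a multiple of the sine, whereas the kinetic energy operator returns the sine itself: 

```latex
\psi_{\sin} = \frac{\psi_{+} - \psi_{-}}{2i}, \qquad
\psi_{\cos} = \frac{\psi_{+} + \psi_{-}}{2}

\hat{p}_x\,\psi_{\pm} = \frac{\hbar}{i}\frac{d}{dx}\,e^{\pm ikx} = \pm\hbar k\,\psi_{\pm}

\hat{p}_x\,\psi_{\sin} = \frac{\hbar}{i}\,k\cos kx = -i\hbar k\,\psi_{\cos}

-\frac{\hbar^{2}}{2m}\frac{d^{2}}{dx^{2}}\,\psi_{\sin}
  = \frac{\hbar^{2}k^{2}}{2m}\,\psi_{\sin} = E\,\psi_{\sin}
```

The third line shows why the sine is not a momentum eigenfunction, and the last line shows that it nevertheless remains an energy eigenfunction. 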


Equations (15-1) and (15-2) are sometimes written with $\sqrt{2mE}/\hbar$ replaced by the 
symbol k, with k having any value from zero to infinity. Evidently, k in this system is 
proportional to particle momentum, hence to the square root of the kinetic energy, T. 
(Since V = 0, the kinetic energy is identical to the total energy, E.) For a free particle, 
de Broglie’s relation holds, so p is proportional to 1/λ, where λ is the de Broglie 
wavelength. This means that k is proportional to the number of de Broglie wavelengths 
per unit of distance. Consequently, k is often referred to as the wavenumber. It is 
the de Broglie-wave-equivalent to the wavenumber of light or of any other classical 
harmonic wave. 


Figure 15-1 » Three ways of picturing the energies of the free particle. (b) and (c) show the 
dependence of E on k. In (c), the energies are understood to be degenerate (except at E = 0). 

Summarizing, for the free particle, 


$$k \propto p_x \propto \sqrt{E} = \sqrt{T} \propto 1/\lambda$$ (15-3) 


There are several ways in which the solutions of the Schrödinger equation for periodic 
structures are displayed graphically. One way is the familiar one of drawing all 
the energy levels against a vertical energy scale, as in Fig. 15-1a. (Since all energies 
are allowed for the free particle, this gives a single energy at E = 0 and a degenerate 
continuum, or "band", from E > 0 to ∞, rather than a series of discrete lines.) 
Alternatively, we can plot E versus k to obtain the parabolic plot of Fig. 15-1b. This 
can be simplified, though, by plotting only one arm of the parabola (Fig. 15-1c), mak- 
ing implicit the fact that, for each solution with wavenumber k, there is a degenerate 
solution with wavenumber —k. Graphs that relate energy to k for periodic polymers, 
surfaces, or solids are called band diagrams. Figure 15-1c is the band diagram for the 
one-dimensional free particle. 

Another quantity of great physical importance is the number of states near a particular 
energy value. This is called the "density of states" (DOS). If we consider Fig. 15-1c, 
we can see that the states associated with 0 < |k| < 1 all lie within a certain range which 
we call $E_0 - E_1$ in Fig. 15-2a. If we assume that all values of k are equally likely (we 
show this to be true in Section 15-5), then there is an equal (infinite) number of states 
associated with 1 < |k| < 2, and these lie in the larger range $E_1 - E_2$. Hence, the states 
in the range $E_1 - E_2$ are less densely packed than those in the range $E_0 - E_1$. It is 
not hard to show that the density of states for the free particle drops off as $1/\sqrt{E}$, as 
plotted in Fig. 15-2b. 
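
The $1/\sqrt{E}$ behavior can be sketched in a few lines. The sketch assumes, as the text does, that states are spread uniformly in k, so that the number of states per unit energy is proportional to |dk/dE|. The factor of 2 counts the degenerate ±k pair: 

```latex
E = \frac{\hbar^{2}k^{2}}{2m}
\quad\Longrightarrow\quad
|k| = \frac{\sqrt{2mE}}{\hbar}

\mathrm{DOS}(E) \;\propto\; 2\left|\frac{dk}{dE}\right|
  = \frac{2}{\hbar}\sqrt{\frac{m}{2E}}
  = \frac{\sqrt{2m}}{\hbar}\,E^{-1/2}
```

Equal steps in k therefore span ever larger steps in E as E grows, which is the thinning-out of states described above. 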

Ultimately we will consider one-dimensional periodic structures (polymers) with 
varying potentials caused by the presence of nuclei and other electrons, and we will find 
that these systems retain some of the above features. In particular, degeneracy due to 
directional equivalence and choice of real or complex forms for orbitals persist. Also, 
the quantity k retains its meaning as a wavenumber (in a restricted sense). However, 
k loses its simple relation to momentum and energy (because the electrons possess 
potential energy as well as kinetic energy), and, typically, certain energies become 
disallowed. This changes the infinite band to band segments separated by gaps in energy. 


Figure 15-2 » The states between |k| = 0 and 1 are equal in number to those between |k| = 1 and 
2, but the former set is packed into a smaller energy range, producing a greater density of states. 


15-3 The Particle in a Ring 


A particle of mass m constrained to move in the angular coordinate φ about a ring 
of radius r with V(φ) = 0 is the cyclic analog of the free particle. This system, also 
described in Chapter 2, is similar in many ways to the one just discussed. Except for the 
case of E = 0, all solutions are doubly degenerate and describable with real 
(trigonometric) or complex (exponential) functions. The quantum number j (we will use j in 
cyclic systems, k in linear systems) is proportional to the angular momentum, to the root 
of the energy, to the reciprocal wavelength, and to the number of de Broglie waves in 
one circuit of the ring. However, since the number of waves in the ring must be integral, 
we have a periodic boundary condition that restricts j to integer values and leads to a 
discrete energy spectrum as opposed to the continuous spectrum of the free particle. 

The diagrams summarizing relations between energies, j, and number of states are 
collected in Fig. 15-3. 


Figure 15-3 » (a) The energy for a particle in a ring has parabolic dependence on j, but exists only 
when j is an integer. (b) The number of states versus energy. This is the discrete-state analog of the 
density-of-states plot for a continuum of energies. 


Cyclic structures with varying potentials exist (e.g., benzene), and such systems 
retain the degeneracy and real or complex orbital features. Also, j continues to have 
meaning (restricted) with respect to wavelength and number of nodes. However, angular 
momentum and energy are no longer simply related to j. 


15-4 Benzene 


We next examine benzene, a cyclic system having a nonuniform potential. We consider 
only the π electrons as described by the simple Hückel method, since the features we 
wish to point out are already present at that elementary level. A formula is presented 
(without derivation) in Chapter 8 [Eq. (8-52)] for the MOs of molecules like benzene. 
This gives, for the jth MO of benzene, 


$$\phi_j = \sum_{n=1}^{6} \left\{6^{-1/2}\exp[2\pi i j(n-1)/6]\right\}\,2p_\pi(n), \qquad j = 0, 1, \ldots, 5$$ (15-4) 


where $i = \sqrt{-1}$ and $2p_\pi(n)$ is a $2p_\pi$ atomic orbital centered on the nth carbon atom. 
The energy for MO $\phi_j$ is equal to [Eq. (8-51)] 


$$\epsilon_j = \alpha + 2\beta\cos(2\pi j/6).$$ (15-5) 


If j = 0, all the exponentials equal one and Eq. (15-5) gives $\epsilon_0 = \alpha + 2\beta$, which can 
be recognized as the lowest-energy π MO, and Eq. (15-4) gives 
$\phi_0 = (1/\sqrt{6})[2p_\pi(1) + 2p_\pi(2) + 2p_\pi(3) + 2p_\pi(4) + 2p_\pi(5) + 2p_\pi(6)]$. 
This gives all the $2p_\pi$ AOs the same 
phase, which is what we expect for the totally bonding, lowest-energy π MO. 

If j = 1, $\epsilon_1 = \alpha + 2\beta\cos(\pi/3) = \alpha + 2\beta(1/2) = \alpha + \beta$. This is an energy in the 
second-lowest level, which is a degenerate level. The corresponding MO is 
$\phi_1 = (1/\sqrt{6})\{[\exp(0)]2p_\pi(1) + [\exp(\pi i/3)]2p_\pi(2) + [\exp(2\pi i/3)]2p_\pi(3) + [\exp(\pi i)]2p_\pi(4) + [\exp(4\pi i/3)]2p_\pi(5) + [\exp(5\pi i/3)]2p_\pi(6)\}$. 
This somewhat formidable-looking MO is complex. Because $\phi_1$ is one of a degenerate 
pair of MOs (the other one is $\phi_5$), we can mix the complex MOs to obtain real ones 
that are still eigenfunctions. 
The real forms of the benzene MOs are described in Section 8-10D. The MO formulas 
produced by Eq. (15-4) are real for nondegenerate cases but are usually complex for 
degenerate MOs. However, it is always possible to mix any pair of degenerate complex 
MOs to produce a pair of degenerate real MOs. Notice that the complex MOs place 
the same electron density at each carbon whereas the real forms do not. The real MOs 
show nodes at various points in the ring (see Fig. 8-13) just as the real wavefunctions 
for the free particle show nodes at various points in x. 

In these formulas, j is restricted to the integer values ranging from 0 to 5, giving six 
MOs. If one tries other integer values of j in formulas (15-4) and (15-5), one simply 
reproduces members of the above set. Indeed, any six sequential integer values for 
j produce the same set of solutions as does the sequence 0 to 5. In particular, the set 
−2, −1, 0, 1, 2, 3 is perfectly acceptable and provides a match with the conventions of 
solid-state physics and chemistry. 
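
The claim that any six sequential j values give the same solutions is easy to check numerically for the energies of Eq. (15-5). The following is a minimal sketch, not part of the original treatment: the function names are illustrative, and the choice α = 0, β = −1 is just a convenient energy unit assumed for the example. 

```python
import math

def huckel_energy(j, n_atoms=6, alpha=0.0, beta=-1.0):
    """Simple Hückel energy of Eq. (15-5) for index j on an n_atoms ring.

    alpha = 0 and beta = -1 are illustrative units, not values from the text.
    """
    return alpha + 2.0 * beta * math.cos(2.0 * math.pi * j / n_atoms)

def energy_set(j_values):
    """Sorted energies, rounded to suppress floating-point noise."""
    return sorted(round(huckel_energy(j), 10) for j in j_values)

set_0_to_5 = energy_set(range(0, 6))     # j = 0, 1, ..., 5
set_m2_to_3 = energy_set(range(-2, 4))   # j = -2, -1, ..., 3
set_7_to_12 = energy_set(range(7, 13))   # another run of six contiguous integers

print(set_0_to_5)
print(set_0_to_5 == set_m2_to_3 == set_7_to_12)
```

In these units the six energies are −2, −1, −1, +1, +1, +2 (that is, α + 2β, α + β twice, α − β twice, α − 2β), and the three index ranges give identical lists. 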

The energies for the six unique MOs of benzene are reproduced over and over again 
if we allow the index j to run beyond the specified range. This is depicted in Fig. 15-4a, 
and it is easy to see that the same six energies result for any six contiguous j values. 


Figure 15-4 » (a) Energies for benzene as a function of j. (b) Unique energies of benzene. 
(c) Number of states at each energy. All data refer to π energies at the simple Hückel level. 


The set from −2 to 3, alluded to above, can be replotted in condensed form over the 
range 0-3 (Fig. 15-4b) if we keep in mind the fact that the energies are degenerate 
except for the first and last. The “number of states" diagram for benzene is sketched in 
Fig. 15-4c. 

If we consider a monocycle with thousands of carbon atoms, Eq. (15-5) gives 
(condensed) results as sketched in Fig. 15-5a. The very large number of energies still lies 
in the range α + 2β to α − 2β, with the cosine wave now stretched over a much larger 
range of integers. The well-separated points of Fig. 15-4b coalesce into the line of 
Fig. 15-5a, which now appears almost continuous. The shape of the curve in Fig. 15-5a 
makes it evident that the states around j = 0 and j = N/2 are closer together in energy 
than are those around j = N/4, giving the density of states curve shown in Fig. 15-5b. 


Figure 15-5 » (a) Hückel energies and (b) density of states for a very large number of carbons in 
a cycle. (Panel (a): energy, from α + 2β up to α − 2β, versus j from 0 to N/2. Panel (b): the same 
energy scale versus density of states.) 
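
The crowding of states near the band edges can be illustrated by counting levels in equal-width energy windows. This is a rough sketch under stated assumptions: the ring size of 6000 atoms, the window width, and the units α = 0, β = −1 are all arbitrary choices made for the illustration, and the helper names are hypothetical. 

```python
import math

def ring_energies(n_atoms, alpha=0.0, beta=-1.0):
    """Hückel energies alpha + 2 beta cos(2 pi j / N) for j = 0 .. N-1."""
    return [alpha + 2.0 * beta * math.cos(2.0 * math.pi * j / n_atoms)
            for j in range(n_atoms)]

def count_in_window(energies, lo, hi):
    """Number of levels with lo <= E < hi."""
    return sum(1 for e in energies if lo <= e < hi)

energies = ring_energies(6000)   # illustrative "very large" cycle
width = 0.2                      # window width in units of |beta|

# Windows of equal width at the bottom, middle, and top of the band.
near_bottom = count_in_window(energies, -2.0, -2.0 + width)
near_middle = count_in_window(energies, -width / 2, width / 2)
near_top = count_in_window(energies, 2.0 - width, 2.0 + 1e-9)

print(near_bottom, near_middle, near_top)
```

The two edge windows hold several times as many levels as the window at mid-band, which is the qualitative shape of the density-of-states curve described above. 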

There is a qualitative difference between what we found for the particle in a ring and 
benzene. In the former case, j can increase without limit, and energy keeps increasing 
as the square of j. In benzene, increasing j beyond the prescribed range simply causes 
the energy to cycle back and forth between α + 2β and α − 2β. Why do these systems 
behave so differently in this regard? The energy of the particle in a ring increases as we 
fit more and more waves into a circle of fixed radius, obtaining, therefore, shorter de 
Broglie wavelengths. The Hückel energy of a benzene π MO depends on the extent of 
bonding and antibonding character between adjacent π AOs. When j is zero, all the 
interactions are bonding and ε = α + 2β. As j increases, bonding interactions disappear, 
ultimately to be replaced by antibonding interactions. At j = 3, all interactions are 
antibonding and ε = α − 2β. It certainly makes sense that this is the highest energy 
an MO can have because this is the most antibonding arrangement imaginable. But 
what happens mathematically to make this work out? Let us examine the exponential 
functions in Eq. (15-4) for two values of j, say j = 3 and j = 9, to see if we can resolve 
this question. An immediate problem confronts us: The function is complex, and 
sketching a complex function is not convenient. However, because the complex MOs 
can always be mixed to form real MOs, we can let the mixing occur within Eq. (15-4) 
itself to give sine and cosine equivalents. For example, 


$$\phi_{j,\cos} = \sum_{n=1}^{6} \left\{6^{-1/2}\cos[2\pi j(n-1)/6]\right\}\,2p_\pi(n).$$ (15-6) 


(For some values of j, this expression will yield MOs that are not normalized.) 
Equation (15-6) tells us that MO $\phi_{j,\cos}$ has six terms, each term being a $2p_\pi$ AO times a 
coefficient. The values of the coefficients are given by the term in curly brackets. This 
term produces a discrete set of values, since n is a discrete set of integers, but we can 
sketch a continuous function [by replacing (n − 1)/6 with the continuous variable φ] 
and then locate the places on this "coefficient wave" that correspond to the discrete 
points of interest. Sketches for this "coefficient wave" when j = 3 and j = 9 are shown 
in Fig. 15-6. The special points of interest, where actual coefficient values are given 
for benzene MOs, are also shown (at the positions labeled 1 to 6), and it is obvious that 
the same set of coefficients is produced by each function. Increasing j has caused the 
coefficient wave to oscillate with a shorter wavelength, but the extra oscillations do not 
register because they occur between the special points of interest where we are sampling 
the function (i.e., at the points where the carbon nuclei reside). In other words, the extra 
oscillations in our benzene coefficient functions are just “empty wiggling” and have 
no effect on the MO, so we cut off the j range where the meaningful wiggling starts 
to become empty wiggling. The reason that the particle in a ring does not show this 
behavior is that the exponential (or sine, cosine) functions in that system actually are 
the wavefunctions, while in benzene these functions are sampled only at discrete points 
where AOs are located, and the resulting coefficients are then used to produce MOs. 


Figure 15-6 » Coefficient waves for j = 3 (dashed curve) and j = 9 (dotted curve). Both curves 
intercept the same coefficient values ($\pm 1/\sqrt{6}$) at the positions related to carbon atoms 1 to 6, so the 
curves produce the same MO. 
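
The "empty wiggling" argument can be spot-checked numerically. The sketch below evaluates the curly-bracket coefficients of Eq. (15-6) for j = 3 and j = 9 at the six atomic positions, and then compares the two continuous coefficient waves at a point between atoms. The function names and the sampling point are illustrative assumptions, not notation from the text. 

```python
import math

def cosine_coefficients(j, n_atoms=6):
    """Coefficients 6^(-1/2) cos[2 pi j (n - 1)/6] of Eq. (15-6), n = 1..6."""
    return [math.cos(2.0 * math.pi * j * (n - 1) / n_atoms) / math.sqrt(n_atoms)
            for n in range(1, n_atoms + 1)]

def coefficient_wave(j, phi):
    """Continuous 'coefficient wave'; phi stands in for (n - 1)/6."""
    return math.cos(2.0 * math.pi * j * phi) / math.sqrt(6)

c3 = cosine_coefficients(3)
c9 = cosine_coefficients(9)

# At the atomic positions the two waves agree.
same_at_atoms = all(math.isclose(a, b, abs_tol=1e-12) for a, b in zip(c3, c9))

# A quarter of the way from atom 1 to atom 2 (an arbitrary off-atom point)
# the two waves take different values.
phi_between = 0.25 / 6
differ_between = not math.isclose(coefficient_wave(3, phi_between),
                                  coefficient_wave(9, phi_between),
                                  abs_tol=1e-6)

print(same_at_atoms, differ_between)
```

Both flags come out true: the two waves differ between the nuclei but deliver the same alternating ±1/√6 coefficients at the nuclei, so they define the same MO. 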

Notice that the MOs for benzene as given by Eq. (15-4) are produced from an 
equation having the following form: There is an exponential term that, for each MO, 
supplies the coefficients for various carbons in the molecule, and there is a basis set of 
functions located on the various carbons. This basis set has the same “periodicity” as 
does the molecule. (In the case of benzene, we have so far taken this to be six identical 
$2p_\pi$ AOs.) Wavefunctions for all periodic systems have this same form: an exponential 
(or equivalent trigonometric) expression times a periodic basis. This is the content of 
Bloch's theorem, which we prove in the next section. 


EXAMPLE 15-1 What coefficients are generated for a benzene MO by a coefficient 
wave of the type shown in Fig. 15-6 with j = −3? j = 0? 


SOLUTION » The arguments of the cosine function of Eq. (15-6) will be the negative of those 
of the j = 3 case. Because the cosine is symmetric about an argument of zero, we obtain the same 
curve for j = −3 as for j = +3, hence the same MO coefficients. This demonstrates that the MO 
generated by j − 6 is the same as that generated by j. For j = 0, the cosine arguments all vanish, 
the cosines all equal +1, and the curve of Fig. 15-6 becomes a straight line at a coefficient value of 
$+1/\sqrt{6}$. This produces the lowest-energy MO of benzene. 


15-5 General Form of One-Electron Orbitals in Periodic Potentials: Bloch's Theorem 


We will establish in this section the general mathematical nature of eigenfunctions for 
periodic hamiltonians. This is the content of Bloch's theorem. The actual proof will be 
carried through for cyclic “periodic” systems, and those results will then be extended to 
noncyclic periodic systems. The rationale for this is that a nearly infinite linear periodic 
structure can be treated as cyclic without introducing error. That is, a sufficiently long 
extended chain of atoms can be assumed to have the same wavefunctions and energies 
as the same chain joined end to end to form a cycle. For short chains and rings, this 
is not the case; the lowest-energy MO for hexatriene is not as low in energy as that 
for benzene, nor is it uniform over the whole chain (as it is in benzene). But for long 
enough chains, the difference becomes negligible. 

Bloch's theorem, as it applies to periodic cycles, states that eigenfunctions have the 
form 


$$\psi_j(\phi) = \exp(ij\phi)\,U_j(\phi)$$ (15-7) 


where $i = \sqrt{-1}$, j is an integer, and $U_j(\phi)$ has the periodicity of the cycle. If the 
cycle belongs to the $C_n$ point group, then $U(\phi) = U(\phi + 2\pi/n)$. Functions of the 
form of Eq. (15-7) are called Bloch functions. It was stated in Section 13-8 that every 
wavefunction or orbital is a member of a basis for an irreducible representation for the 
point group of the system, so it follows that the wavefunctions we seek to characterize 
have to be bases for irreducible representations of the $C_n$ point group. These groups 
are discussed in Section 13-11, where it is shown that the functions $\epsilon = \exp(i\phi)$ and 
$\epsilon^{*} = \exp(-i\phi)$ are bases for irreducible representations for these groups, which is to 
say that they are eigenfunctions for the rotation operator, no matter what the value of the 
integer n in $C_n$. We consider the more general functions $\exp(\pm ij\phi)$ with j an integer 
(so that the function obeys the cyclic continuity condition) and ask what happens when 
such functions are subjected to a rotation. Let the rotation operator be symbolized $C_n^q$ 
for a rotation of q times $2\pi/n$. The n-fold cyclic system is invariant to this rotation if 
q is an integer. It is not difficult to show that (see Section 13-11) 


$$C_n^q f(\phi) = f(\phi - 2\pi q/n)$$ (15-8) 


(For example, clockwise rotation by 60° brings to each point in the circle the value that 
used to be 60° in the counterclockwise direction.) Then 


$$C_n^q \exp(ij\phi) = \exp[ij(\phi - 2\pi q/n)] = \exp(-2\pi i q j/n)\,\exp(ij\phi)$$ (15-9) 


Equation (15-9) shows that $\exp(ij\phi)$ is an eigenfunction for the rotation operator, so 
$\exp(ij\phi)$ is a basis for an irreducible representation, with j any integer. We can build 
additional flexibility into this eigenfunction if we write an exponential function in a 
more complicated way: $\exp[i(j + nN)\phi]$ with N also an integer. We know that this 
must still be a basis function because (j + nN) must still be an integer. If we operate 
with $C_n^q$, we find that (Problem 15-6) 


$$C_n^q \exp[i(j + nN)\phi] = \exp(-2\pi i q j/n)\,\exp[i(j + nN)\phi]$$ (15-10) 


This shows that our newest, most general, exponential has the same eigenvalue for 
$C_n^q$ no matter what integer value we choose for N. [Observe that the eigenvalue in 
Eq. (15-10) depends on q, j, and n, but not N.] Therefore, since we have an unlimited 
number of choices for the integer N, we have an unlimited number of degenerate 
eigenfunctions for $C_n^q$ for each choice of j. We can mix together such degenerate 
eigenfunctions in any way we please and still have an eigenfunction. Let us take the 
linear combination 


$$\sum_{N=-\infty}^{\infty} A_N \exp[i(j + nN)\phi]$$ (15-11) 


This can be factored into the form 


$$\exp(ij\phi) \sum_{N=-\infty}^{\infty} A_N \exp(inN\phi)$$ (15-12) 


The sum is a function of φ which we can call U(φ). It has the same periodicity in φ 
as the cycle because n is equal to the number of identical cells in the cycle and N is 
an integer. It is actually the general Fourier series expansion of a periodic function. 
For N = 0, the contribution is constant ($A_0$) at all φ. For N = ±1, the functions repeat 
n times around the cycle and clearly have the periodicity of the system. For N = ±2 
the functions repeat 2n times, so they obviously still have the periodicity of the system. 
Since all the individual members of the sum have the periodicity of the system, so does 
the sum itself. The specific nature of the resultant function U(φ) depends on the choice 
of coefficients $A_N$. 
We have shown that 


$$\exp(ij\phi)\,U(\phi)$$ (15-13) 


with U(φ) n-fold periodic is a general mathematical form for bases for irreducible 
representations for the $C_n$ group. But wavefunctions, ψ(φ), for a periodic potential 
must also be bases for such irreducible representations. Therefore, wavefunctions must 
be expressible in this form: 


$$\psi_j(\phi) = \exp(ij\phi)\,U(\phi), \qquad j = 0, \pm 1, \pm 2, \ldots$$ (15-14) 


We are not claiming that all functions of the form Eq. (15-13) are wavefunctions for the 
n-fold periodic potential. Rather, we are claiming that all such wavefunctions belong 
to the class of functions represented by Eq. (15-13). Only certain choices of U(φ) will 
serve to make Eq. (15-13) become a wavefunction for the system, and the choice of an 
appropriate function U(φ) is not necessarily the same at different values of j. To make 
this dependence on j explicit, we include it as a subscript on U: 


$$\psi_j(\phi) = \exp(ij\phi)\,U_j(\phi), \qquad j = 0, \pm 1, \pm 2, \ldots$$ (15-15) 


with $U_j(\phi)$ n-fold periodic. This completes the proof of Bloch's theorem for periodic 
cycles. 
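
The two facts the proof rests on can be spot-checked numerically: a truncated version of the sum in Eq. (15-12) is n-fold periodic, and the product exp(ijφ)U(φ) is an eigenfunction of the rotation of Eq. (15-8) with the eigenvalue of Eq. (15-9). The sketch below is illustrative only. The coefficients A_N, the truncation to |N| ≤ 2, and the test values of φ, j, and q are arbitrary hypothetical choices, and the function names are not from the text. 

```python
import cmath
import math

N_FOLD = 6  # n, the number of identical cells in the cycle (benzene-like)

# Arbitrary, hypothetical Fourier coefficients A_N for N = -2 .. 2.
A = {-2: 0.1, -1: 0.3 - 0.2j, 0: 1.0, 1: 0.5j, 2: -0.25}

def U(phi):
    """Truncated sum of Eq. (15-12): sum over N of A_N exp(i n N phi)."""
    return sum(a * cmath.exp(1j * N_FOLD * N * phi) for N, a in A.items())

def bloch(j, phi):
    """Function of the form of Eq. (15-13): exp(i j phi) U(phi)."""
    return cmath.exp(1j * j * phi) * U(phi)

def rotate(func, q):
    """C_n^q acting on func, following Eq. (15-8): phi -> phi - 2 pi q / n."""
    return lambda phi: func(phi - 2.0 * math.pi * q / N_FOLD)

phi0, j, q = 0.37, 2, 1  # arbitrary test point and indices

# U has the periodicity of the cycle.
periodic = cmath.isclose(U(phi0), U(phi0 + 2.0 * math.pi / N_FOLD), abs_tol=1e-12)

# The rotated function equals the eigenvalue of Eq. (15-9) times the original.
eigenvalue = cmath.exp(-2j * math.pi * q * j / N_FOLD)
rotated_value = rotate(lambda p: bloch(j, p), q)(phi0)
is_eigenfunction = cmath.isclose(rotated_value, eigenvalue * bloch(j, phi0),
                                 abs_tol=1e-12)

print(periodic, is_eigenfunction)
```

Changing the coefficients in A alters the shape of U(φ) but not either result, which mirrors the statement that the eigenvalue does not depend on N. 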

Before extending this result to noncyclic systems, a number of comments and 
clarifications should be made. First, the function $U_j(\phi)$ may still seem to be something 
of a mystery. Think of it this way. For benzene π MOs we know we need to create 
a basis set having two properties: It should be like a $2p_\pi$ AO near any carbon atom, 
and it should be the same at each carbon atom. Bloch's theorem does not comment on 
whether U should look like a $2p_\pi$ AO at a carbon atom, but it does require that it be 
the same at each carbon atom, that it be the same at the midpoints between adjacent 
carbons, that it be the same at a point one bohr above each carbon atom, etc. In short, 
U must be symmetric for the six-fold rotation operation. It is up to us to figure out 
what that six-fold symmetric basis set should look like in detail, and to construct U 
appropriately. 

Second, U depends on j. What does this mean? Suppose we compare the lowest- 
and highest-energy π MOs for benzene. The lowest is totally bonding and the highest is 
totally antibonding. Ordinarily, we use the identical set of $2p_\pi$ AOs as basis functions 
for both MOs, but this is not required. Perhaps the $p_\pi$ AOs for the bonding MO 
would be more appropriately chosen to be slightly larger, so they overlapped better, and 
maybe those for the antibonding MO should be slightly smaller, to reduce antibonding 
interactions. These objectives could be achieved, for example, by making U a mixture 
of $2p_\pi$ and $3p_\pi$ AOs on each center and letting the nature of the mixture depend on j. 
This makes for a more involved variational calculation, so it is normally not included in 
cases like this one where such effects are expected to be small. ($3p_\pi$ character should 
not mix in to a significant extent because the $3p_\pi$ AO is considerably higher in energy 
than the $2p_\pi$ AO.) However, there are many cases where a unit cell contains several 
AOs in the same symmetry class and of similar energy (e.g., $2s$, $2p_\sigma$), and it is then 
quite important to allow the mix of these to vary with j. A given band, for example, 
could start out being mainly $2s$ at low energies and end up being mainly $2p_\sigma$ at the 
high-energy end. This means that the appropriate description of the function U must 
be redetermined for each choice of wavenumber j. 

Third, careful comparison of Eqs. (15-15) and (15-4) shows that they are not exactly
the same. Equation (15-15) instructs us to find a periodic function U_j(φ) and multiply it
by exp(ijφ) at every point in φ. Think of the sine or cosine related to the exponential and
imagine what this means as we multiply it times a 2p_π on some carbon. Say the cosine
is increasing in value as it sweeps clockwise past the carbon nucleus at 2:00 on a clock
face. This produces a product of cosine and 2p_π that is unbalanced (smaller toward
1:00 than toward 3:00), because the cosine wave modulates U_j(φ) everywhere. But
Eq. (15-4) is different. It instructs us to take the value of the cosine at 2:00 and simply
multiply the 2p_π AO on that atom by that number. The 2p_π AO is not caused to become
unbalanced. Only its size in the MO is determined by the cosine. Equation (15-4) is
called a Bloch sum. Such sums are approximations to Bloch functions, but any errors
inherent in this form are likely to be quite small if the basis functions and unit cell are
sensibly chosen. (Using Bloch sums is similar in spirit to the familiar procedure of
approximating a molecular wavefunction as a linear combination of basis functions.)

Extending Bloch's theorem to linear periodic structures requires identifying an
appropriate substitute for the coordinate φ and reconsidering the appropriate values for
j (which we will call k in noncyclic systems). We require that these substitutions yield
eigenfunctions and eigenvalues for the linear system that are the same as what results
when it is treated as a cycle. Let us suppose that we have some very large number, n,
of unit cells in the cycle, with a repeat distance of a. This means that the cycle has a
C_n axis and also a circumference of na, so the linear coordinate (call it x) ranges from
0 to na as the angular coordinate φ ranges from 0 to 2π. When we move s steps around
the cyclic polymer, the exponential's argument changes by 2πijs/n, with j
an integer. The same number of steps should give us the same effect on the exponential
for the linear polymer. If we choose our exponential's argument to be of the form ikx,
then s steps change the argument by iksa, so we require that iksa = 2πijs/n. This gives
k = 2πj/na, with j an integer. We see that, for extremely large values of n, k becomes
an effectively continuous variable (points separated by 2π/na). Also, we see that k is
uniformly distributed (j = 0, ±1, ±2, etc.), which affects our DOS calculations, as was
mentioned in Section 15-2.
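
The matching condition can be checked numerically. What follows is a minimal Python sketch, not part of the original treatment: the function names and the values n = 12 and a = 1.4 are illustrative assumptions. It compares the cyclic phase factor exp(2πijs/n) with the linear one exp(iksa) for k = 2πj/na, and evaluates the spacing between neighboring k values.

```python
import cmath
import math

def ring_phase(j_index, s, n):
    """Phase factor exp(2*pi*i*j*s/n) after s steps around an n-membered cycle."""
    return cmath.exp(1j * 2.0 * math.pi * j_index * s / n)

def chain_phase(k, s, a):
    """Phase factor exp(i*k*s*a) after s translations of length a along a chain."""
    return cmath.exp(1j * k * s * a)

def k_from_j(j_index, n, a):
    """Wavenumber k = 2*pi*j/(n*a) that makes the two phase factors agree."""
    return 2.0 * math.pi * j_index / (n * a)

n, a = 12, 1.4          # illustrative values (a in arbitrary length units)

# Largest disagreement between the cyclic and linear phase factors.
max_mismatch = max(
    abs(ring_phase(j, s, n) - chain_phase(k_from_j(j, n, a), s, a))
    for j in range(-3, 4)
    for s in range(n)
)

# Neighboring k values differ by 2*pi/(n*a), which shrinks as n grows.
k_spacing = k_from_j(1, n, a) - k_from_j(0, n, a)
print(max_mismatch, k_spacing, 2.0 * math.pi / (n * a))
```

Increasing n in this sketch makes the spacing smaller, which is the sense in which k becomes effectively continuous.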

We can also ask about the range of k, which should correspond to the 0–2π range of
φ. We expect “empty wiggling” to occur when the sin or cos equivalent of exp(ikx) has
a wavelength shorter than 2a. The largest nonredundant value of k should come when
cos(kx) goes from cos(0) to cos(2π) as x goes from 0 to 2a. Therefore kx = k(2a) = 2π,
which gives k = π/a. Allowing for negative values as well, k has a range of 2π/a. Since
the convention is to center k about zero, k runs from
−π/a to π/a. (Note that this range becomes infinite for the free particle in a constant
potential, for which a is zero.) The range of k over which all unique wavefunctions for
the periodic system are produced once and only once (−π/a < k ≤ π/a) is called the
first Brillouin zone (FBZ).
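
The redundancy of k values outside the FBZ can be illustrated with a short Python sketch. It is an illustration under stated assumptions rather than part of the original text: the helper names, the repeat distance a = 1, and the test value k = 1.5π/a are all hypothetical. A wavenumber outside the zone is shifted by a multiple of 2π/a, and the coefficients it gives at the cell positions x = sa are compared with those of the shifted value.

```python
import cmath
import math

def fold_into_fbz(k, a):
    """Return the equivalent wavenumber in the interval (-pi/a, pi/a]."""
    zone_width = 2.0 * math.pi / a
    return k - zone_width * math.ceil(k / zone_width - 0.5)

def coefficients(k, a, n_cells):
    """Coefficient-wave values exp(i*k*s*a) sampled at the cell positions x = s*a."""
    return [cmath.exp(1j * k * s * a) for s in range(n_cells)]

a = 1.0                           # illustrative repeat distance
k_outside = 1.5 * math.pi / a     # wavelength 4a/3, shorter than 2a
k_inside = fold_into_fbz(k_outside, a)

# Largest difference between the two sets of sampled coefficients.
difference = max(
    abs(c1 - c2)
    for c1, c2 in zip(coefficients(k_outside, a, 8), coefficients(k_inside, a, 8))
)
print(k_inside * a / math.pi, difference)
```

Between the sampling points the two waves differ, but that extra oscillation is the "empty wiggling": it never shows up in the coefficients picked off at the unit cells.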




For noncyclic, one-dimensional systems, then, eigenfunctions have the form

$$\psi_k(x) = \exp(ikx)\,U_k(x), \qquad -\pi/a < k \le \pi/a$$ (15-16)


Just as was true for cyclic cases, we can approximate this Bloch functional form
with a Bloch sum:

$$\psi_k = \sum_{s=0}^{n-1} \exp(iksa)\,U_k(sa), \qquad -\pi/a < k \le \pi/a$$ (15-17)


In chemistry, it is Bloch sums that are normally used. 
Two- or three-dimensional periodic systems can be treated similarly, with separate 
k vectors for each of the independent translational directions. 


15-6 A Retrospective Pause


We have reviewed some familiar systems and presented Bloch functions and sums for 
one-dimensional cyclic and linear periodic systems. The practical lesson to this point is 
that one can generate useful one-electron wavefunctions for some such systems simply 
by choosing a basis set of AOs that is identical from one unit cell to the next and then 
modulating that set with exponential (or sine and cosine) functions. This is most easily 
done by using the exponential or trigonometric functions as “coefficient waves” from 
which coefficient values are plucked at points where a unit cell basis set is centered 
(i.e., by taking Bloch sums). In a case like the π electrons of benzene, each unit cell is
a carbon atom and the basis set is a 2p_π AO at each atom. The coefficients determined
by exp(ijφ) with j = 0, ±1, ±2, 3 serve to define the wavefunctions completely. Once
these are known, the energy values follow. In cases where several AOs are present in 
each unit cell the situation is complicated because the mix of these AOs may change 
as j (or k) changes. This added complexity is easily handled by computer programs 
working within the paradigm of the variational method. What is important for us to 
understand as chemists is what the computer is doing in such cases, why it is doing it, 
and what it means for the chemical and physical properties of the system. 


15-7 An Example: Polyacetylene with Uniform Bond Lengths


Consider the molecule produced by addition polymerization of acetylene. The standard
chemical representation for the all-trans version of the molecule is shown in Fig. 15-7a.
The resonance diagram makes it plausible that this polymer, like benzene, will have
equal C-C bond lengths, intermediate in value between the lengths of single and double
bonds. For the moment let us assume this to be true and examine the MOs for the
π orbitals of this system. To keep matters simple, we begin by treating the system at
the simple Hückel level. This means that the hydrogen atoms can be ignored. Also,
since only nearest-neighbor π interactions are accounted for, we can pretend that the
carbon framework is linear (Fig. 15-7b).

Figure 15-7 » (a) All-trans polyacetylene in its two “resonance structures.” (b) The linear model
appropriate for a nearest-neighbor, π-only calculation.


First we must choose our unit cell. The simplest choice is to make it a single carbon 
atom with the characteristic translational distance being a C-C bond length, shown as 
a in Fig. 15-7b. 

Next we must choose the periodic basis set U(x) for the π MOs. The normal
choice for this (and also the choice dictated by the simple Hückel method) is a 2p_π AO
on each carbon. Furthermore, it is normal to assume that this basis does not change 
as we go from one MO to another, so we can dispense with the subscript k on U. 
(Hence, no variational calculation will be necessary in finding the wavefunctions for this 
system.) 

Equation (15-17) tells us how to produce wavefunctions. We simply take exp(ikx)
times our basis set, with k taking on all values between −π/a and π/a, and pick off
values at discrete points corresponding to carbon atom positions. Since it is more
convenient to work with the real forms of solutions, we in effect choose a pair of k values
(e.g., −π/4a and π/4a) so that we can generate a pair of trigonometric coefficient
waves [cos(πx/4a) and sin(πx/4a)]. The π MOs for regular polyacetylene produced
from Bloch sums are shown in Fig. 15-8 for selected values of k. These can be used to
illustrate some important points:


1. The k = 0 situation merely reproduces the basis set.

2. As |k| increases, the coefficient wave goes to shorter wavelength and more nodes.

3. Only one MO results from each of the ±π/a extremes of the k range; these are
nondegenerate solutions.

4. The pair of MOs drawn for ±π/4a are degenerate, as are those for ±π/2a. Therefore,
they can be mixed, either to generate complex functions or else simply to shift
the phase. For example, the pair at k = ±π/2a can be added or subtracted to produce
the equally valid alternative pair shown in Fig. 15-9. In this way a phase shift of any
degree could be created. There is no requirement that the MOs be "lined up" in a
simple way with the atoms, as we have done in Figs. 15-8 and 15-9. (It is, however,
usually more convenient.)
Figure 15-8 » Coefficient waves times a periodic basis set of one 2p_π AO on each carbon. (a) k = 0,
so the coefficient wave is a constant and the periodic basis is unmodulated. (b)-(d) Plus and minus
k-value exponentials are mixed to give trigonometric coefficient waves. In (d), one of these waves
has nodes at every carbon so no Bloch sum function exists for this case.
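
The coefficient waves described in the points above can be tabulated with a few lines of Python. This is a minimal sketch under stated assumptions: the function name, the eight-atom segment, and a = 1 are illustrative choices, not taken from the text. It samples cos(kx) and sin(kx) at the atom positions x = sa for the k values discussed.

```python
import math

def coefficient_waves(k, a, n_atoms):
    """Sample cos(kx) and sin(kx) at the atom positions x = s*a."""
    cos_wave = [math.cos(k * s * a) for s in range(n_atoms)]
    sin_wave = [math.sin(k * s * a) for s in range(n_atoms)]
    return cos_wave, sin_wave

a = 1.0        # illustrative C-C distance in arbitrary units
n_atoms = 8    # illustrative segment of the chain

for label, k in [("0", 0.0), ("pi/4a", math.pi / (4 * a)),
                 ("pi/2a", math.pi / (2 * a)), ("pi/a", math.pi / a)]:
    cos_wave, sin_wave = coefficient_waves(k, a, n_atoms)
    print(label, [round(c, 3) for c in cos_wave], [round(s, 3) for s in sin_wave])

# At the zone edge the sine wave vanishes (to rounding error) at every atom,
# so only the cosine wave yields an MO there.
edge_cos, edge_sin = coefficient_waves(math.pi / a, a, n_atoms)
```

The same thing happens at k = 0, where the sine wave is identically zero, which is why the two ends of the k range each contribute a single nondegenerate MO.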


Once we have the MOs, also sometimes called "crystal orbitals" (COs) for infinite
systems, we can consider their energies. In general, these would come from
computations involving nuclear-electronic interactions, kinetic energies, etc. But the
simple Hückel method takes the much easier approach of relating energy to bond order.
(See Section 8-9.) We can see at once that the MO at k = 0 is totally bonding. By
analogy with the totally bonding MO of benzene, its energy is α + 2β. The pair of
MOs due to k = ±π/2a are nonbonding. This is obvious from Fig. 15-8, but is also
not hard to see in Fig. 15-9, which shows the MOs as having equal numbers of bonding
and antibonding interactions. These MOs, then, have E = α. Finally, the totally
antibonding MO at k = π/a has an energy of α − 2β. (Note that the pairing theorem
holds. Compare the coefficients of this MO with those at k = 0.)

Figure 15-9 » An alternative to the Bloch sums shown in Fig. 15-8c (k = ±π/2a). These have the
same coefficient wavelength but the wave is displaced by a/2.
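
The bond-order route to these energies can be sketched in Python. The block below is illustrative only: the helper name and the eight-atom period are assumptions, and the comparison value 2 cos(ka) is the standard closed-form Hückel result for a uniform chain, which this passage does not state explicitly. Energies are reported as x = (E − α)/β, so x = 2 means E = α + 2β.

```python
import math

def huckel_x(coeffs):
    """Return (E - alpha)/beta = 2 * sum of nearest-neighbor c_r*c_s products.

    The coefficients cover one full period of a real MO on a cyclic chain;
    they are normalized here, and AO overlap is neglected (simple Hückel).
    """
    n = len(coeffs)
    norm_sq = sum(c * c for c in coeffs)
    neighbor_sum = sum(coeffs[s] * coeffs[(s + 1) % n] for s in range(n))
    return 2.0 * neighbor_sum / norm_sq

n_atoms = 8          # illustrative period, a multiple of 4 so every wave closes on itself
x_values = {}
for label, ka in [("0", 0.0), ("pi/2a", math.pi / 2), ("pi/a", math.pi)]:
    cos_wave = [math.cos(ka * s) for s in range(n_atoms)]
    x_values[label] = huckel_x(cos_wave)
    # Second number printed is the closed-form comparison 2*cos(ka).
    print(label, round(x_values[label], 6), round(2.0 * math.cos(ka), 6))
```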


EXAMPLE 15-2 What is the energy of an MO in Fig. 15-8 having k = ±π/4a?


SOLUTION » The figure shows that these MOs consist of triads of atoms having bonding interactions
within themselves. Interactions between triads are nonbonding, since only nearest-neighbor interactions
are taken into account in the HMO method. So E lies between α and α + 2β. Within a triad, the
coefficient ratios (if we choose the cosine set) go as cos(−π/4), cos(0), cos(π/4) or √2/2, 1, √2/2.
Normalizing for a triad gives 1/2, 1/√2, 1/2. Eq. (8-58) gives E = α + 2β(1/(2√2) + 1/(2√2)) =
α + 1.414β. This is the same as the result we obtained for the allyl radical in Section 8-6, which
makes sense since, because the triads are nonbonding to their neighbors in this CO, they behave like
isolated allyl systems.
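
The arithmetic of this example can be reproduced with a short Python sketch. The function name is hypothetical, and the choice α = 0, β = −1 (energies in units of |β| measured from α) is an assumption made only to obtain a number; the bond-order energy expression is the one the solution quotes from Eq. (8-58).

```python
import math

def fragment_energy(coeffs, alpha, beta):
    """Simple Hückel energy alpha + 2*beta*sum(c_r*c_s) for an open-chain fragment."""
    norm = math.sqrt(sum(c * c for c in coeffs))
    c = [ci / norm for ci in coeffs]                      # normalize over the fragment
    bond_order_sum = sum(c[i] * c[i + 1] for i in range(len(c) - 1))
    return alpha + 2.0 * beta * bond_order_sum

alpha, beta = 0.0, -1.0     # assumed energy scale: units of |beta|, zero at alpha
triad = [math.cos(-math.pi / 4), math.cos(0.0), math.cos(math.pi / 4)]
energy = fragment_energy(triad, alpha, beta)
print(energy)
```

With this energy scale the result is the value of α + 1.414β, that is, the lowest allyl level.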


The Hückel π MO energies for regular polyacetylene can be plotted versus k. The
result appears in Fig. 15-10 along with a plot of the density of states (DOS). Each carbon
atom brings one π electron to the polymer, and these fill the lower-energy half of the
MOs, so the highest occupied MO (HOMO) for the polymer is at E = α, |k| = π/2a.
Because the CO energies are identical for |k| and −|k|, we can observe all the unique
energies by plotting over the range 0 ≤ k ≤ π/a. The k-range that produces all unique
energies (rather than all the independent wavefunctions) is called the reduced first
Brillouin zone (RFBZ).
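
Because k is uniformly distributed, a DOS sketch can be obtained by counting how many equally spaced k points land in each energy interval. The Python block below is a rough illustration, not the procedure used for Fig. 15-10: the function name, the number of k points, and the number of bins are arbitrary assumptions, and the band is taken as x = (E − α)/β = 2 cos(ka), the standard Hückel result for this chain.

```python
import math

def dos_histogram(n_k, n_bins):
    """Count uniformly spaced k points in the RFBZ by the energy bin they fall in.

    Energies are expressed as x = (E - alpha)/beta = 2*cos(ka), so x runs from
    +2 (bottom of the band, since beta is negative) to -2 (top of the band).
    """
    counts = [0] * n_bins
    for i in range(n_k):
        ka = (i + 0.5) * math.pi / n_k          # midpoints of equal k intervals
        x = 2.0 * math.cos(ka)
        bin_index = min(int((x + 2.0) / 4.0 * n_bins), n_bins - 1)
        counts[bin_index] += 1
    return counts

counts = dos_histogram(n_k=10000, n_bins=10)
print(counts)
```

The counts pile up in the bins at the two band edges, where the band is flat, and are smallest near E = α, where the band is steepest.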

The term Fermi energy is often used in reference to electrons having the highest 
energy in the ground state of the system. Here, if we ignore the effects of thermal 
energy, the Fermi energy is the HOMO energy. When there is a gap between the HOMO 
and LUMO energies, there is some disagreement in use of this term. Physicists, for 
good theoretical reasons, place the Fermi level midway between HOMO and LUMO 
energies; chemists often continue to equate it to the HOMO energy. 
Figure 15-10 » (a) Energy versus k for a chain of π AOs as calculated by the simple Hückel
method with Bloch sums. (b) Sketch of the variation of density of states versus energy. (c) Sketch
of the crystal orbital bond order (COBO) function. Solid line is bond orders between neighbors.
Dashed line is bond orders between second-nearest neighbors (reduced by an arbitrary factor of five
to reflect smaller extent of overlap between π AOs on second-nearest-neighbor atoms). (Adapted
from Hoffmann et al. [1].)


It is apparent that the energy span of the π band depends on the magnitude of β and
that this in turn should depend on a, the distance between the carbon atom unit cells.
The simple Hückel method in fact assumes β to be roughly proportional to AO overlap.
(The extended Hückel method makes the assumption explicit.) This means that
the π band of polyacetylene should become “wider” as the distance between carbons
decreases. (Standard terminology is to refer to the span between lowest and highest
energies in a band as the band width, even though normal graphical representations
show this as a vertical distance.)

There is yet another graphical representation of the relationships within these MOs 
that is useful to chemists. This is a plot of the amount of Mulliken overlap population 
between AOs as a function of E. This quantity, called crystal orbital overlap popula- 
tion (COOP), allows one to see at a glance how the bonding interactions change in a 
band. The simple Hückel analog of COOP is the crystal orbital bond order (COBO). 
Figure 15-10c shows the COBO for the linear polyacetylene π system as a function
of k. The lower-energy COs have a high degree of net bond order, but this falls off to
become zero at E = α and then negative at higher energies. This is just what we know
must be happening. The Hückel MO energies are, after all, directly proportional to
bond order, so low energy must go with large positive net bond order, etc. Also, the
pairing theorem holds, so the nearest-neighbor COBO curve must behave antisymmetrically
through E = α. Although this particular COBO curve is not especially subtle,
it is a good first example because we understand so well what it is telling us.

A useful feature of COOP or COBO curves is that one has complete freedom as to
which AOs (or groups of AOs) one can look at in this way. For example, the dashed
line in Fig. 15-10c shows the overlap populations between π AOs on second-nearest
neighbors in linear polyacetylene. (Even though all overlaps are formally assumed to
be zero in the Hückel method, once we have the MOs we are at liberty to calculate
the overlaps that actually exist between the AOs in these MOs.) We see that the 1,3
interactions start out with positive overlap at low E, decrease to a negative value at
E = α, and then rise to a positive value at high energy. This behavior is less obvious
than the 1,2 COOP curve, though examination of the MOs at k = 0, π/2a, and π/a
in Fig. 15-8 makes sense of it. (COOP curves are often shown without an explicit
numerical overlap scale because they are usually used for qualitative comparisons.)
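
The trends in the 1,2 and 1,3 curves follow from the relative phase of the Bloch-sum coefficients on the two atoms involved. The Python sketch below is illustrative and makes its own assumptions: the function name is hypothetical, the bond order is taken as Re(c_s* c_{s+m}) for coefficients c_s = exp(iksa), and it is scaled so that a fully in-phase pair gives 1. The arbitrary factor-of-five reduction used for the dashed line of Fig. 15-10c is not applied.

```python
import cmath
import math

def bloch_bond_order(ka, m):
    """Scaled bond order Re(c_s^* c_(s+m)) for coefficients c_s = exp(i*k*s*a)."""
    c_s = cmath.exp(1j * ka * 0)      # reference atom
    c_sm = cmath.exp(1j * ka * m)     # atom m cells further along the chain
    return (c_s.conjugate() * c_sm).real

for label, ka in [("0", 0.0), ("pi/4a", math.pi / 4), ("pi/2a", math.pi / 2),
                  ("3pi/4a", 3 * math.pi / 4), ("pi/a", math.pi)]:
    nearest = bloch_bond_order(ka, 1)     # 1,2 interaction
    second = bloch_bond_order(ka, 2)      # 1,3 interaction
    print(label, round(nearest, 3), round(second, 3))
```

The nearest-neighbor value falls steadily from positive to negative across the band, while the second-neighbor value dips to its most negative at the middle of the band and recovers at the top.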


EXAMPLE 15-3 Compare Fig. 15-8(c) to Fig. 15-9 with regard to second-nearest-neighbor
COBO values.


SOLUTION » The values should be equal since these are equivalent orbitals. It is easy to see that
the value should be negative in Fig. 15-8(c) because half of the next-nearest-neighbor coefficient
products are negative, and the other half are zero. In Fig. 15-9, they are all negative. Because the
normalization constant in 15-8(c) is larger by a factor of √2, the sum is the same in each case.
These are nonbonding orbitals, so their energies equal α. Fig. 15-10(c) shows that the corresponding
second-nearest-neighbor COBO is indeed negative.
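
This comparison can be checked by direct summation. In the Python sketch below, the coefficient patterns are assumptions inferred from the coefficient waves rather than read from the figures: (1, 0, −1, 0) for the cosine wave at k = π/2a, and (1, 1, −1, −1) for the same wave displaced by a/2. The helper name is hypothetical.

```python
import math

def second_neighbor_sum(period_coeffs):
    """Sum of c_s*c_(s+2) over one period of a normalized, periodic real MO.

    Also returns the normalization constant applied to the raw coefficients.
    """
    n = len(period_coeffs)
    norm_constant = 1.0 / math.sqrt(sum(c * c for c in period_coeffs))
    c = [ci * norm_constant for ci in period_coeffs]
    total = sum(c[s] * c[(s + 2) % n] for s in range(n))
    return total, norm_constant

sum_8c, norm_8c = second_neighbor_sum([1.0, 0.0, -1.0, 0.0])    # assumed Fig. 15-8(c) pattern
sum_9, norm_9 = second_neighbor_sum([1.0, 1.0, -1.0, -1.0])     # assumed Fig. 15-9 pattern
print(sum_8c, sum_9, norm_8c / norm_9)
```

The third printed number is the ratio of the two normalization constants, which the solution gives as √2.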


The π system of polyacetylene is a convenient first example. It provides elementary
examples of the unfamiliar (to chemists) band, DOS, and COOP plots. Also, the MOs
are trivial to generate because they are identical to the Bloch sums of the basis functions.
We have already seen similar behavior in the context of diatomic molecules, where the
analog to a Bloch sum is a symmetry orbital (SO). (See Chapter 7.) In the case of H2,
we saw that use of a minimal basis set (1s_A and 1s_B) gives two symmetry orbitals
(1s_A ± 1s_B) and that these are identical to the MOs for H2. However, when we go to
larger basis sets (1s, 2s, 2p_x, 2p_y, 2p_z on each center) we find that symmetry orbitals
may no longer be the same as MOs; some MOs are mixtures of symmetry orbitals of
like symmetry (e.g., the 2s σ_g SO mixes with the 2p σ_g SO). We will now explore a case
where the analogous mixing of Bloch sums occurs for a polymer.

Consider what happens to our treatment of the linear chain of carbon atoms when
we use an all-valence basis set (2s, 2p_x, 2p_y, 2p_z) on each carbon. Since our chain lies along
the x-coordinate, the symmetry of the 2s and 2p_x AOs is σ, while that for 2p_y and
2p_z is π. We can use the extended Hückel (EH) method to deal with this basis. At very
large internuclear distances, interactions between AOs on different atoms vanish, so our
energy diagram becomes the simple two-level picture for isolated atoms (Fig. 15-11a).
As the atoms move closer together, bands of finite width begin to develop (Fig. 15-11b).
At first, while the bands are still fairly narrow compared to the distance between E_2s and
E_2p, we can describe them as almost pure s and pure p in character. That is, the MOs
are essentially identical to Bloch sums of the basis set at this point. The 2s band rises
in energy from a totally bonding set of 2s AOs to a totally antibonding set of 2s AOs.
The 2p_y and 2p_z bands (degenerate) behave similarly. The 2p_x band looks peculiar in
that it drops in energy as k increases. Sketching out the Bloch sums for 2p_x at k = 0
and π/a (Fig. 15-12) shows why this happens. The 2p_x AOs in their original basis
set arrangement are antibonding. The k = 0 Bloch sum keeps them that way. At the
other extreme, k = π/a and the Bloch sum reverses the sign of every second coefficient,
making all of the interactions bonding. This is just the opposite of what happens for
the other bands. We say that the 2s, 2p_y, and 2p_z bands “run up” as |k| increases and
that the 2p_x band “runs down.” The simple rule is that bands run up when unit cell
basis functions are symmetric across the unit cell (so they have phase agreement on
the two ends) and run down for antisymmetric basis functions. Note that the 2p_x band
is wider than the 2p_y, 2p_z band because the 2p_x orbitals overlap more strongly due to
their orientation. Note also how the DOS plot (Fig. 15-11c) reflects the degeneracy of
the 2p_y, 2p_z (or 2p_π) band as a large area. Because the data leading to Fig. 15-11 are
produced by the extended Hückel (EH) method, we see the characteristic magnification
of antibonding effects (Chapter 10), leading to asymmetries in DOS and COOP curves.

Figure 15-11 » (a) Valence AO energies for an EHMO calculation of a linear chain of carbon
atoms. (b) EHMO band diagram calculated at an interatomic distance of 220 pm. (c) DOS diagram
and (d) COOP curve between adjacent carbons, for the same calculation. (Adapted from Hoffmann
et al. [1].)

Figure 15-12 » 2p_x and 2p_y basis sets at k = 0, π/a.

The variational method underlying EH calculations on this system evaluates interactions
between various Bloch sums at each k value. Because the Bloch sums (BSs) for
2s and 2p_σ AOs have the same symmetry (σ), a nonzero hamiltonian matrix element
may exist between them, permitting mixing. However, the higher energy of the 2p_σ
BS compared to the 2s BS discourages much mixing as long as the overlap between
them is small. At large distances, the overlap is indeed small, so we see the relatively
“pure” bands of Fig. 15-11. Since the 2p_π band disagrees in symmetry with the σ BSs,
it remains unmixed with them regardless of C-C distance.


EXAMPLE 15-4 How should the areas compare in the DOS of 2s and 2p regions in
Fig. 15-11?


SOLUTION » To the extent that these regions remain “pure,” we should have three times more
area in the p regions because, with three times as many basis functions, there are three times as
many states. (The integral over DOS gives the number of states.)


Recalculating the band diagram for all-valence linear carbon at a smaller C-C distance
gives the band diagram in Fig. 15-13a. Several things have happened: The lowest
band has dropped significantly in energy at k = 0 and risen slightly at k = π/a, so it is
wider. The π band has gotten slightly wider too. The highest band has shot up to very
high energies and has undergone a noticeable change of shape: It has a large hump at
intermediate k values that did not show at large C-C separation. What is responsible
for these changes?

Even without BS mixing we should expect band widths to increase as a result of
increased overlap between AOs on adjacent carbons. Hence, at first glance the lowest-energy
band could be simply a more spread out 2s band. This cannot be right, though,
because the EH method on a pure 2s band would give an antibonding-caused energy
rise at k = π/a that is larger than the bonding-caused energy drop at k = 0. Instead,
we are seeing a smaller rise. Something must be happening to either enhance the drop
at k = 0 or counteract some of the rise at k = π/a. The DOS plots with 2s and 2p_σ
portions shaded (Fig. 15-13b, c) indicate what is happening. We see that the lowest
band is still primarily 2s for small k, but becomes mainly 2p_σ at larger k. The highest
band shows opposite behavior. It is mostly 2p_σ at smaller k and 2s at larger k. Mixing
is occurring between Bloch sums.

Figure 15-13 » (a) EHMO band diagram for a linear chain of carbon atoms separated by 140 pm.
(b) DOS plot, with 2s AO contributions shaded. (c) DOS plot with 2p_σ AO contributions shaded.
(From Hoffmann et al. [1].)

Figure 15-14 » Band diagram for a linear chain of carbon atoms showing some COs and intended
correlations. Positive and negative phases are indicated by shading or lack of shading. (From
Hoffmann et al. [1].)

Another way to see these features is to sketch the MOs that the computer program
reports for various k values. These are shown for k = 0, π/a in Fig. 15-14. It is obvious
that the lowest band is all 2s at k = 0 and all 2p_σ at k = π/a, whereas the highest band
is exactly the reverse. The bands have switched character. The pure 2s band we were
discussing, that should have gone to very high energy, is represented by a dashed line
in Fig. 15-14, and the pure 2p_σ band by another, and these dashed lines do just what
we argued they should.

The variational program is figuring out, at each value of k, how to get the minimum-energy
σ band, minimum-energy π band, etc. and is not seeking to maintain 2s or
2p_σ purity in the bands. Evidently, for the lowest band, it finds pure 2s to be lowest in
energy at k = 0 and pure 2p_σ to be lowest at k = π/a. At intermediate k values it finds
a mixture to be best.

The dashed lines in Fig. 15-14 are the bands for Bloch sums of pure 2s and 2p_σ AOs.
They differ from the bands for variational COs, where the Bloch sums are mixed. Even
though they are not real bands, these dashed lines are nevertheless useful for guessing
in advance of calculation what a band diagram will look like. The dashed lines are
sometimes referred to as intended correlations because they show where the band that
starts out as 2s, for instance, “intends” to be at k = π/a. These two dashed lines are
forced to cross at some value of k, but we have seen (Section 7-6) that such crossings
are not allowed for wavefunctions of like symmetry (i.e., Bloch sums that have nonzero
overlap). Because the two dashed lines belong to Bloch sums of the same symmetry,
the noncrossing rule prevents their crossing, forcing the alternative correlation scheme
shown by the solid lines. We will see later that an easy way to construct some qualitative
band diagrams is to draw them first for simple Bloch sums and then to reroute some of
the lines to remove forbidden crossings.¹
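
The rerouting of intended correlations can be imitated with a two-band toy model in Python. This is a schematic sketch, not the extended Hückel calculation behind Fig. 15-14: every parameter value is hypothetical, the intended bands are given simple cosine shapes, and the coupling between the 2s and 2p_σ Bloch sums is assumed to vary as sin(ka) so that it vanishes at k = 0 and k = π/a, consistent with the pure character reported at those points.

```python
import math

def intended_bands(ka, e_s=-1.0, e_p=0.5, t_s=1.0, t_p=1.5):
    """Unmixed Bloch-sum energies: the 2s band runs up, the 2p_sigma band runs down."""
    return e_s - 2.0 * t_s * math.cos(ka), e_p + 2.0 * t_p * math.cos(ka)

def mixed_bands(ka, v=0.5):
    """Eigenvalues of the 2x2 mixing problem and the 2s weight in the lower band."""
    es, ep = intended_bands(ka)
    coupling = 2.0 * v * math.sin(ka)           # assumed form; zero at ka = 0 and pi
    root = math.sqrt((es - ep) ** 2 + 4.0 * coupling ** 2)
    lower = 0.5 * (es + ep - root)
    upper = 0.5 * (es + ep + root)
    s_weight_lower = 0.5 * (1.0 + (ep - es) / root)
    return lower, upper, s_weight_lower

grid = [math.pi * i / 200 for i in range(201)]
min_gap = min(mixed_bands(ka)[1] - mixed_bands(ka)[0] for ka in grid)
print(min_gap, mixed_bands(0.0)[2], mixed_bands(math.pi)[2])
```

In this model the dashed-line analogs returned by `intended_bands` cross, but the mixed bands stay apart everywhere, and the lower band changes from 2s to 2p_σ character across the zone.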


15-8 Electrical Conductivity


The all-valence band diagram of Fig. 15-14 still leads to the prediction that the π band
is half filled. Many texts make the point that metallic conductivity results when there 
are empty MOs immediately above the Fermi energy in a bulk solid, so we should 
consider whether or not polyacetylene is a metallic conductor. This leads us into the 
topic of electrical conductivity in one-dimensional periodic systems. 

The essence of electrical conductivity is electron flux, and this requires net electron
momentum. Consider the “band” for the one-dimensional free particle, shown in
Fig. 15-1. If we had a band of this kind partially filled with electrons, the levels would
be occupied from the lowest up to the Fermi level. For each occupied state exp(ikx)
there would be an occupied mate exp(−ikx). In short, each electron having momentum
kħ would be balanced by one having momentum −kħ. Net momentum and electrical
current would be zero. But if we apply a voltage, states exp(ikx) and exp(−ikx) lose
their degeneracy. The state corresponding to motion in the direction of lower potential
energy for the electron will now have lower energy than that for equal momentum in the
opposite direction. As a result, some states that were originally above the Fermi level
will move below it because they correspond to motion in the "right" direction, while
others that were originally below the Fermi level will move above it. The electrons
readjust their occupancies to fit the new scheme, and we now have more states occupied
that correspond to momentum in the right direction and fewer corresponding to motion
in the wrong direction. Electric current flows. Notice that this mechanism will work
for very small applied voltages (metallic conductivity) only if there are empty MOs
just above the Fermi energy.

If the first empty levels are separated from the Fermi level by a modest energy gap
(on the order of k_B T, where k_B is Boltzmann's constant, 1.38 × 10⁻²³ J K⁻¹), then a
small population of electrons will exist in the “empty” orbitals at thermal equilibrium.
This allows electrical conductivity (though typically much lower than that normal for
metals) which increases with temperature, the situation in intrinsic semiconductors.
If the gap is large, no conductivity occurs except at extreme voltages and the material
is an insulator.
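
A rough feel for the size of k_B T, and for how strongly the thermal population depends on the gap, comes from a short Python sketch. It is a crude illustration only: a single Boltzmann factor exp(−gap/k_B T) is used in place of a proper Fermi-Dirac treatment, and the gap values and temperatures are hypothetical.

```python
import math

BOLTZMANN_J_PER_K = 1.380649e-23      # k_B in J/K (exact SI value)
JOULES_PER_EV = 1.602176634e-19       # one electron volt in joules (exact SI value)

def thermal_energy_ev(temperature_k):
    """Return k_B*T in electron volts."""
    return BOLTZMANN_J_PER_K * temperature_k / JOULES_PER_EV

def boltzmann_factor(gap_ev, temperature_k):
    """Crude relative population exp(-gap/(k_B*T)) of a level lying gap_ev higher."""
    return math.exp(-gap_ev / thermal_energy_ev(temperature_k))

print(thermal_energy_ev(300.0))
for gap_ev in (0.05, 0.5, 5.0):       # hypothetical gaps
    print(gap_ev, boltzmann_factor(gap_ev, 300.0), boltzmann_factor(gap_ev, 600.0))
```

For each gap the factor is larger at the higher temperature, and for the largest gap it is vanishingly small at either temperature, in line with the semiconductor and insulator cases described above.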

The above discussion is based on a free-particle wavefunction. In real materials
the potential along a coordinate is not constant, so the band structure becomes more
complicated than the parabola of Fig. 15-1. Also, charge-density adjustments occur
that tend to screen out the applied field. However, the basic requirement for metallic
conductivity continues to be the absence of a gap between the highest filled and lowest
empty MOs. Therefore, if the π band of polyacetylene really is partly filled, pure
polyacetylene should be a good (metallic) conductor. However, we will show next that
there are other factors operating that always open up a band gap just above the Fermi
level in one-dimensional systems, thereby preventing metallic conductivity.


¹ Apparent symmetry disagreement of bands for reflection through a plane perpendicular to the polymer axis
and bisecting an atom or a bond does not lead to allowed crossing. This is explained in Section 15-13.


15-9 Polyacetylene with Alternating Bond Lengths: Peierls' Distortion


We now reconsider our assumption of uniform C-C bond lengths. Since the molecules 
leading up to polyacetylene (butadiene, hexatriene, etc.) have alternating long and 
short C-C bonds, it is reasonable to ask whether this variation disappears completely 
in the limit of the infinite polymer. 

We return to the π-only system at the simple Hückel level. This allows us to continue
treating the polymer as though it were a straight chain of carbon atoms (Fig. 15-15a).
Since we are considering the possibility that the bond lengths alternate, we can no 
longer take a single carbon as our unit cell. (That would require two different translation 
distances.) Instead we take two atoms, say bonded through the shorter distance, and 
let the translation distance, a, equal the sum of the two bond lengths (Fig. 15-15b). 

The band diagram undergoes a marked change in appearance when we change from 
a one-atom to a two-atom unit cell, quite aside from the fact that we have two bond 
distances. To show this, we will first work out the two-atom unit cell band diagram for 
equal bond lengths. 

The most obvious change comes in the range of k. Recall that −π/a < k ≤ π/a.
But our new choice of unit cell gives a new translation distance, a, equal to twice its
previous value, so the numerical range of k is cut in half.


Figure 15-15 » (a) A section of a linear carbon polymer with alternating bond lengths. (b) Unit
cell and translation distance. (c) Ungerade (u) and gerade (g) basis sets for a unit cell.




The other big change is in the basis set U. Before it was one 2p_π AO on each carbon.
Now, however, we must have two AOs for each unit cell, and so we need to consider
the most useful combinations to choose as basis. Generally we seek choices that will
minimize subsequent mixing of Bloch sums, because evaluation of such mixing requires
variational calculations and a computer. Bloch sums will not mix if they disagree in
symmetry (for operations for which the polymer hamiltonian is invariant). Clearly,
inversion through the center of a C-C unit cell is a symmetry operation for the whole
polymer, so let us choose the g and u combinations of 2p_π AOs shown in Fig. 15-15c.
Since these basis functions cannot mix without producing a polymer MO of mixed
symmetry (which is not allowed unless the MOs are degenerate), we can take the Bloch
sums of these bases as identical to the nondegenerate COs at least.

We can now construct the band diagram. At k = 0, our two basis functions give two
Bloch sums, shown in Fig. 15-16a. The ungerade combination at k = 0 is obviously
totally bonding over the whole molecule, has energy α + 2β, and is identical to the
MO we obtained at k = 0 when we chose one carbon atom per unit cell. (Compare
with Fig. 15-8.) The gerade basis function at k = 0 is totally antibonding, has energy
α − 2β, and is the same as what we previously had at k = π/a (with a equal to one C-C
bond length). These, then, are the COs we previously indicated to be nondegenerate.
When we shift to a equal to two C-C bond lengths, we find at k = π/a that the g and
u bases produce the polymer MOs shown in Fig. 15-16b. These are easily seen to be
nonbonding MOs (E = α) exactly like those in Fig. 15-9. When we start the band
diagram at k = 0, we find two energies (E = α ± 2β). The u basis has the same phase
on the ends of the unit cell, so it runs up (from α + 2β) as k increases, ending up at
E = α when k = π/a. The g basis has opposite phases on the ends of the unit cell, runs
down from α − 2β as k increases, and finishes at E = α too, as shown in Fig. 15-17.
The Fermi level is still at E = α, but this point now occurs on the right edge of the band
diagram instead of at the midpoint.

Figure 15-16 » Results of translating u and g basis functions through unit cell translation distances 
a, modulated by cos(kx) with k = 0 and π/a. 

Figure 15-17 » π-band diagram from using two-atom unit cell of Fig. 15-15 with (a) s = l, (b) s ≠ l. 

In going from a uniform chain of π AOs with one carbon per unit cell to an identical 
chain with two carbons per unit cell, we have not changed anything physically. We have 
merely changed our choice of representation. Therefore we must obtain the same MOs 
and energies either way, and it appears that we do, at least at the edges of the diagram. 
However, the appearance of the band diagram is affected strongly. The relation between 
the two band diagrams turns out to be very simple, though. Comparison of Figs. 15-17a 
and 15-10a shows that the band diagram for the two-atom unit cell is generated from 
that for the one-atom unit cell by simply folding the latter through a vertical line halfway 
across the diagram. There is always a smallest possible entity that can be chosen as 
the unit cell, and it is always possible to select any multiple of this for the unit cell and 
still generate the periodic structure. The band diagrams based on smaller unit cells can 
always be converted to those based on larger cells by the process of folding. 
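
The folding operation can be sketched numerically. The short Python example below is only an illustration under simple Hückel assumptions: the energy zero (α = 0), the unit β = −1, and the function names are choices made here, not part of the text. It evaluates the one-atom-cell band α + 2β cos(kd) and reads off the two branches of the two-atom-cell diagram by reflecting through the line k = π/a.

```python
import math

ALPHA = 0.0   # Coulomb integral, taken as the energy zero (assumption)
BETA = -1.0   # resonance integral; negative, so energies are in units of |beta|

def one_atom_band(k, d=1.0):
    """Simple Hückel band for one carbon per unit cell (cell length d)."""
    return ALPHA + 2.0 * BETA * math.cos(k * d)

def folded_bands(k, d=1.0):
    """Two branches of the two-atom-cell diagram (a = 2d) obtained by folding.

    k must lie in [0, pi/a]. The lower branch is the one-atom band at k;
    the upper branch is the one-atom band at the mirror point 2*pi/a - k.
    """
    a = 2.0 * d
    return one_atom_band(k, d), one_atom_band(2.0 * math.pi / a - k, d)

if __name__ == "__main__":
    a = 2.0
    for frac in (0.0, 0.5, 1.0):
        lower, upper = folded_bands(frac * math.pi / a)
        print(f"k = {frac:.1f} pi/a: lower = {lower:+.3f}, upper = {upper:+.3f}")
```

With these units the two branches start at ∓2 (that is, α ± 2β) at k = 0 and meet at α at k = π/a, which matches the end points discussed above.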

Note that what was one π band in the earlier band diagram becomes two π bands 
when we double the unit cell size. We shall refer to the lower of these two as the π band 
and the other as the π* band, the asterisk indicating net antibonding character. What was 
previously a degenerate level in the middle of the |k| range is now at the right edge of the 
diagram, with each member belonging to a different band (though still degenerate). 

There is a subtlety that needs to be addressed at this point. All that has been said up 
to now suggests that no energy changes occur as a result of our doubling the size of the 
unit cell. This is misleading. The Bloch sums resulting at intermediate k values are not 
identical for corresponding states in these two representations. This reflects the fact that, 
as k varies for a single-carbon unit cell, the coefficient of the pπ AO can change from one 
carbon to the next. For the two-carbon unit cell, the AO coefficients are locked together 
in pairs. We cannot get a CO that looks like the one in Fig. 15-8b, for instance, using 
a Bloch sum based on the πu basis set in a two-carbon unit cell. Doubling the unit cell 
forces the Bloch sums to be a more “coarse-grained” approximation. What saves the day 
is that the variational procedure mixes together the π and π* Bloch sums at intermediate 
k values in precisely the manner needed to undo the result of this limitation. The lesson 
we take from this is that, if we seek to estimate band wavefunctions and energies from 
Bloch sums without variational modification, we make the smallest error if we use the 
smallest possible unit cell, since this gives the finest-grained first approximation. 
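
A small numerical check may make this point concrete. The sketch below assumes nearest-neighbor simple Hückel theory with α = 0 and β = −1 (unit choices made here); the function names and the closed-form matrix elements are worked out for this illustration and are not quoted from the text. It compares the energies of the unmixed u and g Bloch sums with the roots obtained after the two are allowed to mix, and with the folded one-atom-cell band.

```python
import math

ALPHA, BETA = 0.0, -1.0  # assumed units: alpha = 0, beta = -1

def bloch_first_order(k, a=1.0):
    """Energies of the unmixed u and g Bloch sums for a two-carbon cell."""
    shift = BETA * (1.0 + math.cos(k * a))
    return ALPHA + shift, ALPHA - shift            # (u, g)

def bloch_mixed(k, a=1.0):
    """Roots of the 2x2 secular problem once u and g may mix."""
    e_u, e_g = bloch_first_order(k, a)
    h_ug = BETA * math.sin(k * a)                  # magnitude of the u-g element
    mean, half = 0.5 * (e_u + e_g), 0.5 * (e_u - e_g)
    root = math.sqrt(half * half + h_ug * h_ug)
    return mean - root, mean + root                # (pi, pi*)

def exact_folded(k, a=1.0):
    """Folded one-atom-cell band: alpha +/- 2*beta*cos(k*a/2)."""
    e = 2.0 * BETA * math.cos(0.5 * k * a)
    return ALPHA + e, ALPHA - e

if __name__ == "__main__":
    k = math.pi / 2.0                              # halfway across the diagram
    print("unmixed u, g :", bloch_first_order(k))
    print("after mixing :", bloch_mixed(k))
    print("folded band  :", exact_folded(k))
```

At the edges (k = 0 and k = π/a) the u-g element vanishes and the unmixed Bloch sums are already correct; in between, the mixing lowers the π root to the folded-band value.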

Now that we understand the band diagram for a two-atom unit cell, we can consider 
the effects of bond-length alternation. This is most easily treated as a perturbation on the 
uniform bond-length system, with half of the bonds getting longer and the intervening 
half getting shorter. It will suffice to consider the effects of this perturbation to first 
order and to consider only the four MOs shown in Fig. 15-16. Looking first at the k = 0 
MOs, we see that making half of the bonds longer will raise the energy of the bonding 
MO, but making the other half of the bonds shorter will lower the energy. Hence we 
expect little net energy change. The totally antibonding MO will likewise respond 
oppositely to each type of change, so the effects will again cancel. To first order, then, 
we expect the perturbation to have little effect on the left side of the π band diagram. 

On the right side, the situation is much more interesting. The MOs here (Fig. 15-16b) 
are degenerate (both nonbonding, E = α), so we must be wary. There is only one 
pair (among the infinite number of possible mixtures) that is "proper" for evaluating 
this particular perturbation. From Chapter 12 we know that the proper pair is that 
which responds most differently to the perturbation. Consider first the MOs shown in 
Fig. 15-8c. It is easy to see that neither of these would undergo any energy change as 
bond lengths change because there is no π AO overlap at all between nearest neighbors. 
On the other hand, the versions of these MOs shown in Fig. 15-9 or 15-16b respond 
strongly to the perturbation. One of them is bonding across the bonds being shortened 
and antibonding across the bonds being lengthened, and its energy will be lowered 
by both factors. The other is antibonding across the shortening bond, bonding across 
the lengthening bond, so its energy will rise. Clearly, this choice of MOs leads to the 
most different responses to this perturbation, so this is the prediction we use—that the 
endpoints of the curves at the right side of the band diagram will split apart, one going 
up and one going down (Fig. 15-17b). States at slightly smaller |k| are slightly less 
affected, etc., so the curves split apart over a range, not just at their termini. 
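
The splitting at the right edge can be illustrated with a two-parameter simple Hückel model. In the sketch below, `beta_short` and `beta_long` are hypothetical resonance integrals for the shortened and lengthened bonds (the numerical values are arbitrary illustrations, in units where the uniform β is −1), and α is taken as the energy zero.

```python
import cmath
import math

ALPHA = 0.0  # energy zero (assumption)

def alternating_bands(k, beta_short, beta_long, a=1.0):
    """Simple Hückel pi and pi* energies for a two-carbon cell.

    beta_short couples the two carbons inside the cell and beta_long
    couples neighbouring cells; both are negative numbers.
    """
    coupling = abs(beta_short + beta_long * cmath.exp(1j * k * a))
    return ALPHA - coupling, ALPHA + coupling

def gap_at_zone_edge(beta_short, beta_long):
    """pi-pi* separation at k = pi/a, equal to 2*|beta_short - beta_long|."""
    lo, hi = alternating_bands(math.pi, beta_short, beta_long)
    return hi - lo

if __name__ == "__main__":
    print("uniform gap     :", gap_at_zone_edge(-1.0, -1.0))
    print("alternating gap :", gap_at_zone_edge(-1.1, -0.9))
    print("k = 0, uniform     :", alternating_bands(0.0, -1.0, -1.0))
    print("k = 0, alternating :", alternating_bands(0.0, -1.1, -0.9))
```

In this model the k = 0 energies depend only on the sum of the two resonance integrals, so they are unchanged when one bond gains what the other loses, while the edge levels separate in proportion to the difference.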

Will the polymer elect to make this bond-length distortion? Yes, because the occupied 
MOs are affected only by energy lowering. All of the MOs of increasing energy are 
empty. This situation is analogous to what we saw in Chapter 8 for molecules having 
partly occupied degenerate MOs. In that context, the molecule undergoes what is 
called Jahn-Teller distortion. For infinite periodic materials, the phenomenon is called 
a Peierls distortion. 
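
As a rough numerical illustration of why the half-filled band favors the distortion, the sketch below sums the occupied π-band energies with and without alternation. It assumes simple Hückel theory with β = −1, a hypothetical alternation parameter `delta` that scales the two resonance integrals to β(1 + δ) and β(1 − δ), and a midpoint-rule average over k. Only the π-electron energy is counted; any σ-framework cost of changing bond lengths is ignored.

```python
import math

BETA = -1.0  # uniform resonance integral (assumed unit)

def lower_band(k, delta, a=1.0):
    """Occupied (pi) band energy relative to alpha for alternation delta."""
    bs, bl = BETA * (1.0 + delta), BETA * (1.0 - delta)
    return -math.sqrt(bs * bs + bl * bl + 2.0 * bs * bl * math.cos(k * a))

def pi_energy_per_cell(delta, n_k=2000):
    """pi energy per two-carbon cell: two electrons times the band average."""
    step = math.pi / n_k
    total = sum(lower_band((i + 0.5) * step, delta) for i in range(n_k))
    return 2.0 * total / n_k

if __name__ == "__main__":
    for delta in (0.0, 0.05, 0.10):
        print(f"delta = {delta:.2f}: E_pi per cell = {pi_energy_per_cell(delta):.4f}")
```

For δ = 0 the result approaches −8/π in these units, and it becomes more negative as δ grows, consistent with the statement that only the occupied levels register the perturbation.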

We should consider whether the perturbation affects the other bands (σ bands) in a 
manner to counteract the effect on the π bands. Although some splitting does occur 
for those bands too, the effect on overall energy is much smaller because these bands 
are either completely occupied, so that the effects of energy rise and lowering are both 
registered, or else are empty, so that neither effect registers. Thus, it is the partly filled 
band that determines the overall energy change due to the perturbation. 

We have been dealing with a half-filled band, but it is not difficult to show that a 
suitable distortion can be found to produce a gap at the Fermi energy for any fractional 
degree of filling. 

We arrive, then, at the conclusion that a one-dimensional periodic structure having 
a partly filled band is unstable with respect to a distortion in bond lengths that will 
produce a gap at the Fermi level. This means that polyacetylene cannot be a metallic 
conductor, though it might be a semiconductor if the gap is not too large. 

Finally, we must address the question of how any metal can be a conductor. How can 
sodium, say, have a partly filled band without a band gap? Why doesn't the metal relax 
into a lower-energy structure having a gap at the Fermi energy? The answer is that, 
while one-dimensional systems can always find a distortion that has opposite effects on 
the energies of the MOs above and below the Fermi level, multidimensional systems 
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cannot. A distortion that would work to split the MOs around the Fermi level according 
to their behavior in one coordinate does not necessarily work when we examine what is 
happening in the other coordinates. A kind of coincidence is needed that is not usually 
found except in crystals called quasi-one-dimensional. As a result, two- and three- 
dimensional crystals with partly filled bands can exist without undergoing spontaneous 
structural reorganization and losing metallic conductivity. 


15-10 Electronic Structure of All-Trans Polyacetylene 


We finish our discussion of this polymer by examining the band structure for the 
nonlinear, all-trans structure with alternating bond lengths (Fig. 15-18a) as calculated 
by the extended Hückel method. 

The EHMO band structure appears in Fig. 15-18b. There are 10 valence AO basis 
functions per unit cell (two carbons and two hydrogens) so we get 10 lines in the band 
diagram. Note that, since the chain is no longer linear, the two 2p bands that were 
degenerate π bands in the linear chain are no longer degenerate. One of these is still 
perpendicular to the molecular plane, hence is still of π symmetry, but the other now 
lies in the plane and has σ symmetry. 




Figure 15-18 » (a) Unit cell for all-trans polyacetylene with alternating bond lengths of 136 pm and 
144 pm. (b) Valence-band diagram calculated by the EHMO method. All bands have gaps at k = π/a. 
(c) Magnified band diagram showing region near the Fermi level. (d) Density of states in region of 
Fermi level. Dashed line shows π-band portion. (Parts a-c adapted from Hoffmann et al. [1].) 

A gap resulting from bond-length alternation occurs at the right edge for each of 
the bands, though it is quite small in bands associated with MOs that have small C-C 
overlap. A density-of-states plot for the occupied valence σ bands and the π bands 
shows the gap to be just above the highest-energy π-band level. 

With two exceptions, the EHMO band structure shown in Fig. 15-18a, b for alternating 
bond lengths is almost indistinguishable from that for uniform bond lengths 
(not shown). The only significant differences are that (1) the avoided crossing between 
σ levels shown in Fig. 15-18b becomes a true crossing in the more symmetric, uniform 
bond-length case, and (2) all the gaps at k = π/a disappear for uniform C-C bond lengths. 


15-11 Comparison of EHMO and SCF Results on Polyacetylene 


So far our entire discussion of band theory has been in the context of Hückel-type 
models. It is possible to do band calculations using a self-consistent-field (SCF) approach 
wherein the contributions to one-electron energies resulting from kinetic energy, 
nuclear-electron attraction energy, and electron-electron repulsion and exchange 
energies are each explicitly calculated as described in Chapter 11. As pointed out 
there, the total energy is not the same as the sum of one-electron energies in such 
calculations, since this would double-count the interelectronic interactions. Also, 
the energies for virtual orbitals are not given the same physical interpretation as for 
occupied orbitals. For a calculation on a neutral n-electron system, the occupied MO 
energies are appropriate for an electron in the neutral system interacting with the n − 1 
other electrons. The virtual MO energies are appropriate for an additional electron 
(electron n + 1) interacting with the n other electrons (with the proviso that the original 
n electrons have not reacted to the presence of electron n + 1). Thus, the virtual MOs 
refer to a different system (the unrelaxed negative ion) than the occupied MOs. We 
will see that this affects the nature of band diagrams. 

In contrast to SCF calculations, Hückel methods evaluate orbital energies entirely 
in terms of nodal behavior. Simple Hückel theory calculates bond orders and converts 
these to energies. As more and more nodes go into the MO, the bond orders drop 
from net positive through zero (nonbonding) to net negative, and the MO energies rise 
accordingly. Since overlap is neglected, the MO coefficients remain similar in absolute 
value over this range, so the high-energy antibonding MOs are elevated above zero 
energy as much as the bonding MOs are depressed below it, as the pairing theorem 
requires. The extended Hückel method is similar in spirit but evaluates the overlaps 
between AOs explicitly in order to arrive at interaction elements that are tailored to 
explicit AO types, the particular kinds of atoms involved, and the explicit distance 
between them. Importantly, this method does include AO overlap in MO normalization, 
which leads to very much larger coefficients in the highly noded MOs and a resulting 
"inflation" on the high-energy end of the energy spectrum. However, because these 
high-energy MOs are usually not occupied with electrons, they do not affect predictions 
and can generally be ignored. In contrast to the SCF situation, there is no qualitative 
difference between the interpretation appropriate for an empty and a filled simple or 
extended Hückel MO. The lowest empty MO is simply an MO that is a little more 
antibonding than the highest occupied MO. If two Hückel MOs are equally antibonding, 
they have equal energy, even if one is occupied and the other is not, whereas this is not 
the case for two ab initio SCF MOs. 

Figure 15-19 » SCF band diagrams computed for all-trans-polyacetylene. (a) All C-C bonds 
1.39 Å. (b) Alternating bond lengths of 1.3636 Å and 1.4292 Å. (From André and Leroy [2].) 

We can see the results of these factors when we compare the band diagram for 
polyacetylene calculated by an SCF procedure with that from an EHMO program. 
Figure 15-19 shows the SCF bands resulting from all-trans polyacetylene with uniform 
and with alternating C-C bond lengths. Comparing these with the EHMO results 
(Fig. 15-18a and discussion in previous section), we see that the occupied valence 
bands agree quite well in the two approaches (the five lowest curves in Fig. 15-18a 
and the third through seventh lowest curves in Fig. 15-19b). This is important if these 
methods are to agree in predictions of relative energies of various polymer structures. 
The highest-energy band in the EHMO diagram is much higher than in the SCF diagram 
as a result of the inflationary factor described above. This is not a problem as long as we 
do not try to use this band for predictions. The most important difference is that the SCF 
band diagram for uniform C-C distances shows a sizable π-π* band gap, while the 
EHMO method would show no gap there. The SCF gap gets larger when bond-length 
alternation is introduced, while the EHMO gap becomes finite, so the methods agree 
that bond-length alternation increases gap size. 

The apparent disagreement about the existence of a gap in the uniform polymer is 
really not a disagreement in light of what we have seen about the different ways these 
methods define orbital energies. The EHMO method indicates that the highest π and 
lowest π* COs have the same amount of overlap-induced bonding energy (the MOs are 
the same, merely differing in phase). The SCF method indicates that the least stable 
electron in the neutral polymer (at the top of the π band) is at lower energy than an 
electron can achieve in the best CO it can find in the (unrelaxed) negative ion (at the 
bottom of the π* band). The EHMO method is suggesting that the uniform polymer 
should be a conductor (if we could make it stay uniform). The SCF method is telling 
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us (via Koopmans' theorem) that the energy needed to remove an electron from neutral 
polyacetylene is larger than the energy released when neutral polyacetylene gains an 
electron (without relaxing). It would be incorrect to use the EHMO band diagram for 
a Koopmans' theorem analysis and say that the ionization energy and electron affinity 
of polyacetylene have the same absolute value. It would also be improper to argue 
that the large gap seen in this SCF calculation on uniform polyacetylene means such 
a system would not be a conductor. Both types of calculation have advantages and 
disadvantages, and these include their appropriateness for revealing various properties. 
We will continue to use Hückel methods in this chapter. 


15-12 Effects of Chemical Substitution on the π Bands² 


We have seen that Peierls distortion produces a band gap at the Fermi level in 
polyacetylene. Another type of change we can consider is replacement of one kind of atom 
in the unit cell with another. While this is not a modification that the polymer can make 
spontaneously, it is a way for us to predict what sort of changes in band structure would 
result if substitutional relatives of polyacetylene were synthesized. 

Consider what would happen if one C-H group in the two-carbon unit cell were 
replaced by N. In making this substitution, we lose a hydrogen atom and its 1s AO, 
so the all-valence band diagram would have 9 lines rather than 10. Also, we replace 
one carbon with a more electronegative atom having greater attraction for its electrons. 
At the simple Hückel level, this would lead to use of a coulomb integral for nitrogen 
that is lower in energy than that for carbon: αṄ = αC + hṄβ. According to Table 8-3, 
hṄ is 0.5. At the extended Hückel level, valence state ionization energies for nitrogen 
would replace those for carbon. Ab initio methods would account for the change by 
increasing the nuclear charge from 6 a.u. to 7. Because the results of all these methods 
agree qualitatively, we continue the discussion in terms of the simple Hückel method 
since all relevant calculations can be done mentally. 

We again start with uniform bond lengths and consider only the π electrons. It is not 
difficult to predict the changes, to first order, that will occur at k = 0 when N replaces 
C-H. Referring once more to Fig. 15-16, we note that both the π and π* bands place 
equal amounts of π-electron density on every atom. Therefore, each energy should 
drop at k = 0 by an amount reflecting the fact that each electron now spends half of its 
time in a nitrogen 2p AO. At the simple Hückel level this is easily seen to be an energy 
drop of 0.25β. At k = π/a, we once again confront the question as to which of the 
infinite number of linear combinations of degenerate COs is the correct set to use in 
evaluating the substitutional perturbation. Both members of the pair we used to evaluate 
bond-length alternation (shown in Fig. 15-16b) place equal density on every carbon, so 
both COs would drop from α to α + hβ/2 and remain degenerate. Maximum difference 
in response comes from using the pair shown in Fig. 15-8c because one of these COs 
places all of its density on the set of atoms that become nitrogen while the other places 
all of its density on the atoms that remain carbon. The first-order change predicted by 
the Hückel method, based on the principle of “maximum difference in response” from 
degenerate-level perturbation theory, is that one level drops to α + hβ while the other 
remains at α. The results of these considerations are sketched in Fig. 15-20a. 
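
These first-order estimates can be compared with the roots of the corresponding 2×2 simple Hückel problem. The sketch below is an illustration only: it takes α = 0 and β = −1 as units, uses h = 0.5 as quoted above, assumes uniform bond lengths, and the function names are invented here.

```python
import math

ALPHA, BETA, H_N = 0.0, -1.0, 0.5   # h = 0.5 as quoted from Table 8-3

def substituted_bands(k, h=H_N, a=1.0):
    """Simple Hückel roots for an (N, C) two-atom cell with uniform bonds."""
    alpha_n = ALPHA + h * BETA
    mean = 0.5 * (alpha_n + ALPHA)
    half = 0.5 * (alpha_n - ALPHA)
    coupling = 2.0 * BETA * math.cos(0.5 * k * a)   # modulus of beta*(1 + e^{ika})
    root = math.sqrt(half * half + coupling * coupling)
    return mean - root, mean + root

def first_order_k0(h=H_N):
    """First-order estimate at k = 0: each CO has half its density on N."""
    shift = 0.5 * h * BETA
    return ALPHA + 2.0 * BETA + shift, ALPHA - 2.0 * BETA + shift

if __name__ == "__main__":
    print("k = 0, first order :", first_order_k0())
    print("k = 0, 2x2 roots   :", substituted_bands(0.0))
    print("k = pi/a, 2x2 roots:", substituted_bands(math.pi))
```

In this model the k = π/a roots are exactly α + hβ and α, because the two COs that localize on the nitrogen and carbon sublattices do not interact there; at k = 0 the first-order values differ from the 2×2 roots only by a small second-order term.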


²This section follows closely the discussion of Lowe and Kafafi [3]. 


Figure 15-20 » (a) π band of polyacetylene (simple Hückel method) with alternating C-H groups 
replaced by nitrogen. (b) Same as (a) except that the remaining C-H groups have been replaced by 
an atom for which the correction factor h is half as large as that for nitrogen. 


We can go further and consider the effects of replacing the other C-H groups with 
something else. If we continue the perturbational treatment using the original 
polyacetylene functions, it is easy to see that now it will be the other band that will be 
affected at k = π/a (Fig. 15-20b). 

Since substitution removes the degeneracy at k = π/a, we now find the situation 
changed if we go on to consider alternating bond lengths. Whereas before we found 
that the uniform structure would distort to open a gap, we now find a gap already present 
due to substitution. Furthermore, the COs now describing the nondegenerate states at 
the upper and lower edges of the gap are the ones that we saw earlier are not affected by 
bond-length changes. This means that the substitutional change strongly suppresses the 
tendency of the polymer to have alternating bond lengths. (To first order for degenerate 
states, perturbations will counteract each other unless they share the same set of proper 
zeroth-order wavefunctions. See Section 12-8.) 
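
The suppression can be made quantitative within a simple model. The sketch below assumes simple Hückel theory at k = π/a with β = −1, a hypothetical alternation parameter `delta` (resonance integrals β(1 ± δ)), and a substituent correction `h`; the function names are invented for this illustration. It compares how much the occupied edge level is stabilized by alternation with and without substitution.

```python
import math

BETA = -1.0  # uniform resonance integral (assumed unit)

def edge_levels(h, delta):
    """Levels at k = pi/a, relative to alpha, for substitution h and alternation delta."""
    off = 2.0 * BETA * delta          # beta_short - beta_long
    half = 0.5 * h * BETA             # half the N-C Coulomb-integral difference
    root = math.sqrt(half * half + off * off)
    return half - root, half + root

def stabilisation(h, delta):
    """Lowering of the occupied edge level caused by alternation alone."""
    return edge_levels(h, 0.0)[0] - edge_levels(h, delta)[0]

if __name__ == "__main__":
    for h in (0.0, 0.5):
        print(f"h = {h:.1f}: stabilisation for delta = 0.05 is {stabilisation(h, 0.05):.4f}")
```

Without substitution the stabilization is linear in δ (a first-order effect); with h = 0.5 the same δ gives only a much smaller, second-order lowering, which is the sense in which substitution weakens the driving force for alternation.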

There is a lot going on that we are ignoring. The C-N length is different from the 
C-C length, the COs change when we go to higher levels of perturbation theory, and 
Hückel methods are very approximate and become less reliable in systems with polar 
bonds. It should nevertheless be clear that, if one were looking for ways to predict the 
effects of chemical substitution on band gap, this sort of approach would provide the 
conceptual framework. 


15-13 Poly-Paraphenylene: A Ring Polymer 


We turn now to the π COs of poly-paraphenylene (PPP). This system serves as an 
excellent illustrator of principles introduced in earlier sections.³ 

³This section follows closely the discussion of Lowe et al. [4]. 

Figure 15-21 » (a) Two of the choices for the unit cell of PPP. (b) The π MOs of benzene. Shading 
indicates negative phase. (From Lowe et al. [4].) 


As its name indicates, PPP results when benzene molecules are attached in a planar 
chain, with each benzene being linked to two neighbors at para (opposite) positions 
(Fig. 15-21a). There is an infinity of ways we could select the repeating segment, or 
unit cell, for this polymer. Two of the most obvious are indicated in Fig. 15-21a, but 
any portion of length a (or na) would serve. Since selection of the unit cell determines 
the basis set of MOs we will use to construct the COs, we choose the unit cell on the 
left and start with the familiar π MOs of benzene, sketched in Fig. 15-21b. 

Two of the benzene MOs, φ1 and φ6, are nondegenerate. There are two degenerate 
pairs: φ2, φ3 and φ4, φ5. Our first task is to decide how these MOs will respond to the 
perturbation resulting from linking them to a pair of neighbors in an infinite chain. We 
will assess the interaction energies at the simple Hückel level, since the computations 
are so trivial that the reasoning is less obscured. Before the perturbation, then, these 
MOs are at energies E(φ1) = α + 2β; E(φ2), E(φ3) = α + β; E(φ4), E(φ5) = α − β; 
E(φ6) = α − 2β, where β is a negative quantity. 

Figure 15-22 » Crystal orbitals for PPP constructed as Bloch sums from φ1 at |k| = 0, π/2a, π/a. 
(From Lowe et al. [4].) 
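
The benzene energies quoted above follow from the standard simple Hückel result for a ring of n equivalent centers, E_j = α + 2β cos(2πj/n). The snippet below is a minimal check, with α = 0 and β = −1 chosen here as units and a hypothetical function name.

```python
import math

ALPHA, BETA = 0.0, -1.0  # assumed units: alpha = 0, beta = -1

def ring_huckel_energies(n):
    """Simple Hückel energies alpha + 2*beta*cos(2*pi*j/n) for an n-membered ring."""
    return sorted(ALPHA + 2.0 * BETA * math.cos(2.0 * math.pi * j / n)
                  for j in range(n))

if __name__ == "__main__":
    print([round(e, 3) for e in ring_huckel_energies(6)])
```

Sorted from most to least stable, the six values correspond to α + 2β, α + β (twice), α − β (twice), and α − 2β.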


We first consider the lowest-energy MO, φ1. We want to know the energy that 
results when φ1 is a component of COs in which |k| ranges from 0 to π/a. We can 
estimate this to first order by imagining that the MO does not undergo any change 
when it is incorporated into the CO. That is, it remains six 2pπ AOs, all with the same 
coefficient (1/√6), just as in the molecule. When k = 0, the MO repeats itself [times 
exp(ika) = exp(0) = 1] for each translation through the distance a, giving the pattern 
shown at left in Fig. 15-22. An energy of α + 2β results from the unchanged bond order 
within each benzene, but there is now some additional bond order between molecules. 
Recalling that the bond order between AOs j and k in MO i equals cji cki, this is 
(1/√6)(1/√6) = 1/6. Since MO energy involves 2β times bond order, such new bonds 
are worth |β|/3. Since there is one such new bonding interaction per ring, the energy 
of the CO at k = 0 is lowered by |β|/3 to α + 2.33β. Qualitatively, the MOs have been 
linked in a bonding manner so the first-order energy is lower than that for an isolated MO. 
At k = π/a, the MO φ1 is multiplied by exp(ika) = exp(iπ) = cos(π) + i sin(π) = −1, 
for each translation through the distance a, resulting in the situation sketched at the 
right of Fig. 15-22. When we analyze this to first order, all is as before except that the 
additional interaction is antibonding, so the energy rises by |β|/3 to α + 1.67β. At the 
halfway point, when k = π/2a, φ1 oscillates half as fast, going through zero on every 
second unit cell, giving the middle sketch of Fig. 15-22. Obviously, φ1 is not interacting 
at all with a companion on either side, so its energy remains unperturbed at α + 2β. 
The resulting CO energy curve, labeled π1, runs up (by 2|β|/3 to first order) because the 
unit cell MO has phase agreement at the two points of attachment to the polymer chain. 

The analysis for φ6 is identical except that there is phase disagreement at the points 
of attachment, no matter which opposite set of carbons we choose. This means that the 
band π6 runs down. Since the coefficients all have the same absolute value as in φ1, 
the first-order interaction magnitudes are the same, so π6 runs from α − 2.33β at k = 0 
through α − 2β at k = π/2a to α − 1.67β at k = π/a. 
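
The three points quoted for each of these bands are consistent with a single first-order expression. The LaTeX fragment below is a compact restatement, assuming a Bloch sum of the unchanged benzene MO and nearest-neighbor Hückel coupling between linking atoms 1 and 4; the symbol E_i^{(0)} for the isolated-benzene MO energy is notation introduced here.

```latex
\begin{align}
E_i(k) &\approx E_i^{(0)} + 2\beta\, c_{1i}\, c_{4i} \cos(ka), \\
E_{\pi_1}(k) &\approx \alpha + 2\beta + \tfrac{1}{3}\beta \cos(ka),
  \qquad c_{1}c_{4} = +\tfrac{1}{6}, \\
E_{\pi_6}(k) &\approx \alpha - 2\beta - \tfrac{1}{3}\beta \cos(ka),
  \qquad c_{1}c_{4} = -\tfrac{1}{6}.
\end{align}
```

Setting ka = 0, π/2, and π reproduces α + 2.33β, α + 2β, α + 1.67β for π1 and α − 2.33β, α − 2β, α − 1.67β for π6.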

Analysis of the degenerate MOs requires that we work with the proper zeroth-order 
versions. We need to ask, for each level, which orthogonal pair responds most 
differently to the perturbation resulting from linking onto para carbons. Alternatively, we 
can select a reflection plane that does not move the linkage sites and then seek the 
orthogonal pair having opposite symmetries for reflection through that plane. Such a 
plane is the one perpendicular to the benzene plane and intersecting both of the linking 
sites. Either of these approaches leads to the conclusion that the proper zeroth-order 
MOs are pairs such that one MO places a node at the linking carbons and the other 
places an antinode there. Looking at Fig. 15-21b, we see that we can use these MOs 
as they are sketched provided we assume that the linking occurs at the carbons located 
at the 3 o'clock and 9 o'clock positions (i.e., atoms 1 and 4 in Fig. 15-21a). Once we 
ascertain this starting point for analysis, the rest is easy. COs π2 and π5, resulting from 
φ2 and φ5, are unaffected by the polymerization perturbation because they are zero at 
the linking sites. π3 runs down because the AO phases disagree at the linking sites, but 
π4 runs up by an equal amount because phases agree. π3 and π4 are wider bands than 
π1 and π6 because the coefficients at the linking sites are larger. In fact, they are √2 
times larger (see Appendix 6 for coefficients), so the bands are twice as wide. All of 
these considerations are brought together in Fig. 15-23. 
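
The first-order band end points for all six COs can be tabulated with a few lines of Python. This is a sketch under stated assumptions: simple Hückel theory with α = 0 and β = −1, the standard real benzene MO coefficients written out explicitly (degenerate pairs chosen with a node or an antinode at linking atoms 1 and 4), and labels `pi1` to `pi6` and function names invented here.

```python
import math

ALPHA, BETA = 0.0, -1.0
S6, S12 = math.sqrt(6.0), math.sqrt(12.0)

# (label, unperturbed energy, coefficients on ring atoms 1..6)
BENZENE_MOS = [
    ("pi1", ALPHA + 2 * BETA, [c / S6 for c in (1, 1, 1, 1, 1, 1)]),
    ("pi2", ALPHA + BETA,     [c / 2.0 for c in (0, 1, 1, 0, -1, -1)]),
    ("pi3", ALPHA + BETA,     [c / S12 for c in (2, 1, -1, -2, -1, 1)]),
    ("pi4", ALPHA - BETA,     [c / S12 for c in (2, -1, -1, 2, -1, -1)]),
    ("pi5", ALPHA - BETA,     [c / 2.0 for c in (0, 1, -1, 0, 1, -1)]),
    ("pi6", ALPHA - 2 * BETA, [c / S6 for c in (1, -1, 1, -1, 1, -1)]),
]

def first_order_energy(e0, coeffs, ka):
    """Unperturbed MO energy plus the 1-4 linking term 2*beta*c1*c4*cos(ka)."""
    c1, c4 = coeffs[0], coeffs[3]
    return e0 + 2.0 * BETA * c1 * c4 * math.cos(ka)

def band_table():
    """Energies at ka = 0, pi/2, pi for each band, keyed by label."""
    points = (0.0, math.pi / 2.0, math.pi)
    return {label: tuple(first_order_energy(e0, c, ka) for ka in points)
            for label, e0, c in BENZENE_MOS}

if __name__ == "__main__":
    for label, energies in band_table().items():
        print(label, " ".join(f"{e:+.3f}" for e in energies))
```

In these units a more negative number means a more stable level, so "runs up" corresponds to values that become less negative (or more positive) from left to right. The table shows flat π2 and π5 bands, and π3 and π4 bands twice as wide as π1 and π6.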

The first-order simple HMO band diagram of Fig. 15-23 must be considered tentative 
because it shows two curve crossings and we have not yet considered whether these are 
allowed by symmetry. The noncrossing rule tells us that such crossings are forbidden 
if the COs agree in symmetry for all operations, so it is necessary to compare the 
symmetries of π2 and π3 and also of π4 and π5. Examination of the COs sketched in 
Fig. 15-23 makes it clear that, in each pair, there is symmetry disagreement for reflection 
through a plane perpendicular to the polymer plane and containing the polymer axis. 
(Call this σ1.) Therefore, the crossings are allowed. 


Figure 15-23 » First-order simple Hückel band diagram for PPP showing fragments of COs at 
|k| = 0, π/a. The curves are sketched between points computed only at |k| = 0, π/2a, and π/a. 
(From Lowe et al. [4].) 




One might select a different plane, say the one that is perpendicular to the polymer 
axis and bisects a benzene ring. (Call it σ2.) This also appears to be a plane of symmetry 
disagreement between π4 and π5, but, surprisingly, this disagreement alone would not 
allow crossing. This subtlety arises from the existence of two degenerate COs for 
each value of |k| (except at 0 and π/a). For example, π4 at |k| = π/2a is multiplied 
by either +i or −i for each translation by a. For convenience in picturing COs, we 
take appropriate linear combinations to achieve the real forms that correspond to the 
+, 0, −, 0, +, 0, − pattern of a cosine wave and the corresponding 0, +, 0, −, 0, +, 0 
sine wave pattern, just as we did for Figs. 15-8b and c. These real pairs for π4 and π5 are 
shown in Fig. 15-24. Evidently, if one member of a pair is symmetric for the reflection 
σ2, the other is antisymmetric. This is simply a manifestation of a general property 
of sine and cosine functions and obviously holds for all the bands at intermediate |k| 
values. Now we can see that the apparent symmetry disagreement we found between 
π4 and π5 for σ2 reflection is only part of the story and that, if one of the real π4 COs 
disagrees with a real π5 CO in symmetry, the other real π4 must agree with it. This in 
turn means that, when we convert back to exponential versions, which are sine-cosine 
mixtures, there will always be some symmetry agreement between π4 and π5, so the 
bands still cannot cross (unless there is some other symmetry disagreement). Therefore, 
symmetry with respect to a reflection perpendicular to the k axis is not useful in 
band-crossing analyses. (A more general approach to such analyses is to assign symmetry 
labels to the COs, which effectively reveals symmetry agreement or disagreement for 
all symmetry operations of the polymer.) 

Figure 15-24 » Sine and cosine versions of two crystal orbitals of PPP. (a) π4 at |k| = π/2a. 
(b) π5 at |k| = π/2a. σ1 and σ2 indicate where reflection planes intersect the molecular plane. (From 
Lowe et al. [4].) 

Figure 15-25 » EHMO π-energy bands in eV for (a) PPP, (b) PPP-N2, (c) PPP-N2N6. All C-C 
and C-N bond lengths are set at 1.40 Å, all angles at 120°. (From Lowe et al. [4].) 

A variational EHMO band calculation results in the diagram of Fig. 15-25a. There 
is much similarity with the simple Hückel first-order band diagram of Fig. 15-23, but 
there are also some differences. The π2 and π5 lines are not exactly horizontal, so 
these bands now have finite width. This results from the ability of AOs separated by 
two or more bonds to interact weakly in EHMO calculations. The π4 and π6 bands are 
much wider than π1 and π3, a result of the inflation at high energies characteristic of 
EHMO calculations. The CO coefficients indicate that some mixing of Bloch sums has 
occurred. For instance, in π1 at k = 0 not all carbon coefficients are equal: The ones 
on the linking para carbons are larger. Variational mixing has managed this because it 
produces more overlap population and therefore a lower minimum energy root. Finally, 
the degeneracies at k = π/a of π1 and π3 and also of π4 with π6 are lost. The gap 
between π4 and π6 results from an avoided crossing. We can tell that this is the case 
because the EHMO-computed wavefunction for π6 at k = π/a, when sketched (not 
shown), looks like the second-highest CO sketched at the right of Fig. 15-23. That is, 
it is made up of φ4-type monomer MOs, so it is the “intended correlation” point for 
π4. But π4 and π6 are both symmetric for σ1, hence cannot cross. On the other hand, 
examination of the CO coefficients at k = π/a for π1 and π3 shows that this gap does 
not result from an avoided crossing. These bands merely fail to meet as they run up or 
down to their intended correlation points. 

Figure 15-26 » π1 and π3 at |k| = π/a, shown at left as Bloch sums from the benzene basis, are
degenerate and mix under the influence of a perturbation at position 2 to produce proper zeroth-order
functions π_A and π_B. (From Lowe et al. [4].)

The effects of chemical substitution on the band structure of this system can be analyzed
using perturbation arguments, just as was done earlier. Consider the effect, to first
order, of replacing the C-H group at position 2 (see Fig. 15-21a) with a nitrogen atom
to form what we will code as PPP-N2. Because of nitrogen's greater electronegativity
(more negative valence-state ionization energy) we expect any band's energy to be
lowered at each k value by an amount proportional to the amount of electronic charge
the band places at position 2. All of the COs place some charge at position 2 when
k = 0 (see Fig. 15-23), so all of these band energies should drop on the left side of the
band diagram. It appears that all the COs place charge at position 2 for k = π/a also,
but now we have degeneracies (or near-degeneracies) to consider. We need to discover
the linear combinations of degenerate COs that lead to zeroth-order COs having the
greatest difference in their responses to the perturbation. Clearly it should be possible
to mix π1 and π3 to achieve cancellation at position 2. The orthogonal mate for this
new CO is correspondingly larger at position 2 because the charge placed at any site
by this pair of COs must not vary with CO mixing. Figure 15-26 shows the results of
mixing π1 and π3 to produce π_A and π_B. Note that π_A is zero not only at position 2
but also at position 6, due to the symmetries of π1 and π3 with respect to a σ1 reflection.
We expect, then, that π_A will be unaffected by substitution at position 2, whereas
π_B will be strongly lowered in energy. A similar analysis results for COs π4 and π6.
Looking at the results of an EHMO variational calculation on PPP-N2 (Fig. 15-25b),
we find that these expectations are borne out: All band energies at k = 0 are lowered, as
are all but two at k = π/a. These two exceptions (π2 and π6) are essentially unaffected
by nitrogen substitution at position 2.
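
The search for proper zeroth-order COs can be sketched numerically. The following is only an illustration under stated assumptions, not the calculation of ref. [4]: the coefficients c1 and c3 of the two degenerate COs at position 2 are hypothetical values, and the perturbation is modeled as a change delta_alpha in the Coulomb integral at that one site.

```python
import math

def zeroth_order_mix(c1, c3, delta_alpha):
    """Mix two degenerate, orthonormal COs whose coefficients at the
    perturbed site are c1 and c3 (hypothetical inputs).

    Returns (coef_A, coef_B, dE_A, dE_B): the site coefficients of the two
    proper zeroth-order COs and their first-order energy changes.
    """
    norm = math.hypot(c1, c3)
    # pi_A = (c3*psi1 - c1*psi3)/norm vanishes at the perturbed site.
    coef_A = (c3 * c1 - c1 * c3) / norm
    # pi_B = (c1*psi1 + c3*psi3)/norm, the orthogonal mate, carries all the charge.
    coef_B = (c1 * c1 + c3 * c3) / norm
    # First-order energy change = delta_alpha * (site coefficient)^2.
    return coef_A, coef_B, delta_alpha * coef_A**2, delta_alpha * coef_B**2

c1, c3 = 0.30, 0.40          # assumed coefficients at position 2
delta_alpha = -1.0           # nitrogen lowers the site energy (arbitrary units)
coef_A, coef_B, dE_A, dE_B = zeroth_order_mix(c1, c3, delta_alpha)

# The total charge the pair places at the site is unchanged by the mixing.
charge_before = c1**2 + c3**2
charge_after = coef_A**2 + coef_B**2
print(coef_A, coef_B, dE_A, dE_B, charge_before, charge_after)
```

With these assumed inputs one combination is untouched by the substitution while the other takes the full lowering, which is the pattern described above for π_A and π_B.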

Notice that the π2–π3 and π4–π5 crossings in PPP become avoided crossings in
PPP-N2. This results from the loss of σ1 reflection as a symmetry operation for the
system when nitrogen is substituted at position 2. Whereas these COs can have different
symmetries for σ1 reflection in PPP, this is not possible for PPP-N2. (There is only
one point-symmetry operation for PPP-N2, namely a reflection through the molecular
plane. All π COs are antisymmetric for this. Hence no crossings are possible among
π bands.)

When we replace yet another C-H (at position 6) with nitrogen, giving PPP-N2N6,
we find (Fig. 15-25c) that the band energies all move down some more in energy except
for the two that did not move before, because, as we saw, the corresponding COs
for these two are also zero at position 6. This substitution restores σ1 reflection as a
symmetry operation, so the π4–π5 crossing recurs. Accidental degeneracy now occurs
for π2 and π3 at k = π/a.

15-14 Energy Calculations


Suppose we wish to compare the calculated energy of regular polyacetylene to that of a 
bond-alternating version, or that of PPP with PPP-N2. How do we get energies from the 
band calculations for these two cases? In a Hückel-type calculation of a molecule, we 
simply add up the one-electron energies. In a band calculation we are faced with a very 
large set of one-electron energies—one for each k value. How do we deal with this? 

Consider again the simple Hückel band diagram for regular polyacetylene
(Fig. 15-17a). This band diagram indicates that electrons in the π CO at k = 0 have an
energy of α + 2β, those at k = π/a have E = α, and those at k = π/2a have an energy
somewhat below α + β. It is fairly obvious that the CO average energy is somewhat
below α + β. Each CO has two electrons delocalized over the polymer, but the net
number of electrons per unit cell is two. Therefore, with an average π CO energy of
below α + β and two π electrons per unit cell, we have a π energy per unit cell below
2α + 2β. To be more precise would require a more precise average energy for the CO.
[Even without such a calculation, however, we can see that bond alternation causes the
average energy of the occupied π band to drop (Fig. 15-17b), so the π-electron energy
per unit cell is lower for the alternating structure.]

The problem, then, is to calculate an accurate average energy for each of the filled 
bands, the average being over the first Brillouin zone, and then multiply each such 
average energy by the number of electrons in that band, per unit cell. The sum of these 
is the Hückel total energy per unit cell. (This procedure assumes that the bands are not 
partially filled.) 
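
The averaging procedure can be sketched for the polyacetylene π band. This is a minimal illustration assuming a simple Hückel model with two carbons per unit cell, where beta1 and beta2 (names chosen here) are the resonance integrals within and between cells. The size of the bond alternation, delta, and the choice to strengthen one bond and weaken the other by the same fraction are assumptions made for the example.

```python
import cmath
import math

def occupied_band_energy(k, a, alpha, beta1, beta2):
    """Lower Hückel pi-band energy of a chain with two carbons per cell.

    beta1 is the resonance integral inside the cell and beta2 the one
    between cells (both negative); beta1 == beta2 is the regular chain.
    """
    return alpha - abs(beta1 + beta2 * cmath.exp(1j * k * a))

def average_over_fbz(energy, a, n_points=2000):
    """Midpoint-rule average of energy(k) over -pi/a < k <= pi/a."""
    step = 2 * math.pi / (a * n_points)
    ks = (-math.pi / a + (j + 0.5) * step for j in range(n_points))
    return sum(energy(k) for k in ks) / n_points

a, alpha, beta = 1.0, 0.0, -1.0      # units chosen so that alpha = 0, beta = -1
delta = 0.1                          # assumed fractional bond alternation

avg_regular = average_over_fbz(
    lambda k: occupied_band_energy(k, a, alpha, beta, beta), a)
avg_alternating = average_over_fbz(
    lambda k: occupied_band_energy(k, a, alpha, beta * (1 + delta), beta * (1 - delta)), a)

# Two pi electrons per unit cell occupy the lower band.
energy_per_cell_regular = 2 * avg_regular
energy_per_cell_alternating = 2 * avg_alternating
print(avg_regular, energy_per_cell_regular, energy_per_cell_alternating)
```

For the regular chain the band is α + 2β cos(ka/2), whose average is α + (4/π)β, about α + 1.27β, consistent with the estimate "somewhat below α + β". Within this model the alternating chain gives a lower energy per unit cell.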

The practical difficulty in doing this is that each variational calculation is carried
out at a single point in k space. Thus, for the variational bands sketched in Fig. 15-23,
an EHMO calculation was made at k = 0, another was made at k = π/2a, and a third
was made at k = π/a. Then lines were drawn to connect the energy points found at
these k values, with consideration being given to the noncrossing rule. Obviously, this
produces a rather qualitative diagram. To refine it would require making additional
variational calculations at intermediate points in k space. Thus, we must be concerned
with the trade-off between accuracy of final average energy and effort needed to achieve
well-characterized band energies from which to calculate that average.

Achieving accurate average CO energies from values at a few k points is possible if 
those points are sensibly chosen and if appropriate weighting factors are employed. The 
problem of choosing the correct few points and their weight factors has been worked out 
by Chadi and Cohen [5] for multidimensional systems of various symmetries. Although 
this is a matter of real practical interest, we will not explore it further here. 
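
Although the multidimensional point sets of ref. [5] are not reproduced here, the idea that a few well-chosen k points can give an accurate average is easy to illustrate in one dimension. The sketch below is an assumption-laden analog, not the Chadi–Cohen scheme itself: the band is taken to be a short cosine series with hypothetical coefficients, and the sample points are equally weighted midpoints in 0 < k < π/a.

```python
import math

def midpoint_k_points(n_points, a):
    """n_points equally spaced k values in 0 < k < pi/a (all with equal weight)."""
    return [(2 * j - 1) * math.pi / (2 * n_points * a) for j in range(1, n_points + 1)]

def band_energy(k, a, coeffs):
    """E(k) = sum_m coeffs[m] * cos(m*k*a); coeffs[0] is the exact band average."""
    return sum(c * math.cos(m * k * a) for m, c in enumerate(coeffs))

a = 1.0
coeffs = [0.30, -1.00, 0.25, -0.05]   # hypothetical Fourier coefficients of a band

def sampled_average(n_points):
    ks = midpoint_k_points(n_points, a)
    return sum(band_energy(k, a, coeffs) for k in ks) / n_points

one_point = sampled_average(1)    # k = pi/2a only
two_point = sampled_average(2)    # k = pi/4a and 3pi/4a
print(one_point, two_point, coeffs[0])
```

With N such points every cosine term cos(mka) with 0 < m < 2N averages to zero, so two points already recover the exact average of this four-term band, whereas the single point k = π/2a does not.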


15-15 Two-Dimensional Periodicity and Vectors
in Reciprocal Space


In one-dimensional problems there is a single translation direction, a single step size
a, and a single accompanying variable k. It is a simple matter in such problems to treat
a and k as scalars and to plot band energies versus k in the range of the first Brillouin
zone, −π/a < k ≤ π/a. Step size a has dimensions of length and k of reciprocal length
to give an argument for exp(ikna) that is dimensionless.

Figure 15-27 » (a) A two-dimensional rectangular lattice (solid points) showing translation vectors
a1 and a2 as well as reciprocal vectors b1 and b2. (b) The reciprocal lattice (hollow points) with
the first Brillouin zone outlined by dashed lines. (c) The first Brillouin zone with the reduced first
Brillouin zone (RFBZ) shaded.


In multidimensional systems we must keep track of k values for waves oriented along 
different directions, and we need to calculate each CO energy at a point correspond- 
ing to k values for each of these directions. This makes treatment of such problems 
considerably more complicated. It is convenient and conventional to handle all this in 
terms of vectors. We will outline the vector treatment here, using the two-dimensional 
rectangular lattice shown in Fig. 15-27a as an example. 

For a three-dimensional crystal, we label the translation vectors for a unit cell in
real space a1, a2, and a3. Any point r in the unit cell can then be expressed in terms of
vectors a. For the vectors in reciprocal space, we use the symbols b1, b2, and b3. The
position k in reciprocal space can then be expressed in terms of vectors b. Vectors a
have dimensions of length, and b of reciprocal length.

The vectors b in reciprocal space are defined according to the formula

$$\mathbf{b}_i = 2\pi\,\frac{\mathbf{a}_j \times \mathbf{a}_k}{\mathbf{a}_i \cdot \mathbf{a}_j \times \mathbf{a}_k} \tag{15-18}$$

This makes each vector b_i orthogonal to the plane of the "other" two a vectors, a_j and
a_k. In the case of a two-dimensional crystal, the missing vector a3 is assumed to be a
unit vector perpendicular to the plane of a1 and a2. The denominator of Eq. (15-18)
is a scalar with the same value in all three cases, so the length of b_i is greater if the
vector product of a_j and a_k is greater, i.e., if |a_j||a_k| sin φ is greater, where φ is the
angle between a_j and a_k. In the example of Fig. 15-27a, all angles are the same, so
the fact that a1 is twice as long as a2 makes b1 half as "long" (in reciprocal distance)
as b2. The "length" of b_i is 2π/|a_i|, which means that these vectors are of appropriate
dimension to define edges for the first Brillouin zone. However, we will see that they
need some adjustment in position.
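
Equation (15-18) is easily checked numerically. The sketch below applies it to a rectangular lattice like that of Fig. 15-27a; the specific lengths (|a1| = 2, |a2| = 1, arbitrary units) and the function names are assumptions made for illustration.

```python
import math

def cross(u, v):
    return (u[1]*v[2] - u[2]*v[1], u[2]*v[0] - u[0]*v[2], u[0]*v[1] - u[1]*v[0])

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def reciprocal_vectors(a1, a2, a3):
    """Return (b1, b2, b3) from Eq. (15-18): b_i = 2*pi*(a_j x a_k)/(a_i . a_j x a_k)."""
    volume = dot(a1, cross(a2, a3))          # same scalar for all three b vectors
    scale = 2 * math.pi / volume
    b1 = tuple(scale * c for c in cross(a2, a3))
    b2 = tuple(scale * c for c in cross(a3, a1))
    b3 = tuple(scale * c for c in cross(a1, a2))
    return b1, b2, b3

# Rectangular two-dimensional lattice with |a1| = 2|a2| (lengths are assumed values);
# the missing third vector is a unit vector perpendicular to the plane.
a1, a2, a3 = (2.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)
b1, b2, b3 = reciprocal_vectors(a1, a2, a3)

length_b1 = math.sqrt(dot(b1, b1))
length_b2 = math.sqrt(dot(b2, b2))
print(b1, b2, length_b1 / length_b2)
```

The result satisfies a_i · b_j = 2π when i = j and 0 otherwise, and b1 comes out half as long as b2, as stated above.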

As is indicated in Fig. 15-27a, moving a lattice point through translations a generates 
the real, or direct lattice. Moving a point in reciprocal space through translations b 
generates the reciprocal lattice. Figure 15-27a and b indicate that a real lattice with 
orthogonal vectors a results in a reciprocal lattice with orthogonal vectors b but that 
the reciprocal lattice is rotated with respect to the real one. When the vectors a are not 
orthogonal, the reciprocal lattice may look quite different from the real one. 

According to Fig. 15-27b, the zone defined by vectors b runs from 0 to 2π/|a|.
We seek an FBZ that runs from −π/|a| to π/|a| in each direction. We can arrange
this by drawing lines connecting one reciprocal lattice point to all its nearest neighbors
and then bisecting all these lines with perpendicular bisectors. This gives the
rectangle bounded by dashed lines in Fig. 15-27b. In essence, we are finding a cell
in reciprocal space that fills up that space when translated by steps b and that also
associates every point in reciprocal space with the reciprocal lattice point to which it
is closest. This is sometimes called a Wigner–Seitz unit cell. It is also the FBZ for
the crystal. In the example at hand, the FBZ looks like the zone defined by the b
vectors, simply shifted, but in more complex systems, these may look quite different.
Generally, the FBZ continues to have the symmetry of the real crystal. Thus, both the
direct lattice and the FBZ in Fig. 15-27 are symmetric for reflections in the xz and yz
planes.
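
The "closest lattice point" criterion translates directly into a test for whether a given k lies in the FBZ. The following sketch assumes a two-dimensional lattice and checks only the eight reciprocal lattice points surrounding the origin, which is adequate for the rectangular and hexagonal cases discussed in this chapter but is an assumption for strongly skewed lattices. The numerical values are illustrative.

```python
import math

def in_first_brillouin_zone(k, b1, b2, tol=1e-12):
    """True if k is at least as close to the origin as to every neighboring
    reciprocal lattice point n1*b1 + n2*b2 (the Wigner-Seitz criterion).

    Only points with n1, n2 in {-1, 0, 1} are checked (assumed sufficient here).
    """
    dist_origin = math.hypot(k[0], k[1])
    for n1 in (-1, 0, 1):
        for n2 in (-1, 0, 1):
            if n1 == 0 and n2 == 0:
                continue
            g = (n1 * b1[0] + n2 * b2[0], n1 * b1[1] + n2 * b2[1])
            if math.hypot(k[0] - g[0], k[1] - g[1]) < dist_origin - tol:
                return False
    return True

# Reciprocal vectors of a rectangular lattice with |a1| = 2 and |a2| = 1 (assumed lengths).
b1 = (math.pi, 0.0)
b2 = (0.0, 2 * math.pi)

inside = in_first_brillouin_zone((1.5, 3.0), b1, b2)     # |k1| < pi/2 and |k2| < pi
outside = in_first_brillouin_zone((1.6, 0.0), b1, b2)    # k1 exceeds pi/|a1| = pi/2
print(inside, outside)
```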

We have seen that one-dimensional systems have degenerate COs for equal values of 
|k| and so, if we wish to portray only the unique energies of the system, we need consider 
only the range from 0 to π/a. The analogous situation in two or three dimensions
is that symmetrically equivalent positions k give degenerate COs. Hence we need 
consider only a symmetrically unique portion of the FBZ—the reduced first Brillouin 
zone (RFBZ). Since the FBZ is symmetric for the reflections mentioned above, the 
RFBZ is the quadrant shown in Fig. 15-27c. 

It is standard to use the label Γ for the central point of the FBZ. Certain other unique
points are often labeled with symbols such as K, L, M, and X. These points often 
generate waves having the same periodicity as the crystal. 

If we choose a basis of one s-type AO on each real-space lattice point, then the COs
resulting from the special reciprocal-space lattice points labeled in Fig. 15-27c appear
as shown in Fig. 15-28. The point Γ has coordinate intercepts (or k1, k2 values) of (0, 0),
so exp(ik · na) = 1 and the s AO is translated without change from point to point, giving
the totally bonding CO of Fig. 15-28a. Point X corresponds to (0, π/a2), meaning that
no phase change occurs as we move through the direct lattice by steps of a1 but that
the phase reverses for each step by a2 (Fig. 15-28b). Point M is for (π/a1, π/a2), so
phase reversal occurs for a step by either a1 or a2, resulting in the totally antibonding
situation in Fig. 15-28c. The waves for points X and M can be seen to indeed have the
periodicity of the lattice.

Figure 15-28 » Bloch sums based on an s-type AO at each lattice point in the two-dimensional
rectangular lattice. (k1, k2) = (a) (0, 0), (b) (0, π/a2), (c) (π/a1, π/a2).

Comparison of these three COs leads us to expect lowest energy at Γ, higher energy
at X, and highest energy at M. A complete map of the CO energies would require
a surface lying over the RFBZ. Instead of showing this surface, it is standard to plot
the energy as a function of various straight-line paths connecting special points, e.g.,
from Γ to X, from X to M, from M to X′, from Γ to M. Such line plots are portrayed
side by side on a common k axis, as in Fig. 15-29, even though the identity of k (k1, k2,
or some combination) changes from panel to panel.
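
The expected energy ordering at the special points can be illustrated with a simple model. The sketch below assumes a nearest-neighbor Hückel treatment of the s-AO lattice, with hypothetical resonance integrals beta1 and beta2 for steps along a1 and a2. These parameter values are not taken from the text.

```python
import cmath
import math

def s_band_energy(k1, k2, a1, a2, alpha, beta1, beta2):
    """Energy of the Bloch sum of s AOs on a rectangular lattice.

    Assumes a nearest-neighbor Hückel model: beta1 couples neighbors along a1,
    beta2 couples neighbors along a2 (both negative for s-type AOs).
    """
    phase1 = cmath.exp(1j * k1 * a1)      # phase change per step along a1
    phase2 = cmath.exp(1j * k2 * a2)      # phase change per step along a2
    return alpha + 2 * beta1 * phase1.real + 2 * beta2 * phase2.real

a1, a2 = 2.0, 1.0                         # assumed lengths, |a1| = 2|a2|
alpha, beta1, beta2 = 0.0, -0.2, -1.0     # assumed: weaker coupling over the longer step

special_points = {
    "Gamma": (0.0, 0.0),
    "X": (0.0, math.pi / a2),
    "M": (math.pi / a1, math.pi / a2),
}
energies = {name: s_band_energy(k1, k2, a1, a2, alpha, beta1, beta2)
            for name, (k1, k2) in special_points.items()}
print(energies)
```

Evaluating the same function at closely spaced points along Γ to X, X to M, and so on would generate line plots of the kind described above.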


Figure 15-29 » Energy versus k along various paths in the RFBZ for the system of s-type AOs in
a rectangular two-dimensional lattice.


15-16 Periodicity in Three Dimensions—Graphite


Graphite is an important commercial material. Some of its uses derive from the fact that
it is a very good conductor of electricity, nearly as good as metals. The most stable
crystalline form of graphite (Bernal graphite) is depicted in Fig. 15-30a. It comprises
carbon atoms in two-dimensional sheets having hexagonal symmetry. These are stacked
so that half of the carbons in a sheet lie directly between carbons in adjacent sheets
while the other half lie between empty hexagon centers. The atoms within a sheet are
strongly covalently bonded to their neighbors, with nearest-neighbor bond lengths of
1.42 Å. The sheets are separated by 3.5 Å, indicative that the forces between them are
very weak. (This explains why they fracture into sheetlike fragments that can slide
over each other like a pile of playing cards, making ground graphite a good lubricant.)
A sensible approach to understanding the electronic structure of graphite, then, is to
first analyze the two-dimensional sheet and then consider the perturbation involved in
forming layers of sheets.

A section of a two-dimensional sheet, the translation vectors a1 and a2, and the
primitive unit cell are sketched in Fig. 15-30b. The primitive unit cell contains two
carbon atoms. The angle between a1 and a2 is 120°. Notice that the lattice points in real
space, at the vector origin and at the termini of integral numbers of steps from the origin,
fall at the centers of hexagons, where there are no atoms. This illustrates that there is no
particular identification of direct lattice points with atoms or other structural features of
the molecules that constitute a crystal. In the present instance the direct lattice, shown
in Fig. 15-30c, looks like a graphite crystal that has been rotated by 30° with an extra
point in the center of each hexagon. In general, even though the direct lattice need not
look identical to the crystal, it will have the same symmetry as the crystal, in this case
hexagonal. In Fig. 15-30d is shown the relation between the vectors a and the reciprocal
space vectors b. Since b1 must be perpendicular to a2, etc., it follows that the angle
between b vectors is 60°. The reciprocal lattice generated by the b vectors appears
in Fig. 15-30e and is again a centered-hexagon pattern, but rotated with respect to the
direct lattice. The FBZ is constructed by drawing lines from one reciprocal lattice point
to all its nearest neighbors and then cutting them with perpendicular bisectors. This
yields a hexagon, as shown in Fig. 15-30f. Also shown is the RFBZ. The entire FBZ
can be filled by putting the RFBZ through the symmetry operations of the hexagonal
sheet, so it includes all the points not equivalent by symmetry.

Figure 15-30 » (a) The crystal structure of Bernal graphite. Each point represents a carbon atom.
(b) Top view of a hexagonal sheet of graphite showing the unit cell and translation vectors for the
two-dimensional structure. The "origin point" of the vectors does not correspond to an atom location.
(c) The direct lattice produced by translations of the "origin point" of the a vectors. The full lines
emphasize that this is a lattice of centered hexagons. The graphite structure is indicated with dashed
lines. (d) Reciprocal lattice vectors b are shown in relation to vectors a. (e) The reciprocal lattice,
produced by translating the origin point of the b vectors. (f) Construction of the FBZ by bisecting
lines connecting one reciprocal lattice point to all of its nearest neighbors. The RFBZ is shaded.
(g) Geometry of the relation between point K and the vectors connecting it to the origin.

Points of special interest in the RFBZ are labeled Γ, M, and K. The point Γ corresponds
to the (k1, k2) values (0, 0). This is a unique point in the FBZ, so it produces
one nondegenerate Bloch sum for each basis function in the unit cell. The point M
corresponds to (π/a, 0). We might expect this to be a six-fold degenerate point because
it is one of six symmetrically equivalent points in the FBZ. However, the point diametrically
opposite this in the FBZ, at (−π/a, 0), does not produce a separate independent
Bloch sum because, if we combine these complex functions to form a real (cosine)
wave, the other real (sine) wave puts a node at every real lattice point, meaning that
the basis function is multiplied everywhere by zero, which is not acceptable. This is
similar to what we saw in the one-dimensional case, which led us to exclude −π/a
from the range of k. So here again we recognize that two points on opposite sides of
the FBZ where perpendicular bisectors intersect lines between reciprocal lattice points
really only account for one independent state. This means that the point M represents
three independent Bloch sums rather than six. We shall have more to say about this
later. The point labeled K does not lie on a b vector, so must be expressible as a sum
of two vectors. Figure 15-30g shows the relevant construction. Since the lowest ray
has length π/a, the central ray has length π/(a cos 30°). Then the length of k1 or k2
must be π/(2a cos² 30°), or 2π/3a. So (k1, k2) = (2π/3a, 2π/3a) at K. As will be
explained later, K does not have reduced degeneracy as M does.
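
The coordinates quoted for M and K can be verified from the lattice geometry. In the sketch below the Cartesian orientation of a1 and a2 is an arbitrary choice, and (k1, k2) is interpreted as the phase change per step along a1 or a2 divided by a, which is the sense in which the values (π/a, 0) and (2π/3a, 2π/3a) are used above. Function and variable names are illustrative.

```python
import math

def reciprocal_2d(a1, a2):
    """Two-dimensional reciprocal vectors satisfying a_i . b_j = 2*pi*delta_ij."""
    area = a1[0] * a2[1] - a1[1] * a2[0]
    b1 = (2 * math.pi * a2[1] / area, -2 * math.pi * a2[0] / area)
    b2 = (-2 * math.pi * a1[1] / area, 2 * math.pi * a1[0] / area)
    return b1, b2

def dot(u, v):
    return u[0] * v[0] + u[1] * v[1]

a = 1.0                                                     # lattice constant, arbitrary units
a1 = (a, 0.0)
a2 = (a * math.cos(math.radians(120)), a * math.sin(math.radians(120)))
b1, b2 = reciprocal_2d(a1, a2)

angle_b = math.degrees(math.acos(dot(b1, b2) / math.sqrt(dot(b1, b1) * dot(b2, b2))))

point_M = (b1[0] / 2, b1[1] / 2)                            # midpoint of b1
point_K = ((b1[0] + b2[0]) / 3, (b1[1] + b2[1]) / 3)        # corner of the hexagonal FBZ

# (k1, k2) in the sense used in the text: phase change per lattice step, divided by a.
k_M = (dot(point_M, a1) / a, dot(point_M, a2) / a)
k_K = (dot(point_K, a1) / a, dot(point_K, a2) / a)
print(angle_b, k_M, k_K)
```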

Since there are two carbon atoms per unit cell and four valence AOs per carbon,
we have eight basis functions per unit cell for a minimal valence basis set calculation.
Therefore, we expect eight bands. Since the system is planar we can distinguish two π
AOs and six σ AOs. We know that these two symmetry types will not interact for any
k, so we can predict or interpret the σ and π band behaviors independently.

Let us begin by making some predictions about the two π bands. As was the case
for polyacetylene, the two basis sets that are simplest to work with are the bonding (π_u)
and antibonding (π_g) combinations in a unit cell. We shall label the band resulting
from the bonding function π and the other band π*. At the point Γ, the unit cell basis
functions are simply translated along a1 and a2 with no change in phase, yielding the
COs sketched in Fig. 15-31a. The π CO is bonding in every bond, π* is antibonding,
so these will be widely separated in energy. Indeed, there is no way they could become
more widely separated by intermixing, so we can assume that, if we were to do a
variational calculation, these Bloch sums would not be modified.

At point M, translation by each step along a1 is accompanied by phase reversal
because k1 = π/a1, whereas translation along a2 involves no change because k2 = 0
(Fig. 15-31b). Now we find each carbon to be bonding to two neighbors and antibonding
to the third in the π CO and the reverse in the π* CO. Once again, it is not hard to see
that any mixing of these two would decrease their energy difference (Problem 15-24),
so these Bloch sums should be unchanged in a variational calculation. Therefore, the π
and π* bands should be separated in energy at M, but only by about one third as much
as at Γ. As we mentioned above, the COs sketched in Fig. 15-31b are each degenerate
with two other COs corresponding to symmetrically equivalent points in the FBZ,
for example, the points (0, π/a) and (π/a, −π/a). Sketching these (Problem 15-25)
shows that they are like the CO already sketched except rotated by ±60°.

Point K is more complicated to analyze because it has k values that do not produce
COs having the periodicity of the crystal. This means that the decreased degeneracy
we find for high-symmetry points like X in Section 15-15, caused because one of the
real coefficient waves has a node at every unit cell, does not occur at K. (If the wave
lacks the periodicity of the crystal, it cannot put a node at every unit cell, by definition.)
That means we will have two π and two π* solutions at K. Also, the fact that these
waves lack the periodicity of the crystal makes them more complicated to sketch.
Nevertheless, undaunted, we plunge ahead. Since K corresponds to (2π/3a, 2π/3a),
we expect steps along either a1 or a2 in real COs to be modulated by either cos(2nπ/3)
or sin(2nπ/3). For n = 0, 1, 2, 3, this gives the following repeating series for the cosine
and sine cases, respectively: 1, −0.5, −0.5, 1 and 0, 0.866, −0.866, 0. The results
of using these factors to modulate translations of the π_u and π_g basis functions are
sketched in Fig. 15-31c. Inspection of these sketches shows that the extent of bonding
and antibonding interactions around any carbon exactly cancels in every case. This
means that the π and π* bands are degenerate at K. Each of the pairs of COs sketched
in Fig. 15-31c is degenerate with two other pairs that are rotated by ±60°. Therefore,
the degeneracy of this energy level is 12.

Figure 15-31 » π and π* COs at the point (a) Γ, (b) M, (c) K.
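
These qualitative conclusions about the π and π* separation at Γ, M, and K can be checked with a nearest-neighbor Hückel model of the sheet. The sketch below is an illustration under that assumption, with α = 0 and β = −1 as arbitrary units and an arbitrary Cartesian orientation of the lattice. It is not the calculation behind Fig. 15-32.

```python
import cmath
import math

def pi_band_energies(k, alpha=0.0, beta=-1.0, a=1.0):
    """Hückel pi and pi* energies of a graphite sheet at Cartesian wavevector k.

    Assumes nearest-neighbor interactions only: each carbon has three neighbors
    at distance a/sqrt(3), 120 degrees apart. Returns (E_pi, E_pi_star).
    """
    d = a / math.sqrt(3)
    neighbors = [(d * math.cos(math.radians(t)), d * math.sin(math.radians(t)))
                 for t in (90, 210, 330)]
    f = sum(cmath.exp(1j * (k[0] * nx + k[1] * ny)) for nx, ny in neighbors)
    return alpha + beta * abs(f), alpha - beta * abs(f)

a = 1.0
# Reciprocal vectors for a1 = (a, 0) and a2 at 120 degrees to a1.
b1 = (2 * math.pi / a, 2 * math.pi / (a * math.sqrt(3)))
b2 = (0.0, 4 * math.pi / (a * math.sqrt(3)))

points = {
    "Gamma": (0.0, 0.0),
    "M": (b1[0] / 2, b1[1] / 2),
    "K": ((b1[0] + b2[0]) / 3, (b1[1] + b2[1]) / 3),
}
gaps = {}
for name, k in points.items():
    e_pi, e_pi_star = pi_band_energies(k)
    gaps[name] = e_pi_star - e_pi
print(gaps)
```

In this model the separation at M is one third of that at Γ, and it vanishes at K.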

The band diagram for two-dimensional graphite is sketched in Fig. 15-32 for a circuit
around the RFBZ. The π and π* bands behave essentially as we anticipated, being
widely separated at Γ, much less separated at M, and degenerate at K. We note that
there are three σ bands at lower energy and three more at higher energy, corresponding
to predominantly bonding and antibonding situations, respectively. However, these
bands are less easily analyzed because the 2s, 2p_x, and 2p_y AOs undergo varying extents
of mixing with each other as k varies. That is, these are bands where the variational
method causes extensive mixing of Bloch sums. Nevertheless, we can see patterns that
would not be difficult to investigate. For example, we can see that the π band and one
of the σ bands run up from Γ to M, whereas the other two σ bands run down. This is
consistent with the fact that the 2s and 2p_π AOs relate to corresponding AOs in adjacent
unit cells in one way, while 2p_σ AOs relate in the opposite way (Problem 15-28).

Figure 15-32 » Valence-band diagram for two-dimensional graphite. (Adapted from Painter and
Ellis [6].)


There are eight valence electrons per unit cell in neutral graphite. For the band 
diagram of Fig. 15-32 it is simple to figure out the band-population scheme. Eight 
electrons require four bands, and we see that the four lowest-energy bands lie below 
the other four bands for all values of k. This means that the four lowest-energy bands 
are filled and the others are empty. Therefore, the Fermi level comes at the top of the 
π band, which occurs at K. We see that there is an empty band (π*) at the same energy,
so there is no gap at the Fermi level. Thus the two-dimensional lattice is predicted to be 
a good electrical conductor. We could examine the possibility that lattice deformation 
would split the levels to produce a gap (a Peierls distortion), but, since we have structural 
information indicating that graphite does in fact have hexagonal symmetry, we will not 
pursue that point. 

If there were crossing of one of the σ* COs with the π CO over part of the range of
k, then some fraction of the electronic population would occupy the low-energy portion
of the σ* band at the expense of the high-energy portion of the π band. Analysis of this
would require more effort, but the result would still predict good conductivity since, 
instead of a full band “touching” the bottom of an empty band, we would have two 
partially filled bands. 

Next we consider the effects of stacking two-dimensional sheets to form the three-dimensional
crystal.^ As Fig. 15-30a indicates, the stacking pattern repeats the orientation
of a sheet after one intervening layer in an ABABAB stacking pattern. This
means that the three-dimensional unit cell must contain carbon atoms from two layers.
The unit cell now has three associated translation vectors. The two "intrasheet" translations
a1 and a2 are as before, and the "intersheet" translation a3 is perpendicular to
the planes of the sheets and 7.0 Å long. Because a3 is the longest vector in real space,
b3 is the shortest vector in reciprocal space, leading to a reciprocal lattice where sheets
of centered hexagons are layered with short intersheet distances. The resulting FBZ,
produced by perpendicular bisecting planes (since we are now working in three dimensions),
appears in Fig. 15-33a. The RFBZ is similar to that in Fig. 15-30f except that
it now has a third dimension, over which k3 ranges from 0 to π/a3 (Problem 15-26).

^A discussion of effects due to other stacking arrangements can be found in LaFemina and Lowe [7].

Figure 15-33 » (a) The FBZ and RFBZ for three-dimensional Bernal graphite. (b) View of the
unit cell for Bernal graphite, as seen along an axis parallel to the hexagonal sheets. (Compare to
Fig. 15-30a.)

Let us see if we can predict the modifications in energies of COs that should occur
at some of the special points in the RFBZ as a result of intersheet interactions. Because
three-dimensional Bernal graphite is symmetric for reflection through the plane of a
hexagonal sheet, we continue to have a valid distinction between σ and π COs. We
expect intersheet interactions to be small, so we can take a perturbational approach.
The interactions involving π COs should be largest since p_π AOs project farthest from
the plane, so we will restrict our attention to the π bands.

A side view of the unit cell appears in Fig. 15-33b. Obviously, some of the intersheet 
interactions are already included in the unit cell, and the others will occur between unit 
cells. We will first consider the energy changes due to new interactions within the cell. 
These do not depend on k values. Then we will consider energy changes caused by
stacking the unit cells. These depend on k3. We will ignore all interactions except those 
between carbons directly above or below each other in adjacent sheets. 

We begin with the situation where the π and π* bands are nondegenerate, e.g., at Γ
or M. We know from our two-dimensional treatment that each covalently bonded pair
of carbons in the unit cell will have a π_u MO when we are dealing with the π band and
a π_g MO for the π* band. (See Fig. 15-31a, b.) But we have a choice of arranging these
in the unit cell to give long-range bonding or antibonding between layers (Fig. 15-34).
These two arrangements cause the energies to rise or fall slightly (indicated by E↑ or
E↓), splitting both the π and π* bands.

Next we examine what happens when we stack these unit cells. Consider first the
point Γ. Here k3 = 0, so we stack the unit cells with no change. If we start with π_anti
of Fig. 15-34a, we obtain Fig. 15-35a, which shows that the unit cells interact in an
antibonding manner. Here, then, is a Bloch sum that is antibonding within and also
between unit cells. The symbol E↑↑ indicates two destabilizing interactions. Each
arrow represents the same amount of energy because the interlayer distance within
a unit cell is the same as that between unit cells and also because there are equal
numbers of inter- and intralayer interactions in the crystal. Similar sketches easily
demonstrate how the other three unit-cell function sets of Fig. 15-34 behave at Γ:
π_bond, E↓↓; π*_anti, E↑↑; π*_bond, E↓↓. Therefore, at Γ, the π and π* bands are both
split.

Now let us consider point A. Here we stack the unit cells together with a reversal
in sign. The situation for π_anti is shown in Fig. 15-35b. The intercell interaction has
reversed from the k3 = 0 case. This means that the E↑↑ and E↓↓ cases from Γ now
are E↑↓ and E↓↑. The levels are not split by interlayer effects at A.

When we consider points M and L, we again are dealing with the same π_u and π_g
functions in the unit cell (see Fig. 15-31b), so we obtain the same results with k3 = 0
or π/a3. The π and π* bands are split at M by the same amount as at Γ, and at L they
are not split at all.

At point K, the π and π* bands are degenerate. This means we must ask which are
the proper zeroth-order two-dimensional COs with which to analyze the perturbation.
We know that these will be four COs produced by mixing the four COs shown in
Fig. 15-31c. This mixing is determined by solving a 4 × 4 determinantal equation,
which, as was pointed out in Chapter 12, is the same equation we would use to discover
the COs having minimum and maximum energy changes as a result of the perturbation.
As a shortcut, then, we can simply look for the COs that respond most differently. Our
analysis at Γ and M tells us that the π_u and π_g bases will give a splitting of all the
COs. Not bad, but we can do even better by mixing the COs pictured in Fig. 15-31c.
Adding and subtracting the cos π* and cos π COs produces two new functions whose
unit cells (in two dimensions) have a 2p_π AO on one carbon and nothing on the other,
and vice versa. (In effect, we are taking the sum and difference of the π_u and π_g basis
functions that we constructed in the first place.) These new basis functions give us
four new unit cell bases for the three-dimensional case, shown in Fig. 15-36. Two of
these place all the 2p_π AOs on atoms that lie directly above and below each other, so
these rise or fall in energy due to the intracell interactions. The other two place all
the 2p_π AOs on carbons that do not have neighbors directly above or below them, so
these do not undergo energy change. The rise and fall here are larger than for the cases
pictured in Fig. 15-34 because the AO coefficients are larger when the basis set exists on
fewer atoms. These, then, are the proper zeroth-order functions with which to analyze
the stacking perturbation. (The argument here is equivalent to saying that a calculus
min-max problem will select roots −2, 0, 0, +2 in preference to −1, −1, +1, +1.)

Figure 15-34 » The four independent arrangements of π (or π_u) and π* (or π_g) functions in a unit
cell. Labels "anti" and "bond" refer to the interactions between AOs directly above/below each other.
E↑ and E↓ indicate whether these interactions raise or lower the energy.

Figure 15-35 » (a) The result of translating a π_anti unit-cell basis function one step along a3 with
no change of sign or value (k3 = 0). (b) Same as (a) except now sign reverses (k3 = π/a3).
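
The preference for the localized functions can be made concrete with a schematic four-AO model of the unit cell. This is only a caricature of the real problem: it ignores the in-plane Bloch modulation at K, keeps one interlayer interaction v between the two stacked carbons, and uses labels (A1, A2, B1, B2) introduced here for illustration.

```python
import math

# AO order: [A1, A2, B1, B2]; A and B are the two layers in the unit cell.
# Atoms A1 and B1 lie directly above/below each other; A2 and B2 do not.
v = 1.0   # assumed interlayer interaction between A1 and B1 (arbitrary units)
V = [[0, 0, v, 0],
     [0, 0, 0, 0],
     [v, 0, 0, 0],
     [0, 0, 0, 0]]

def matrix_element(c, d):
    """<c|V|d> for real coefficient vectors c and d."""
    return sum(c[i] * V[i][j] * d[j] for i in range(4) for j in range(4))

h = 0.5
# Unit-cell functions built from pi_u = (p1 + p2)/sqrt2 and pi_g = (p1 - p2)/sqrt2
# on each layer, combined in phase or out of phase between the layers.
delocalized = [(h, h, h, h), (h, h, -h, -h),
               (h, -h, h, -h), (h, -h, -h, h)]

s = 1 / math.sqrt(2)
# Functions localized on the stacked atoms (first two) or the unstacked atoms.
localized = [(s, 0, s, 0), (s, 0, -s, 0), (0, s, 0, s), (0, s, 0, -s)]

shifts_delocalized = [matrix_element(c, c) for c in delocalized]
shifts_localized = [matrix_element(c, c) for c in localized]
# Off-diagonal elements vanish only for the proper zeroth-order set.
max_offdiag_localized = max(abs(matrix_element(localized[i], localized[j]))
                            for i in range(4) for j in range(4) if i != j)
max_offdiag_delocalized = max(abs(matrix_element(delocalized[i], delocalized[j]))
                              for i in range(4) for j in range(4) if i != j)
print(shifts_delocalized, shifts_localized)
```

In units of v/2 the delocalized set gives shifts of +1, −1, +1, −1 with nonzero off-diagonal elements, whereas the localized set gives +2, −2, 0, 0 with a diagonal perturbation matrix, matching the roots quoted above.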

When we put these unit-cell bases together at K, where k3 = 0, we find that the functions
that do not interact within the cell do not interact between cells either (Fig. 15-37a).
These two cases remain unsplit in energy. The cases that interact within the cell interact
again and in the same way between cells, so these two cases are split even more
(Fig. 15-37b). At H, the intercell interactions reverse but the intracell interactions are
unchanged. The E′ cases are unaffected, but E↑ and E↓ become E↑↓ and E↓↑
(Fig. 15-37c), just as at A and L, so the splitting disappears at H.

A sketch of the π and π* bands for a route around the edges of the RFBZ appears 
in Fig. 15-38. In general, no splitting of these bands occurs for points on the top of the 
RFBZ. For points on the bottom, splitting is larger at K than at Γ or M. 

Computing the electronic energy per unit cell for graphite requires knowing the 
average energy of the occupied bands, not only over the paths shown but also over the 
interior points of the RFBZ. The method of Chadi and Cohen [5] can be used to select 
an optimized set of weighted points in the RFBZ for purposes of calculating accurate 
average energies. 

Figure 15-36 » The four independent unit-cell functions that result from requiring maximum 
difference in response to the interlayer interaction. E′ indicates no energy change. 

One might wonder whether the splittings depicted in Fig. 15-38 have any detectable 
consequences. One change that occurs as a result of this splitting is that the Fermi level, 
$E_F$, now refers to π electrons in COs that are not split along K-H by the perturbation, 
i.e., in COs like the one pictured in Fig. 15-37a. This means that the electrons having the 
highest energy are predicted to be in $2p_\pi$ AOs at carbons that do not have carbon atoms 
directly above or below them in the crystal. This comes about because the electrons that 
do have such neighbors are lowered slightly in energy due to weak bonding between 
layers. This is relevant because there exists an experimental technique called scanning 
tunneling microscopy (STM) that detects the locations of surface electronic charge 
having energy near the Fermi level. When STM maps are made of electron distribution 
at a clean graphite surface, a trigonal pattern is seen, rather than the expected hexagonal 
one. Evidently, not all the atoms are "seen" equally well. The trigonal pattern is 
consistent with more charge being detected over every other atom. If the device is 
tuned to include electronic charge from deeper energies, the familiar hexagonal pattern 
of graphite emerges. This suggests that the former experiment detects charge in the 
unsplit, horizontal band shown between K and H, corresponding to COs on atoms 
without neighbors in the adjacent layer, and that the latter measurement detects charge 
in both that band and the one just below it. (Of course, the situation at the surface is not 
identical to that in the bulk, but surface carbons having carbons directly below them 
will still have their $2p_\pi$ energies lowered somewhat by the weak bonding interaction.) 

Figure 15-37 » (a and b) Result of translating two of the functions from Fig. 15-36 along a3 at 
point K (k3 = 0, so no change in function). (c) Same as (b) except now at point L (k3 = π/a, so 
function reverses sign). 


15-17 Summary 


Understanding or predicting the nature of the electronic band structure for a periodic 
material requires that we identify a unit cell and accompanying basis set in real space 
and then move this basis set along translation vectors a, possibly with accompanying 
rotations. Each time the basis set is moved to a different cell, it is multiplied by a 
coefficient that can be taken from a formula of the type cos(kna), where n is the 
number of steps taken through a and k is the wavenumber for the wave that determines 
the coefficient. This process generates Bloch sums. Sometimes Bloch sums are the 
same as the COs, sometimes they are not. The reasons they might be different are 
that (1) there may be intended correlations that violate the noncrossing rule and (2) the 
basis set mixture in a unit cell may change with k in order to take better advantage of 
the changing interaction between unit cells, even when there is no problem with band 
crossings. 

Figure 15-38 » Sketch of π and π* band energies for a circuit around the RFBZ of 
three-dimensional graphite. Heavy dots at Γ, K, H, and M represent energies for the isolated sheet. 
Splittings at Γ and M should be about half that at K. 

Degeneracies at the Fermi level result in Peierls distortions in one-dimensional sys- 
tems, opening a band gap. Degenerate levels, whether at a gap or not, also allow 
one to predict which energies at the edges of the band diagram will be unaffected by 
certain chemical substitutions. Both of these phenomena are easily analyzed using 
the principle of “maximum difference in response" from degenerate-level perturbation 
theory. 
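
The gap opening that accompanies a Peierls distortion can be illustrated at the simple Hückel level. The sketch below is only an illustration under stated assumptions: a chain with two atoms per unit cell of length a, alternating resonance integrals called beta1 and beta2 here (hypothetical names and values), and energies measured relative to alpha. For that model the two band energies are alpha ± |beta1 + beta2 exp(ika)|, so the gap at the zone edge is 2|beta1 − beta2|.

```python
import cmath
import math


def huckel_bands(k, a=1.0, alpha=0.0, beta1=-1.0, beta2=-1.0):
    """Return (lower, upper) Hückel band energies at wavenumber k.

    Assumed model: two atoms per unit cell of length a, with resonance
    integral beta1 inside the cell and beta2 between neighboring cells.
    """
    coupling = abs(beta1 + beta2 * cmath.exp(1j * k * a))
    return alpha - coupling, alpha + coupling


def gap_at_zone_edge(beta1, beta2, a=1.0):
    """Energy gap between the two bands at k = pi/a."""
    lower, upper = huckel_bands(math.pi / a, a=a, beta1=beta1, beta2=beta2)
    return upper - lower


# Uniform bond lengths: the two bands touch at the zone edge (no gap).
print(gap_at_zone_edge(-1.0, -1.0))
# Alternating bond lengths: a gap of 2*|beta1 - beta2| opens.
print(gap_at_zone_edge(-1.1, -0.9))
```

With equal resonance integrals the half-filled band has a degeneracy at the Fermi level; any alternation removes it, which is the behavior summarized in the paragraph above.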

The values of k giving independent COs fall into a zone of reciprocal space called 
the first Brillouin zone. A subspace within it, called the reduced first Brillouin zone 
(RFBZ), contains all the k values that are needed to calculate every energy in the 
band system. Certain points at corners of the RFBZ are of special interest because 
they are easy to use for constructing Bloch sums. They often give waves having the 
same periodicity as the real lattice, which means that the cosine Bloch sum will give 
alternating positive and negative coefficients at each unit cell and the sine Bloch sum 
will give zero at each unit cell, hence not exist. This results in reduced degeneracy. 
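
The statement about zone-edge points can be checked numerically. The following is a minimal sketch, assuming a one-dimensional lattice with spacing a and coefficients of the cos(kna) and sin(kna) type; the function name and the number of cells are arbitrary choices made for illustration.

```python
import math


def bloch_coefficients(k, a=1.0, n_cells=6):
    """Cosine and sine Bloch-sum coefficients for cells n = 0 .. n_cells-1."""
    cos_part = [math.cos(k * n * a) for n in range(n_cells)]
    sin_part = [math.sin(k * n * a) for n in range(n_cells)]
    return cos_part, sin_part


# At k = pi/a the cosine coefficients alternate in sign from cell to cell,
# while the sine coefficients vanish (to rounding error) in every cell.
cos_edge, sin_edge = bloch_coefficients(math.pi)
print([round(c, 12) for c in cos_edge])
print([round(s, 12) for s in sin_edge])
```

Because the sine combination is zero everywhere at such a point, only one independent Bloch sum survives there, which is the reduced degeneracy mentioned above.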

Points in the FBZ that are related by symmetry to a point in the RFBZ correspond 
to COs that differ by a symmetry operation (e.g., a rotation) from the CO produced by 
the point in the RFBZ. 

Readers interested in further treatment of either the chemical or physical aspects of 
periodic systems should consult references at the end of this chapter [1-11]. 

15-17.A Problems 


15-1. Work out the relations in Eq. (15-3) (for a particle of mass m) so that they can be 
written as equalities rather than proportionalities. 


15-2. Derive the functional form for Fig. 15-2b. 


15-3. a) Obtain MOs and energies for benzene using Eqs. (15-4) and (15-5) with 
j = 0-5. 

b) Show that the results for j = 6 are identical to those for j = 0. 


15-4. a) Derive the mathematical formula for the curve of Fig. 15-5b. 

b) What would this function be if DOS/N were constant over the energy range? 


15-5. For benzene, j = 1 and j = −1 give different complex degenerate MOs. Produce 
these and find linear combinations that are real and orthogonal. Sketch these 
real MOs and compare them with those tabulated in Appendix 6 for the same 
energy. 


15-6. Demonstrate that Eq. (15-10) is correct if N is an integer. 


15-7. Using sketches, show that the Bloch sum for $(\mathrm{C_2H_2})_n$ at |k| = π/3a is different 
from that for $(\mathrm{CH})_{2n}$ at |k| = π/6a. Which do you think has the lower energy? 
What will happen in a variation calculation on each of these two cases? 


15-8. Suppose an infinite straight chain of atoms existed with a band of s orbitals that 
was one-third filled. Indicate what sort of bond-length pattern should result 
from a Peierls distortion. 


15-9. Figure 15-14 indicates that the 2s and $2p_\sigma$ Bloch functions avoid crossing. Use 
symmetry to argue why this is so. 


15-10. In connection with Fig. 15-18b, the text indicates that the avoided crossing 
between σ levels for alternating bond lengths becomes an allowed crossing 
when uniform bond lengths are used. This means that there is a symmetry 
disagreement for uniform bond lengths that does not exist for alternating bond 
lengths. The new symmetry element that comes into existence with uniform 
bond lengths is a two-fold screw axis. (The operation corresponding to this 
element is to rotate the polymer by 180° about the axis and also to translate 
it parallel to the axis.) Show, using sketches, that these σ bands disagree in 
symmetry for such a symmetry operation. 


15-11. Crystalline H2 (H-H ··· H-H ··· H-H) is an insulator. It is thought that it should 
become a metallic conductor at extremely high pressures. Explain why this is so. 


15-12. A heteronuclear diatomic molecule A-B crystallizes end to end to form a linear 
chain: ··· A-B ··· A-B ··· A-B. The internuclear distance between molecules 
is significantly longer than that within molecules. The monomer has a bonding 
valence σ MO and a higher energy σ* MO: 


Sketch a band structure diagram for the crystal showing qualitatively the 
energies of the crystal orbitals as a function of |k| in the first Brillouin zone. 
Sketch the appearances of the COs for the σ and σ* bands at |k| = 0 and π/a. 


15-13. André and Leroy [2] report an ionization energy of 8.19 eV and an electron 
affinity of −4.09 eV (i.e., energy is released in forming the anion) from 
applying Koopmans' theorem to their ab initio calculations on trans-polyacetylene 
with regular bond lengths. a) What is the energy of the HOMO? b) Of the 
LUMO? c) What is the value of the energy gap? d) What should happen to the 
values of these three numbers when the bond lengths are allowed to alternate? 


15-14. It is stated in Section 15-11 that it is improper to interpret a Hückel band 
diagram having zero gap as meaning that the ionization energy (from the 
HOMO energy) equals the electron affinity (from the equal LUMO energy). 
Yet we do use HOMO and LUMO Hückel energies to estimate the IE and EA 
in Chapter 8. What is the explanation for this apparent disagreement? 


15-15. In a variational EHMO calculation on PPP, the coefficients for $\pi_1$ at k = 0 
are 0.37 at linking carbons (1 and 4) and 0.28 at the others. This results from 
mixing the Bloch function $\pi_1$ at k = 0 (with all coefficients the same) with 
another Bloch function at k = 0. Which Bloch function, $\pi_x$, can mix with $\pi_1$ 
at k = 0 to accomplish this modification? (See Fig. 15-23 for sketches.) What 
happens to the energy of the $\pi_x$ band at k = 0 as a result of mixing with $\pi_1$? 
What happens to the coefficients in $\pi_x$? 


15-16. When PPP-N2 is substituted at position 6, the band energy values of −13.94 eV 
and −5.64 eV at k = π/a are not affected. Would you expect the same thing to 
happen if substitution occurred at position 5 instead of 6? Why or why not? 


15-17. The PPP diagram of Fig. 15-23 is constructed from Bloch sums built from 
benzene MOs. We could have used the MOs for the other choice of unit cell 
shown in Fig. 15-21a. Obtain the appropriate unit cell simple Hückel MOs and 
energies from Appendix 6 and construct the appropriate PPP band diagram. 
Discuss the differences with Fig. 15-23. 


15-18. Predict the first-order Hückel-level energy band diagram for 


O-O-O- 


Show the actual band-edge energies in terms of α and β. Show how the 
edge energies at k = 0 and π/a would shift if substitution occurred in all 
the nonlinking sites so that α′ = α + 0.25β, with no change in β. What if 
substitution occurred in only one nonlinking site in each monomer? 


15-19. Sketch the CO for point X′ in Fig. 15-27c for s-type unit-cell basis functions. 
Comparing with Fig. 15-28, how would you expect the energy of this CO to 
compare to the energies of COs at Γ, X, and M? 


15-20. Figure out the (k1, k2) coordinates and sketch the COs for s-type basis functions 
for the following points in the RFBZ of Fig. 15-27c. a) Halfway between X 
and M. b) Halfway between X′ and M. c) Halfway between Γ and X. d) 
Halfway between Γ and M. Rank these in order of energy. 


15-21. Sketch COs based on a 2 p, AO at each lattice point for the RFBZ of Fig. 15-27c, 
at points Γ, X, and X′. Rank according to predicted energy. 


15-22. What degeneracy would you anticipate for energies associated with points 
Γ, X, M, and X′ of Fig. 15-27c? 


15-23. The rectangular lattice of Fig. 15-27a leads to a rectangular RFBZ. What shape 
RFBZ would result for a square lattice? 


15-24. It is stated in the text that mixing π(M) and π*(M) of Fig. 15-31b will decrease 
the difference in their energies. Sketch the functions π(M) ± π*(M) and 
comment on their energies. 


15-25. For the FBZ of Fig. 15-30f, locate the points (0, π/a) and (π/a, −π/a). 
Sketch the π CO corresponding to the first of these and compare it with the π 
CO for (π/a, 0), sketched in Fig. 15-31b. How are these COs related? 


15-26. What physical situation allows us to use the half-range of 0 < k3 < π/a3 for 
the RFBZ pictured in Fig. 15-33a? 


15-27. Give a general argument for there being no interlayer-induced splitting of 
π bands for any point at the top of the RFBZ of Fig. 15-33a. What does your 
argument predict for σ bands? 


15-28. For two-dimensional graphite, sketch the Bloch sums at Γ and M resulting 
from a bonding pair of $2p_x$ AOs in each unit cell. Let x be the bisector of a1 
and a2. Which of these COs would you expect to have higher energy? 

Useful Integrals 

\[ \int_y^{\infty} x^n e^{-ax}\,dx = \frac{n!\,e^{-ay}}{a^{n+1}} \sum_{k=0}^{n} \frac{(ay)^k}{k!} \]

\[ \int_{-1}^{+1} e^{-ax}\,dx = (1/a)\left(e^{a} - e^{-a}\right) \]

\[ \int_{-1}^{+1} x\,e^{-ax}\,dx = (1/a^2)\left[e^{a} - e^{-a} - a\left(e^{a} + e^{-a}\right)\right] \]

\[ \int_{-1}^{+1} x^n e^{-ax}\,dx = (-1)^{n+1} A_n(-a) - A_n(a), \qquad A_n(a) \equiv \int_1^{\infty} x^n e^{-ax}\,dx \]

\[ \int_{-1}^{+1} x^n\,dx = \begin{cases} 0, & n = 1, 3, 5, \ldots \\ 2/(n+1), & n = 0, 2, 4, \ldots \end{cases} \]

\[ \int \sin x\,dx = -\cos x \]

\[ \int \cos x\,dx = \sin x \]

\[ \int \sin^2 x\,dx = \frac{x}{2} - \frac{\sin 2x}{4} \]

\[ \int \cos^2 x\,dx = \frac{x}{2} + \frac{\sin 2x}{4} \]

\[ \int x \sin x\,dx = \sin x - x\cos x \]

\[ \int x \cos x\,dx = \cos x + x \sin x \]

\[ \int x \sin^2 x\,dx = \frac{x^2}{4} - \frac{x \sin 2x}{4} - \frac{\cos 2x}{8} \]

Appendix 2 


Determinants 


A determinant is a scalar calculated from an ordered set of elements according to a 
specific evaluation recipe. The elements are ordered in a square array of rows and 
columns, bounded at left and right by straight vertical lines. For instance, (A2-1) is a 
2 x 2 determinant: 


\[ \begin{vmatrix} x & -i \\ 2 & y^2 \end{vmatrix} \tag{A2-1} \]


The recipe for evaluating a 2 x 2 determinant is: From the product of the elements 
on the principal diagonal (upper left to lower right) subtract the product of the other 
two elements. Thus, (A2-1) has the value $xy^2 + 2i$. 

Larger determinants are evaluated by a process that reduces them step by step to a 
linear combination of smaller determinants until, finally, they are all 2 x 2’s, which are 
then evaluated as above. The process of reduction involves the concept of a cofactor. 
As our example, we use the 4 x 4 determinant (A2-2), symbolized | M|, where M is the 
array of elements within the vertical bars: 


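\[ |M| = \begin{vmatrix} a_{11} & a_{12} & a_{13} & a_{14} \\ a_{21} & a_{22} & a_{23} & a_{24} \\ a_{31} & a_{32} & a_{33} & a_{34} \\ a_{41} & a_{42} & a_{43} & a_{44} \end{vmatrix} \tag{A2-2} \]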


The elements are numbered so that the first index tells which row, and the second 
index which column, the element is in. The cofactor of element $a_{11}$ is defined as the 
determinant obtained by removing the row and column containing $a_{11}$. We see in (A2-2) 
that striking out row 1 and column 1 gives us a 3 x 3 determinant (dashed outline) as 
cofactor of $a_{11}$. Symbolize this cofactor as $|A_{11}|$. 

To evaluate the determinant |M|, we expand in terms of cofactors. We begin by 
choosing any row or column of M. (We will choose row 1.) Then we write a linear 
combination containing every element in this row or column times its cofactor: 


\[ |M| = a_{11}|A_{11}| - a_{12}|A_{12}| + a_{13}|A_{13}| - a_{14}|A_{14}| \tag{A2-3} \]


The sign of each term in the linear combination is determined as follows. If the sum of 
row and column indices is even, the sign is plus. If the sum is odd, the sign is minus. 
Since the indices of $a_{12}$ and $a_{14}$ sum to odd numbers, they are minus in (A2-3). 
The method of expanding in cofactors is successively applied until a large 
determinant is reduced to 3 x 3's or 2 x 2's that can be evaluated directly (see Problem A2-1). 
Thus, a 5 x 5 is first expanded to five 4 x 4’s and each 4 x 4 is expanded to four 
3 x 3's giving a total of 20 3 x 3's. This method becomes extremely clumsy for large 
determinants. 
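
The expansion recipe can be written as a short recursive routine. This is a minimal sketch, not an efficient algorithm: it expands along row 1 exactly as in (A2-3), and the helper called `minor` here (a hypothetical name) builds the array whose determinant the text calls the cofactor, with the alternating sign applied separately.

```python
def minor(matrix, row, col):
    """Array left after striking out the given row and column."""
    return [r[:col] + r[col + 1:] for i, r in enumerate(matrix) if i != row]


def determinant(matrix):
    """Evaluate a determinant by cofactor expansion along the first row."""
    n = len(matrix)
    if n == 1:
        return matrix[0][0]
    if n == 2:
        # Principal-diagonal product minus the product of the other two.
        return matrix[0][0] * matrix[1][1] - matrix[0][1] * matrix[1][0]
    total = 0
    for col in range(n):
        # Row index 1 plus column index (col + 1): even sum -> plus, odd -> minus.
        sign = -1 if col % 2 else 1
        total += sign * matrix[0][col] * determinant(minor(matrix, 0, col))
    return total


m = [[2, 0, 1],
     [1, 3, 2],
     [1, 1, 4]]
print(determinant(m))                      # 18
print(determinant([m[1], m[0], m[2]]))     # rows 1 and 2 interchanged: -18
```

The number of 2 x 2 evaluations grows factorially with the size of the determinant, which is why this approach is impractical for large arrays.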

Some useful properties of determinants, symbolized |M|, are stated below without 
proof. The reader should verify that these are true using 2 x 2 or 3 x 3 examples, or by 
examining Eq. (A2-3). 


1. Multiplying every element in one row or one column of M by the constant c multi- 
plies the value of | M| by c. 


2. If every element in a row or column of M is zero, then | M| = 0. 


3. Interchanging two rows or columns of M to produce M′ results in |M′| = −|M|; 
i.e., it reverses the sign of |M|. 


4. Adding to any row (column) of M the quantity c times any other row (column) of 
M does not affect the value of the determinant. 


5. If two rows or columns of M differ only by a constant multiplier, then | M| = 0. 


A2-1 Use of Determinants in Linear Homogeneous Equations 


Suppose that we seek a nontrivial solution for the following set of linear homogeneous 
equations: 


\[ a_1 x + b_1 y + c_1 z = 0 \tag{A2-4} \]
\[ a_2 x + b_2 y + c_2 z = 0 \tag{A2-5} \]
\[ a_3 x + b_3 y + c_3 z = 0 \tag{A2-6} \]


Here, x, y, and z are unknown and the coefficients $a_i$, $b_i$, $c_i$ are given. Let us collect 
the coefficients into a determinant |M|, 


\[ |M| = \begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix} \tag{A2-7} \]


As before, let $|A_1|$ be the cofactor of $a_1$, etc. Now, multiply Eq. (A2-4) by $|A_1|$ 
(since $|A_1|$ is a determinant, it is just a number, and so this is a scalar multiplication), 
Eq. (A2-5) by $-|A_2|$, and Eq. (A2-6) by $|A_3|$ and add the results to get 


\[ x\left(a_1|A_1| - a_2|A_2| + a_3|A_3|\right) + y\left(b_1|A_1| - b_2|A_2| + b_3|A_3|\right) + z\left(c_1|A_1| - c_2|A_2| + c_3|A_3|\right) = 0 \tag{A2-8} \]


The coefficient of x is just |M|. The coefficients of y and z correspond to determinants 
having two identical columns and hence are zero. Therefore, 


\[ |M|\,x = 0 \tag{A2-9} \]

In order for x to be nonzero (i.e., nontrivial) it is necessary that |M| = 0. This is a result 
that is very useful. The condition that must be met by the coefficients of a set of linear 
homogeneous equations in order that nontrivial solutions exist is that their determinant 
vanish. 
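
As an illustration of this condition, the sketch below tests a set of three homogeneous equations whose coefficients are chosen here purely as an example (they are not taken from the text). When the determinant vanishes, the signed cofactors of the first row supply one candidate solution, provided they are not all zero; that construction is a standard property of singular arrays and is offered as an aside.

```python
def det3(m):
    """3 x 3 determinant by expansion along the first row."""
    (a1, b1, c1), (a2, b2, c2), (a3, b3, c3) = m
    return (a1 * (b2 * c3 - c2 * b3)
            - b1 * (a2 * c3 - c2 * a3)
            + c1 * (a2 * b3 - b2 * a3))


def first_row_signed_cofactors(m):
    """Signed cofactors of the first-row elements."""
    _, (a2, b2, c2), (a3, b3, c3) = m
    return (b2 * c3 - c2 * b3, -(a2 * c3 - c2 * a3), a2 * b3 - b2 * a3)


def residuals(m, v):
    """Left-hand sides of the equations evaluated at v = (x, y, z)."""
    return [sum(c * x for c, x in zip(row, v)) for row in m]


coeffs = [[1, 2, 3],
          [4, 5, 6],
          [7, 8, 9]]                       # illustrative coefficients only
print(det3(coeffs))                        # 0, so nontrivial solutions exist
solution = first_row_signed_cofactors(coeffs)
print(solution, residuals(coeffs, solution))
```

Any multiple of the printed solution also satisfies the equations, as expected for a homogeneous set.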


A2-2 Problems 


A2-1. Expand the 3 x 3 determinant (Fig. PA2-1) by cofactors and show that this result 
is equivalent to the direct evaluation of the 3 x 3 by summing the three products 
parallel to the main diagonal (solid arrows) and subtracting the three products 
parallel to the other diagonal (dashed arrows). 


1201 
0 Fund 3014 2 
A2-2. Evaluate (a) (b) 0 1 1| (o (a) ^20 forx 
2:3 1101 1 
111 
0211 


A2-3. Verify that the coefficient of y in Eq. (A2-8) is zero. 
A2-4. Consider the following set of linear homogeneous equations: 
4x + 2y − z = 0, 3x − y − 2z = 0, 2y + z = 0 
Do nontrivial roots exist? 
A2-5. Find a value for c that allows nontrivial solutions for the equations 


cx − 2y + z = 0, 4x + cy − 2z = 0, −8x + 5y − cz = 0 


A2-6. Five properties of determinants have been listed in this appendix. (a) Prove 
statement (5) is true assuming statements (1)-(4) are true. (b) Demonstrate 
statements (1)-(4) using simple examples. 


Appendix 3 


Evaluation of the Coulomb Repulsion 
Integral Over 1s AOs 


Evaluation of 


\[ \iint 1s(1)\,1s(2)\,(1/r_{12})\,1s(1)\,1s(2)\,dv(1)\,dv(2) \tag{A3-1} \]


where 


\[ 1s(1) = \sqrt{\zeta^3/\pi}\,\exp(-\zeta r_1) \tag{A3-2} \]


may be carried out in two closely related ways.¹ Each method is instructive and sheds 
light on the other, and so we will give both of them here. 

The first method works from a physical model and requires knowledge of two features 
of situations governed by the $1/r^2$ force law (e.g., electrostatics, gravitation). Suppose 
that there exists a spherical shell in which charge or mass is distributed uniformly, like 
the soap solution in a soap bubble. The first feature is that a point charge or mass 
outside the sphere has a potential due to attraction (or repulsion) by the sphere that 
is identical to the potential produced if the sphere collapsed to a point at its center 
(conserving mass or charge in the process). Thus, the electrostatic interaction between 
two separated spherical charge distributions may be calculated as though all the charge 
were concentrated at their centers. The second feature is that the potential is identical 
for all points inside the spherical shell; that is, if the core of the earth were hollow, a 
person would be weightless there. There would be no tendency for that person to drift 
toward a wall or toward the center. 

Armed with these facts, we can evaluate the integral. First, we remark that all the 
functions in the integrand commute. This enables us to write Eq. (A3-1) as 


\[ \iint 1s^2(1)\,(1/r_{12})\,1s^2(2)\,dv(1)\,dv(2) \tag{A3-3} \]


¹Other methods, not discussed here, also exist. See, for example, Margenau and Murphy [1, pp. 382-383]. 

Figure A3-1 » Sketch of spherical charge cloud $1s^2(2)$ with electron 1 at a distance $r_1$ from the 
nucleus. 


The functions $1s^2(1)$ and $1s^2(2)$ are just charge clouds for electrons 1 and 2, and 
the integral is evidently just the energy of repulsion between the clouds. Suppose 
(see Fig. A3-1) that, at some instant, electron 1 is at a distance $r_1$ from the nucleus. What 
is its energy of repulsion with the charge cloud of electron 2? The charge cloud of 
electron 2 can be divided into two parts: the charge inside a sphere of radius $r_1$ and the 
charge outside that sphere. From what we just said, electron 1 experiences a repulsion 
due to the cloud inside the sphere that is the same as the repulsion it would feel if that 
part of the cloud were collapsed to the center. We can calculate the energy due to this 
repulsion (call it the "inner repulsion energy") by dividing the product of charges by 
the distance between them: 


\[ \text{inner repulsion energy} = (\text{fraction of charge cloud 2 inside } r_1) \times 1/r_1 = 4\pi \int_0^{r_1} 1s^2(2)\,r_2^2\,dr_2 \times (1/r_1) \tag{A3-4} \]


where the factor 4π comes from integrating over $\theta_2$, $\phi_2$. Electron 1 also experiences 
repulsion from charge cloud 2 outside the sphere of radius $r_1$. But, from what we said 
above, electron 1 would experience this same "outer repulsion" no matter where it was 
inside the inner sphere. Therefore we will calculate the energy due to this repulsion 
as though electron 1 were at the center, since this preserves spherical symmetry and 
simplifies the calculation. It follows that all the charge in a thin shell of radius $r_2$ repels 
electron 1 through an effective distance of $r_2$. Integrating over all such shells gives 


\[ \text{outer repulsion energy} = 4\pi \int_{r_1}^{\infty} (1/r_2)\,1s^2(2)\,r_2^2\,dr_2 \tag{A3-5} \]


The total energy of repulsion between charge cloud 2 and electron 1 at $r_1$ is the sum 
of inner and outer repulsive energies. But electron 1 is not always at $r_1$. Therefore, 
we must finally integrate over all positions of electron 1, weighted by the frequency of 
their occurrence: 


\[ \text{repulsive energy} = 16\pi^2 \int_0^{\infty} 1s^2(1) \left\{ \frac{1}{r_1} \int_0^{r_1} 1s^2(2)\,r_2^2\,dr_2 + \int_{r_1}^{\infty} 1s^2(2)\,r_2\,dr_2 \right\} r_1^2\,dr_1 \tag{A3-6} \]

Figure A3-2 » 


The inner integrals contain but one variable, r2, and can be evaluated with the help of 
Appendix 1. After they are performed, the integrand depends only on $r_1$ and this is also 
easily integrated yielding 5ζ/8 as the result. A positive value is necessary since a net 
repulsion exists between two clouds of like charge. 
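
The value 5ζ/8 can be checked by carrying out the integrations of Eq. (A3-6) numerically. The sketch below is an illustration only: it uses a plain trapezoid rule on a uniform radial grid, with the grid limit `r_max` and point count `n` chosen arbitrarily (assumed adequate for ζ near 1 in atomic units), rather than the analytical route through Appendix 1.

```python
import math


def coulomb_1s1s(zeta=1.0, r_max=30.0, n=20000):
    """Numerically evaluate Eq. (A3-6) for 1s AOs with exponent zeta."""
    h = r_max / n
    r = [i * h for i in range(n + 1)]
    # 1s^2 charge density, (zeta^3/pi) exp(-2 zeta r)
    rho = [(zeta ** 3 / math.pi) * math.exp(-2.0 * zeta * x) for x in r]

    # Inner integral: charge density times r2^2, accumulated from 0 to r1.
    f_in = [rho[i] * r[i] ** 2 for i in range(n + 1)]
    inner = [0.0] * (n + 1)
    for i in range(1, n + 1):
        inner[i] = inner[i - 1] + 0.5 * h * (f_in[i - 1] + f_in[i])

    # Outer integral: charge density times r2, accumulated from r1 to r_max.
    f_out = [rho[i] * r[i] for i in range(n + 1)]
    outer = [0.0] * (n + 1)
    for i in range(n - 1, -1, -1):
        outer[i] = outer[i + 1] + 0.5 * h * (f_out[i] + f_out[i + 1])

    # (1/r1)*inner*r1^2 is written as r1*inner to avoid dividing by zero.
    g = [rho[i] * (r[i] * inner[i] + r[i] ** 2 * outer[i]) for i in range(n + 1)]
    total = h * (sum(g) - 0.5 * (g[0] + g[-1]))
    return 16.0 * math.pi ** 2 * total


print(coulomb_1s1s(1.0))   # close to 5/8 = 0.625
```

Doubling ζ doubles the result, as the linear dependence on ζ requires.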

The second method of evaluation is more mathematical and more general. The 
function $1/r_{12}$ is expressible² as a series of terms involving associated Legendre functions: 


\[ \frac{1}{r_{12}} = \sum_{l=0}^{\infty} \sum_{m=-l}^{+l} \frac{(l-|m|)!}{(l+|m|)!}\,\frac{r_<^{\,l}}{r_>^{\,l+1}}\,P_l^{|m|}(\cos\theta_1)\,P_l^{|m|}(\cos\theta_2)\,\exp[im(\phi_1 - \phi_2)] \tag{A3-7} \]


This infinite series will give the distance between particles 1 and 2 located at positions 
$r_1, \theta_1, \phi_1$ and $r_2, \theta_2, \phi_2$ (Fig. A3-2). All we need to do is pick the larger of $r_1$ and $r_2$ and 
call that $r_>$, the other being $r_<$, and substitute those numbers into the formula. We also 
need all the Legendre functions for $\cos\theta_1$ and $\cos\theta_2$, and values of $\exp[im(\phi_1 - \phi_2)]$ 
for all integral values of m. Putting all this together as indicated by Eq. (A3-7) would 
give a series of numbers whose sum would converge to the value of $1/r_{12}$. While this is 
an exceedingly cumbersome way to calculate the distance between two points, it turns 
out that use of the formal expression (A3-7) enables us to integrate Eq. (A3-1). This 
comes about because the Legendre functions satisfy the relation 

\[ \int_0^{\pi} P_l^{|m|}(\cos\theta)\,P_{l'}^{|m|}(\cos\theta)\,\sin\theta\,d\theta = \frac{2}{2l+1}\,\frac{(l+|m|)!}{(l-|m|)!}\,\delta_{l,l'} \tag{A3-8} \]
The first Legendre polynomial $P_0$ is equal to unity. Therefore, 1s(i), which has no 
θ dependence, can be written 


\[ 1s(i) = (\zeta^3/\pi)^{1/2}\,\exp(-\zeta r_i)\,P_0(\cos\theta_i) \tag{A3-9} \]


Thus, in the integral (A3-1) there will be an integration over $\theta_1$ of the form 

\[ \int_0^{\pi} P_0(\cos\theta_1)\,P_l^{|m|}(\cos\theta_1)\,P_0(\cos\theta_1)\,\sin\theta_1\,d\theta_1 \tag{A3-10} \]


for each term in the sum (A3-7) (and a similar integral over $\theta_2$). However since 
$1^2 = 1$, $[P_0(\cos\theta_1)]^2 = P_0(\cos\theta_1)$, and integral (A3-10) becomes [by Eq. (A3-8)] 


\[ \int_0^{\pi} P_0(\cos\theta_1)\,P_l^{|m|}(\cos\theta_1)\,\sin\theta_1\,d\theta_1 = 2\,\delta_{l,0} \tag{A3-11} \]


²See Eyring et al. [2, Appendix 5]. 

[m must equal zero here, otherwise the integral over φ will vanish.] In other words, all 
terms of the sum over l and m vanish except the first term, for which l = m = 0. This 
gives that the $1/r_{12}$ operator is equal to $1/r_>$. Hence, the repulsion integral is 


\[ 16\pi^2 \int_0^{\infty}\!\!\int_0^{\infty} \frac{1s^2(1)\,1s^2(2)}{r_>}\,r_1^2\,r_2^2\,dr_1\,dr_2 \tag{A3-12} \]


where $r_>$ is the greater of $r_1$, $r_2$. Suppose that we integrate over $r_2$ first. As $r_2$ changes 
value, it is sometimes smaller, sometimes larger than a particular value of $r_1$. When it 
is larger, $r_>$ is $r_2$. When it is smaller, $r_>$ is $r_1$. Putting this argument into mathematical 
form gives 


\[ 16\pi^2 \int_0^{\infty} \left[ \int_0^{r_1} \frac{1s^2(1)\,1s^2(2)}{r_1}\,r_1^2\,r_2^2\,dr_2 + \int_{r_1}^{\infty} \frac{1s^2(1)\,1s^2(2)}{r_2}\,r_1^2\,r_2^2\,dr_2 \right] dr_1 \]


Since the variable of integration in the two inner integrals is $r_2$, the quantities $1s^2(1)$, $r_1^2$, 
and $1/r_1$ may be brought outside these inner integrals, giving us the same equation (A3-6) 
that we obtained by the first method. This second method is more generally useful 
because it can be used when repulsions involving p, d, etc. charge clouds are calculated. 
In these cases, terms involving l = 1, 2, etc., become nonvanishing, but the series 
generally truncates after a few terms. 

Appendix 4 


Angular Momentum Rules 


A4-1 Introduction 


In Chapters 4, 5, and 7, we deal with the angular momentum of electrons due to 
their orbital motions and spins and also with angular momentum due to molecular 
rotation. The conclusions we arrive at are based partly on physical arguments related 
to experience with macroscopic bodies. While this is the most natural way to introduce 
such concepts, it is not the most rigorous. In this appendix we shall demonstrate how 
use of the postulates discussed in Chapter 6 leads to all of the relationships we have 
introduced earlier. The approach is based entirely on the mathematical properties of 
the relevant operators, making no appeal to physical models. 


A4-2 The Classical Expressions for Angular Momentum 


According to postulate II (Section 6-3), we must first find the classical physical 
expressions for the quantity of interest in terms of x, y, z, $p_x$, $p_y$, $p_z$, and t. We know that the 
angular momentum for an object of mass m moving with velocity v in a circular orbit 
of radius r is 


\[ \mathbf{L} = m\,\mathbf{r} \times \mathbf{v} = \mathbf{r} \times \mathbf{p} \tag{A4-1} \]


where the × symbol means we are taking a cross-product of vectors. To examine this 
in detail, we must resolve r and p into x, y, and z components: 


\[ \mathbf{r} = \mathbf{i}x + \mathbf{j}y + \mathbf{k}z \tag{A4-2} \]

\[ \mathbf{p} = m\mathbf{v} = m\mathbf{i}\frac{dx}{dt} + m\mathbf{j}\frac{dy}{dt} + m\mathbf{k}\frac{dz}{dt} = \mathbf{i}p_x + \mathbf{j}p_y + \mathbf{k}p_z \tag{A4-3} \]


Here i, j, k are unit vectors pointing, respectively, along the x, y, z Cartesian axes. 
A cross product is taken by expanding the following determinant: 


\[ \mathbf{L} = \mathbf{r} \times \mathbf{p} = \begin{vmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \\ x & y & z \\ p_x & p_y & p_z \end{vmatrix} \tag{A4-4} \]

to give 

\[ \mathbf{L} = \mathbf{i}(y p_z - z p_y) + \mathbf{j}(z p_x - x p_z) + \mathbf{k}(x p_y - y p_x) \tag{A4-5} \]

The coefficient for i is the magnitude of the x-component of angular momentum, etc. 
That is, 


\[ \mathbf{L} = L_x\mathbf{i} + L_y\mathbf{j} + L_z\mathbf{k} \tag{A4-6} \]

where we now see that 

\[ L_x = y p_z - z p_y \tag{A4-7} \]
\[ L_y = z p_x - x p_z \tag{A4-8} \]
\[ L_z = x p_y - y p_x \tag{A4-9} \]


(The simple cyclic x, y, z relationships among these formulas can be used as a memory 
aid.) We will also be interested in the square of the angular momentum, $L^2$: 


\[ L^2 = \mathbf{L}\cdot\mathbf{L} = (L_x\mathbf{i} + L_y\mathbf{j} + L_z\mathbf{k})\cdot(L_x\mathbf{i} + L_y\mathbf{j} + L_z\mathbf{k}) = L_x^2 + L_y^2 + L_z^2 \tag{A4-10} \]


where we have used the orthonormality of the unit vectors (i·i = 1, i·j = 0, etc.). Note 
that $L^2$ is a magnitude, not a vector. 

This gives the classical expressions we need for the x, y, and z components of angular 
momentum and for the square of its magnitude. 
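
A direct transcription of Eqs. (A4-7) through (A4-10) is sketched below as a quick numerical check; the function names and the sample position and momentum values are illustrative assumptions, not part of the text.

```python
def angular_momentum(r, p):
    """Components (Lx, Ly, Lz) of L = r x p, following Eqs. (A4-7)-(A4-9)."""
    x, y, z = r
    px, py, pz = p
    return (y * pz - z * py,
            z * px - x * pz,
            x * py - y * px)


def squared_magnitude(L):
    """L^2 = Lx^2 + Ly^2 + Lz^2, following Eq. (A4-10)."""
    return sum(component * component for component in L)


# A particle on the x axis moving in the +y direction circulates about z.
L = angular_momentum((1.0, 0.0, 0.0), (0.0, 2.0, 0.0))
print(L, squared_magnitude(L))
```

Interchanging r and p reverses the sign of every component, as expected for a cross product.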


A4-3 The Quantum-Mechanical Operators 


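Postulate II tells us what to do next: Replace $p_x$ with $(\hbar/i)\,\partial/\partial x$, and similarly for 
$p_y$, $p_z$. The resulting operators are: 


\[ \hat{L}_x = -i\hbar\left(y\frac{\partial}{\partial z} - z\frac{\partial}{\partial y}\right) \tag{A4-11} \]

\[ \hat{L}_y = -i\hbar\left(z\frac{\partial}{\partial x} - x\frac{\partial}{\partial z}\right) \tag{A4-12} \]

\[ \hat{L}_z = -i\hbar\left(x\frac{\partial}{\partial y} - y\frac{\partial}{\partial x}\right) \tag{A4-13} \]

and 

\[ \hat{L}^2 = \hat{L}_x^2 + \hat{L}_y^2 + \hat{L}_z^2 \tag{A4-14} \]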


All of these operators are hermitian. 
Now that we have the quantum-mechanical operators, we are free to transform them 
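to other coordinate systems. In spherical coordinates, they are¹ 


\[ \hat{L}_x = i\hbar\left(\sin\phi\,\frac{\partial}{\partial\theta} + \cot\theta\,\cos\phi\,\frac{\partial}{\partial\phi}\right) \tag{A4-15} \]

\[ \hat{L}_y = -i\hbar\left(\cos\phi\,\frac{\partial}{\partial\theta} - \cot\theta\,\sin\phi\,\frac{\partial}{\partial\phi}\right) \tag{A4-16} \]

\[ \hat{L}_z = -i\hbar\,\frac{\partial}{\partial\phi} \tag{A4-17} \]

\[ \hat{L}^2 = -\hbar^2\left(\frac{\partial^2}{\partial\theta^2} + \cot\theta\,\frac{\partial}{\partial\theta} + \frac{1}{\sin^2\theta}\,\frac{\partial^2}{\partial\phi^2}\right) = -\hbar^2\left[\frac{1}{\sin\theta}\,\frac{\partial}{\partial\theta}\left(\sin\theta\,\frac{\partial}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\,\frac{\partial^2}{\partial\phi^2}\right] \tag{A4-18} \]


¹See Eyring et al. [1, p. 40]. 


In atomic units, the quantity ħ becomes unity and does not appear. We will use 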
atomic units henceforth in this appendix. 


A4-4 Commutation of Angular Momentum Operators with 
Hamiltonian Operators and with Each Other 


We have indicated (Chapter 4) that a rotating classical system experiencing torque 
maintains E, $L_z$, and |L| (or, equivalently, $L^2$) as constants of motion, but not $L_x$ or $L_y$. 
We might anticipate a similar situation in quantum mechanics. This would mean that 
a state function ψ would be an eigenfunction for $\hat{H}$, $\hat{L}^2$, and $\hat{L}_z$ but not for $\hat{L}_x$ or $\hat{L}_y$. 
This in turn requires that $\hat{H}$, $\hat{L}^2$, and $\hat{L}_z$ commute with each other, but that $\hat{L}_x$ and $\hat{L}_y$ 
do not commute with all of them. 

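We first consider $\hat{H}$ and $\hat{L}^2$. Note that $\hat{L}^2$ appears in $\nabla^2$ [compare Eq. (A4-18) 
with (4-7)]: 


\[ \nabla^2 = \frac{1}{r^2}\,\frac{\partial}{\partial r}\left(r^2\frac{\partial}{\partial r}\right) - \frac{\hat{L}^2}{r^2} \tag{A4-19} \]


Also, since $\hat{L}^2$ does not contain the variable r, $\hat{L}^2$ commutes with any function 
depending only on r. Since $\hat{L}^2$ must also commute with itself, it follows that $\hat{L}^2$ and $\nabla^2$ 
commute, i.e., that 


\[ [\hat{L}^2, \nabla^2] = 0 \tag{A4-20} \]

If V in a hamiltonian operator is a function of r only, then 


\[ [\hat{L}^2, \hat{H}] = 0 \tag{A4-21} \]


This proves that $\hat{H}$ for a hydrogenlike ion commutes with $\hat{L}^2$. 
From the expressions for $\hat{L}^2$ and $\hat{L}_z$ in spherical polar coordinates, it is obvious that 


\[ [\hat{L}^2, \hat{L}_z] = 0 \tag{A4-22} \]


$\hat{L}_z$ does not operate on functions of r, and so $\hat{L}_z$ commutes with $\nabla^2$. For a spherically 
symmetric system, V = V(r) and we have that 


\[ [\hat{L}_z, \hat{H}] = 0 \tag{A4-23} \]


Thus, we have shown that $\hat{H}$, $\hat{L}^2$, $\hat{L}_z$ all commute in a system having a spherically 
symmetric potential. 

Now let us test $\hat{L}_x$ and $\hat{L}_y$ with each other and also with $\hat{L}_z$. 

\[ [\hat{L}_x, \hat{L}_y] = \hat{L}_x\hat{L}_y - \hat{L}_y\hat{L}_x \tag{A4-24} \]


We will examine this using the Cartesian system. We write out each operator product 
and subtract: 


\[ \hat{L}_x\hat{L}_y = -\left(y\frac{\partial}{\partial z} - z\frac{\partial}{\partial y}\right)\left(z\frac{\partial}{\partial x} - x\frac{\partial}{\partial z}\right) = -\left(y\frac{\partial}{\partial x} + yz\frac{\partial^2}{\partial z\,\partial x} - z^2\frac{\partial^2}{\partial y\,\partial x} - yx\frac{\partial^2}{\partial z^2} + zx\frac{\partial^2}{\partial y\,\partial z}\right) \tag{A4-25} \]

\[ \hat{L}_y\hat{L}_x = -\left(zy\frac{\partial^2}{\partial x\,\partial z} - xy\frac{\partial^2}{\partial z^2} - z^2\frac{\partial^2}{\partial x\,\partial y} + x\frac{\partial}{\partial y} + xz\frac{\partial^2}{\partial z\,\partial y}\right) \tag{A4-26} \]


Subtracting (A4-26) from (A4-25): 


\[ [\hat{L}_x, \hat{L}_y] = -\left(y\frac{\partial}{\partial x} - x\frac{\partial}{\partial y}\right) = \left(x\frac{\partial}{\partial y} - y\frac{\partial}{\partial x}\right) = i\hat{L}_z \tag{A4-27} \]

A similar approach to other operator combinations gives 

\[ [\hat{L}_y, \hat{L}_z] = i\hat{L}_x \tag{A4-28} \]

\[ [\hat{L}_z, \hat{L}_x] = i\hat{L}_y \tag{A4-29} \]


(Notice the x, y, z cyclic relation.) These commutation relations do not depend on 
choice of coordinate system. Use of the r, θ, φ coordinate system would give the 
same results. Evidently, these operators do not commute with each other since their 
commutators are unequal to zero. 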

From this point we will dispense with the caret symbol, since the context of the 
discussion makes it obvious that we are referring to operators. 

We still have not checked $L^2$ with $L_x$ and $L_y$. We will now show that

$$[L_x, L^2] = [L_x, L_x^2 + L_y^2 + L_z^2] = 0 \tag{A4-30}$$

We proceed by finding the commutator of $L_x$ with $L_x^2$, $L_y^2$, and $L_z^2$ individually. It is
obvious that the first of these, $[L_x, L_x^2]$, equals zero. The second can be evaluated as
follows.

$$L_xL_y - L_yL_x = iL_z \quad \text{(from A4-27)} \tag{A4-31}$$

We multiply (A4-31) from the left by $L_y$:

$$L_yL_xL_y - L_y^2L_x = iL_yL_z \tag{A4-32}$$

We multiply (A4-31) from the right by $L_y$:

$$L_xL_y^2 - L_yL_xL_y = iL_zL_y \tag{A4-33}$$
We sum (A4-32) and (A4-33):

$$L_xL_y^2 - L_y^2L_x = [L_x, L_y^2] = i(L_yL_z + L_zL_y) \tag{A4-34}$$

A similar strategy, starting with (A4-29), multiplying from left and right by $L_z$, and
summing, yields

$$L_z^2L_x - L_xL_z^2 = -[L_x, L_z^2] = i(L_zL_y + L_yL_z) \tag{A4-35}$$

Equation A4-34 minus A4-35 plus zero (from $[L_x, L_x^2]$) is equal to (A4-30) and is easily
seen to equal zero.
A similar proof gives

$$[L_y, L^2] = 0 \tag{A4-36}$$

It follows at once that, since $L_x$ and $L_y$ operate only on $\theta$ and $\phi$ and since $\nabla^2$ contains
all $\theta$ and $\phi$ terms in the form of $L^2$:

$$[L_x, \nabla^2] = [L_y, \nabla^2] = 0 \tag{A4-37}$$

If the potential V for a system is independent of $\theta$ and $\phi$, then

$$[L_x, H] = [L_y, H] = 0 \tag{A4-38}$$


We have shown that H for an atom and $L^2$ commute with each other and also with
$L_x$, $L_y$, and $L_z$, but that the latter three operators do not commute with each other.
Therefore, we can say that there exists a set of simultaneous eigenfunctions for H, $L^2$,
and one of $L_x$, $L_y$, $L_z$, but not the other two. We choose $L_z$ to be the privileged operator.
This means that an atom can exist in states having "sharp" values of energy, magnitude 
of angular momentum (hence square of L), and z component of angular momentum, 
but not x or y components. 

H and $L^2$ will commute if V in H is independent of $\theta$ and $\phi$ (central field potential).
H and $L_z$ will commute if V is independent of $\phi$, even if it is dependent on $\theta$. This
is the case for any linear system. Therefore, $M_L$ continues to be a sharp quantity for
linear molecules like H$_2$ or C$_2$H$_2$, but total angular momentum value does not, because
$L^2$ does not commute with H, so L is not a good quantum number. This is why the
main term symbol for a linear molecule is based on $M_L$, whereas the main term symbol
for an atom is based on L. 


A4-5 Determining Eigenvalues for $L^2$ and $L_z$


We will now make use of our operator commutation relations to determine the nature 
of the eigenvalues for $L^2$ and $L_z$. We begin by defining two new operators:

$$L_+ = L_x + iL_y \tag{A4-39}$$

$$L_- = L_x - iL_y \tag{A4-40}$$


These are called “step-up” and “step-down” operators, respectively, or “raising” and 
“lowering” operators. They correspond to no observable property and are not hermitian. 
They have been devised solely because they are useful in formal analysis of the sort we
are doing here. The reason for their names will become apparent soon.
We now prove the following theorem for the step-up operator $L_+$:

$$L_zL_+ = L_+(L_z + 1) \tag{A4-41}$$

Expanding $L_+$ gives

$$L_zL_+ = L_z(L_x + iL_y) = L_zL_x + iL_zL_y \tag{A4-42}$$


We now subtract zero in the form [see Eq. A4-29]

$$(L_zL_x - L_xL_z - iL_y) = 0 \tag{A4-43}$$

to obtain

$$L_zL_+ = L_xL_z + iL_y + iL_zL_y \tag{A4-44}$$

Next we add zero, this time in the form [see Eq. A4-28]

$$iL_yL_z - iL_zL_y + L_x = 0 \tag{A4-45}$$

This results in

$$L_zL_+ = L_xL_z + iL_yL_z + L_x + iL_y \tag{A4-46}$$

Rearranging,

$$L_zL_+ = (L_x + iL_y)L_z + L_x + iL_y = (L_x + iL_y)(L_z + 1) = L_+(L_z + 1) \tag{A4-47}$$


This proves Eq. A4-41. An analogous procedure proves the analogous relation for the 
step-down operator $L_-$:

$$L_zL_- = L_-(L_z - 1) \tag{A4-48}$$


Our ultimate goal is to discover the nature of the eigenvalues of $L^2$ and $L_z$. Since
these two operators commute, there must exist a common set of eigenfunctions for them.
Let us symbolize these simultaneous eigenfunctions of $L^2$ and $L_z$ with the symbol Y.
A given eigenfunction Y will be associated with an eigenvalue for $L^2$ and a (possibly)
different eigenvalue for $L_z$. To keep track of these, we use subscript labels l and m,
defined in the following manner:

$$L^2Y_{l,m} = k_lY_{l,m} \tag{A4-49}$$

$$L_zY_{l,m} = k_mY_{l,m} \tag{A4-50}$$


Now we make use of our “ladder” operators. It is not difficult to show (Problem A4-1)
that

$$L_zL_+Y_{l,m} = (k_m + 1)L_+Y_{l,m} \tag{A4-51}$$

This shows that operating on $Y_{l,m}$ with the step-up operator $L_+$ produces a new function
and that this new function is also an eigenfunction of $L_z$. Furthermore, the new
eigenfunction has an eigenvalue that is greater by one than the eigenvalue ($k_m$) of
the original eigenfunction. This is the reason $L_+$ is called a step-up operator. The
analogous relation for $L_-$ is

$$L_zL_-Y_{l,m} = (k_m - 1)L_-Y_{l,m} \tag{A4-52}$$

The import of Eqs. (A4-51) and (A4-52) is that a set of eigenfunctions for $L_z$ exists with
eigenvalues separated by unity.

We now use the ladder operators in another pair of useful operator relations 
(Problem A4-2): 


$$L_+L_- = L^2 - L_z^2 + L_z \tag{A4-53}$$

$$L_-L_+ = L^2 - L_z^2 - L_z \tag{A4-54}$$

It is also possible to show (Problem A4-3) that

$$L_+Y_{l,m} = C_+Y_{l,m+1} \tag{A4-55}$$

$$L_-Y_{l,m} = C_-Y_{l,m-1} \tag{A4-56}$$

where $C_+$ and $C_-$ are constants. Equations A4-55 and A4-56 tell us that the ladder
operators change the functions Y in a manner such that their eigenvalues for $L^2$ do not
change. 

We will now show that

$$k_l \ge k_m^2 \tag{A4-57}$$

We begin with the obvious relation

$$(L^2 - L_z^2)Y_{l,m} = (k_l - k_m^2)Y_{l,m} \tag{A4-58}$$

But, from Eq. A4-14,

$$L^2 - L_z^2 = L_x^2 + L_y^2 \tag{A4-59}$$

and so

$$(L_x^2 + L_y^2)Y_{l,m} = (k_l - k_m^2)Y_{l,m} \tag{A4-60}$$

Equation A4-60 is remarkable because it indicates that the functions $Y_{l,m}$ (which are
not eigenfunctions of $L_x$ or $L_y$, since these do not commute with $L_z$) are nevertheless
eigenfunctions for the combination $L_x^2 + L_y^2$.

We will now show that the eigenvalues of Eq. A4-60 cannot be negative, which
suffices to prove that $k_l \ge k_m^2$. We proceed by recognizing that the operators $L_x$ and $L_y$
do possess eigenfunctions, though they are different from the functions $Y_{l,m}$. Let us
symbolize them f and g, respectively. Then

$$L_xf_i = x_if_i \tag{A4-61}$$

$$L_yg_i = y_ig_i \tag{A4-62}$$

Because $L_x$ and $L_y$ are hermitian operators, x and y are real numbers and the function
sets {f} and {g} are complete and can be assumed to be orthonormal. We can therefore
expand the functions Y in terms of either set:

$$Y_{l,m} = \sum_i c_if_i = \sum_j d_jg_j \tag{A4-63}$$
Substituting Eqs. A4-63 into A4-60 and operating:

$$(k_l - k_m^2)Y_{l,m} = (L_x^2 + L_y^2)Y_{l,m} = L_x^2Y_{l,m} + L_y^2Y_{l,m}$$

$$= L_x^2\sum_i c_if_i + L_y^2\sum_j d_jg_j$$

$$= \sum_i c_ix_i^2f_i + \sum_j d_jy_j^2g_j \tag{A4-64}$$

We can isolate $k_l - k_m^2$ by multiplying from the left by $(Y_{l,m})^*$ and integrating. On the
left:

$$\int (Y_{l,m})^*(k_l - k_m^2)Y_{l,m}\,dv = (k_l - k_m^2)\int (Y_{l,m})^*Y_{l,m}\,dv = (k_l - k_m^2) \tag{A4-65}$$

On the right:

$$\int \sum_i c_i^*f_i^*\sum_{i'} c_{i'}x_{i'}^2f_{i'}\,dv + \int \sum_j d_j^*g_j^*\sum_{j'} d_{j'}y_{j'}^2g_{j'}\,dv$$

$$= \sum_i\sum_{i'} c_i^*c_{i'}x_{i'}^2\int f_i^*f_{i'}\,dv + \sum_j\sum_{j'} d_j^*d_{j'}y_{j'}^2\int g_j^*g_{j'}\,dv \tag{A4-66}$$

But {f} and {g} are orthonormal sets, and so Eqs. (A4-65) and (A4-66) become

$$(k_l - k_m^2) = \sum_i |c_i|^2x_i^2 + \sum_j |d_j|^2y_j^2 \tag{A4-67}$$

Since x and y are real, neither sum has negative terms, so $(k_l - k_m^2) \ge 0$. (If we resort
to a physical argument, we see the reasonableness of this since $L_x^2 + L_y^2$ is the square
of the angular momentum projection in the x, y plane.)

Knowing that $k_l \ge k_m^2$, we now consider the implications of applying the step-up
operator many times to $Y_{l,m}$ and then operating with $L_z$:

$$L_zL_+L_+\cdots L_+Y_{l,m} = (k_m + 1 + 1 + \cdots + 1)Y_{l,m+1+1+\cdots+1} = k_{m'}Y_{l,m'} \tag{A4-68}$$

Eventually we will get a $k_{m'}$ that is too large to satisfy $k_{m'}^2 \le k_l$. At that point the series
must terminate, which means that $L_+Y_{l,m} = 0$. Let us call the maximum value of $k_m$
reached in this way $k_{\max}$. Use of $L_-$ likewise gives us a lowest possible value of $k_m$,
which we call $k_{\min}$. The corresponding eigenfunctions are labeled $Y_{l,\max}$ and $Y_{l,\min}$.
Since

$$L_+Y_{l,\max} = 0 \tag{A4-69}$$

it follows that

$$L_-L_+Y_{l,\max} = 0 \tag{A4-70}$$

But, substituting for $L_-L_+$ from Eq. A4-54, this can be written

$$(L^2 - L_z^2 - L_z)Y_{l,\max} = 0 \tag{A4-71}$$

so

$$k_l - k_{\max}^2 - k_{\max} = 0 \tag{A4-72}$$

or

$$k_l = k_{\max}^2 + k_{\max} \tag{A4-73}$$

A similar treatment on $L_+L_-Y_{l,\min}$ yields

$$k_l = k_{\min}^2 - k_{\min} \tag{A4-74}$$

From these two expressions for $k_l$, we have

$$k_{\min}^2 - k_{\max}^2 = k_{\min} + k_{\max} \tag{A4-75}$$

This relation can be satisfied only if

$$k_{\max} = -k_{\min} \tag{A4-76}$$
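The step from Eq. (A4-75) to Eq. (A4-76) can be made explicit by factoring. The following is a short worked sketch that uses only the relations above together with the fact that, by construction, $k_{\max} \ge k_{\min}$.

```latex
\begin{align*}
k_{\min}^2 - k_{\max}^2 - (k_{\min} + k_{\max}) &= 0 \\
(k_{\min} + k_{\max})(k_{\min} - k_{\max}) - (k_{\min} + k_{\max}) &= 0 \\
(k_{\min} + k_{\max})(k_{\min} - k_{\max} - 1) &= 0
\end{align*}
```

The second factor could vanish only if $k_{\min} = k_{\max} + 1$, which contradicts $k_{\max} \ge k_{\min}$. The first factor must therefore vanish, which gives $k_{\max} = -k_{\min}$.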


What has been shown is that, for functions $Y_{l,m}$, the eigenvalue of $L^2$, $k_l$, determines
the maximum value of $k_m$ through $k_l = k_{\max}^2 + k_{\max}$, that the minimum value of $k_m$ is
$-k_{\max}$, and that intermediate values of $k_m$ are given by $k_{\max} - 1$, $k_{\max} - 2$, etc. If we
let the value of $k_{\max}$ be symbolized by l, then we have that $k_l = l^2 + l = l(l + 1)$, so that

$$L^2Y_{l,m} = l(l + 1)Y_{l,m} \tag{A4-77}$$

If we let the value of $k_m$ be symbolized by m, then

$$L_zY_{l,m} = mY_{l,m} \tag{A4-78}$$

For a given value of l, m can take on the values

$$m = l, l - 1, l - 2, \ldots, -l + 1, -l \tag{A4-79}$$

There are only two possible scenarios for such a set of numbers m. One set is the
integers; e.g., if l = 3, m = 3, 2, 1, 0, −1, −2, −3. The other set is the half-integers;
e.g., if l = 3/2, m = 3/2, 1/2, −1/2, −3/2. Either way, there are 2l + 1 allowed values
of m.
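The ladder argument can be illustrated with a few lines of code. This is a minimal sketch, not part of the original derivation: the function names are hypothetical, and the explicit value of the step-up constant, $|C_+| = \sqrt{l(l+1) - m(m+1)}$ in atomic units, is the standard textbook result rather than something derived above (the appendix only states that $C_+$ is a constant). Exact fractions are used so that half-integer values are handled without rounding.

```python
from fractions import Fraction
from math import sqrt

def allowed_m(l):
    """List m = l, l-1, ..., -l for a nonnegative integer or half-integer l."""
    l = Fraction(l)
    if l < 0 or (2 * l).denominator != 1:
        raise ValueError("l must be a nonnegative integer or half-integer")
    return [l - k for k in range(int(2 * l) + 1)]

def step_up_constant(l, m):
    """|C+| in L+ Y(l,m) = C+ Y(l,m+1); assumed standard result, atomic units."""
    l, m = Fraction(l), Fraction(m)
    return sqrt(l * (l + 1) - m * (m + 1))

def climb(l, m):
    """Apply L+ repeatedly, collecting m values until L+ gives zero."""
    rungs = [Fraction(m)]
    while step_up_constant(l, rungs[-1]) > 0:
        rungs.append(rungs[-1] + 1)
    return rungs

print(allowed_m(3))
print(climb(Fraction(3, 2), Fraction(-3, 2)))
```

Starting from the bottom rung, `climb` stops exactly at m = l because the constant vanishes there, which mirrors the termination condition $L_+Y_{l,\max} = 0$.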

All of the above relations have been derived from the commutation relations for the 
angular momentum operators. They hold for: 


Electron orbital angular momentum           $L^2$, $L_z$
Electron spin angular momentum              $S^2$, $S_z$
Resultant of orbital and spin momenta       $J^2$, $J_z$
Molecular rotational angular momentum       $J^2$, $J_z$
Nuclear spin angular momentum               $I^2$, $I_z$


If we seek analytical expressions for the functions $Y_{l,m}$, we can start with the assumption
that they are separable into products of two kinds of functions, one type depending
only on $\theta$, the other only on $\phi$:

$$Y_{l,m} = \Theta(\theta)\Phi(\phi) \tag{A4-80}$$

Since $L_z = -i\,\partial/\partial\phi$, since $L_zY_{l,m} = mY_{l,m}$, and furthermore since $Y_{l,m}$ must be single
valued, it follows that $\Phi(\phi) = (1/\sqrt{2\pi})\exp(im\phi)$, with $m = 0, \pm 1, \pm 2, \ldots$. Note that
m must be an integer. We have the curious result that, if $Y_{l,m}$ is separable into $\theta$ and $\phi$
parts, the eigenvalues for $L_z$ cannot belong to the half-integer set mentioned above.

The separable, analytic functions $Y_{l,m}$ with l an integer are the spherical harmonics
described in Chapter 4 and long known to classical physics. Indeed, the symbol $Y_{l,m}$
has come to stand for those functions. For systems involving half-integer l and m
values, no analytical functions of the usual sort can be written. Instead, matrices and
vectors are used that manifest the correct relationships. Thus, for the spin of a single
electron, Pauli used the following representation:

$$\alpha = \begin{pmatrix} 1 \\ 0 \end{pmatrix}, \quad \beta = \begin{pmatrix} 0 \\ 1 \end{pmatrix}, \quad \langle\alpha|\alpha\rangle = \begin{pmatrix} 1 & 0 \end{pmatrix}\begin{pmatrix} 1 \\ 0 \end{pmatrix} = 1$$

$$S_z = \frac{1}{2}\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \quad S_x = \frac{1}{2}\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad S_y = \frac{1}{2}\begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}$$

A4-5.1 Problems 

A4-1. Prove Equation A4-51. 

A4-2. Prove Equation A4-53. 

A4-3. Prove Equation A4-55. What does this equation imply about $L^2$ and $L_+$?

A4-4. Evaluate $\langle Y_{l,m}|L_x|Y_{l,m}\rangle$ using only operator relations from this appendix.

A4-5. Demonstrate that the Pauli spin matrices and vectors satisfy the following
relations:

a) $\langle\alpha|\beta\rangle = 0$
b) $S_+\beta = \alpha$
c) $S_+\alpha = 0$
d) $[S_x, S_y] = iS_z$

A4-6. Use the Pauli spin matrices to evaluate $S^2\alpha$.


Reference 


[1] H. Eyring, J. Walter, and G. E. Kimball, Quantum Chemistry. Wiley, New York, 
1944. 
The Pairing Theorem¹


A5-1 Pairing of Roots and Relation Between Coefficients
for Alternant Systems


The HMO assumptions are

$$H_{ij} = \begin{cases} \alpha & \text{if } i = j \\ \beta & \text{if } i \ne j \text{ are bonded together} \\ 0 & \text{if } i \ne j \text{ are not bonded together} \end{cases} \tag{A5-1}$$

$$S_{ij} = \delta_{ij} \tag{A5-2}$$

$$\phi = \sum_i c_i\chi_i \tag{A5-3}$$

where $\chi_i$ is a $2p_\pi$ AO on carbon i.

An alternant hydrocarbon can be labeled with asterisks to demonstrate the existence 
of two sets of carbon centers in the molecule such that no two atoms in the same set 
are nearest neighbors (see Section 8-9). The following discussion pertains to alternant 
systems. 

The simultaneous equations leading to Hückel energies and coefficients are of the 
form 


$$c_ix + c_j + c_k + c_l + \cdots = 0 \tag{A5-4}$$

where $x = (\alpha - E)/\beta$. Atoms j, k, l must be bonded to atom i if $c_j$, $c_k$, $c_l$ are to be
unequal to zero. Hence, atoms j, k, and l belong to one set of atoms, and atom i belongs
to the other set.

If we have already found a value of x and a set of coefficients satisfying the simultaneous
equations (A5-4), it is easy to show that these equations will also be satisfied if
we insert −x and also reverse the signs of the coefficients for one set of centers or the
other. If we reverse the coefficient signs for the set j, k, l, we obtain, on the left-hand
side

$$c_i(-x) - c_j - c_k - c_l - \cdots \tag{A5-5}$$

which is the negative of Eq. A5-4 and hence still equals zero. If we reverse the sign of
$c_i$, we have

$$(-c_i)(-x) + c_j + c_k + c_l + \cdots \tag{A5-6}$$

which is identical to Eq. A5-4.

¹ See Coulson and Rushbrooke [1].
This proves that each root of an alternant hydrocarbon at x (≠ 0) has a mate at −x
and that their associated coefficients differ only in sign between one or the other sets of
atoms. Note that, if x = 0, the $c_i$ term vanishes, leaving coefficients for only one set of
centers. Reversing all signs in this case corresponds to multiplying the entire MO by
−1, which does not generate a new (linearly independent) function. Thus, it is possible
for an alternant system to have a single, unpaired root at x = 0. It is necessary for odd
alternants to have such a root. An even alternant may have a root at x = 0, but, if it has 
one such root, it must have another since, in the end, there must be an even number of 
roots. 
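The pairing argument can be checked numerically on a small case. The sketch below uses butadiene, with the root x = −1.6180 and the coefficients tabulated for its lowest MO in Appendix 6. It evaluates the left-hand side of Eq. (A5-4) at every atom, first for the tabulated MO and then for the proposed mate at +x with the signs on atoms 2 and 4 reversed. The function names are hypothetical, and the residuals are only as small as the four-figure tabulated values allow.

```python
def residuals(bonds, coeffs, x):
    """Left-hand side of Eq. (A5-4), c_i*x + (sum of bonded c_j), for each atom i."""
    n = len(coeffs)
    neighbors = {i: [] for i in range(1, n + 1)}
    for a, b in bonds:
        neighbors[a].append(b)
        neighbors[b].append(a)
    return [coeffs[i - 1] * x + sum(coeffs[j - 1] for j in neighbors[i])
            for i in range(1, n + 1)]

def paired_mo(coeffs, flipped_set):
    """Reverse the sign of the coefficients on one set of centers."""
    return [-c if i in flipped_set else c
            for i, c in enumerate(coeffs, start=1)]

bonds = [(1, 2), (2, 3), (3, 4)]           # butadiene chain
x = -1.6180                                # tabulated root of the lowest MO
c = [0.3718, 0.6015, 0.6015, 0.3718]       # tabulated coefficients
c_mate = paired_mo(c, {2, 4})              # flip one set of the alternant

max_res = max(abs(r) for r in residuals(bonds, c, x))
max_res_mate = max(abs(r) for r in residuals(bonds, c_mate, -x))
max_res_unflipped = max(abs(r) for r in residuals(bonds, c, -x))
print(max_res, max_res_mate, max_res_unflipped)
```

The first two residuals are zero to within rounding of the table, while the third (using −x without reversing any signs) is not, which is the content of the theorem.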


A5-2 Demonstration That Electron Densities Are Unity in
Ground States of Neutral Alternant Hydrocarbons


From n AOs result n MOs. The AOs as well as the MOs can be assumed normalized 
and (in the HMO method) orthogonal. If each AO contains one electron, the electron 
density at each AO is unity. If these AOs are combined to form MOs, and each MO has 
one electron, each AO electron density would still be unity, since the set of all singly 
occupied MOs is just a unitary transformation of the set of all singly occupied AOs. 
(The matrix equivalent of these statements is 


$$C^{\dagger}C = 1 = CC^{\dagger} \tag{A5-7}$$


The left equality means that the sum of squares (absolute) of coefficients over all atoms 
in one MO is unity, so the MO is normalized. The right equality means that the sum of 
squares of coefficients over one atom in all MOs is unity.) 

For an alternant hydrocarbon, however, the squares of coefficients in an MO at
$E = \alpha + k\beta$ are identical to those in the MO at $E = \alpha - k\beta$. Therefore, no change in
electron density will result if each electron in the upper half of our MO energy spectrum 
is shifted to its lower-energy mate. Thus, the resulting state, which is the neutral ground 
state, still has unit electron density at each AO. 
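The two steps of this argument can be followed numerically for the allyl radical. In the sketch below the bonding and nonbonding coefficients are the values tabulated in Appendix 6, and the antibonding MO is obtained from the bonding one by the pairing theorem (sign reversed on atom 2). The function name is hypothetical.

```python
def densities(occupations, mos):
    """q_i = sum over MOs of (number of electrons) * (coefficient on atom i)^2."""
    n_atoms = len(mos[0])
    return [sum(n * mo[i] ** 2 for n, mo in zip(occupations, mos))
            for i in range(n_atoms)]

allyl_mos = [
    (0.5000, 0.7071, 0.5000),    # x = -1.4142, bonding
    (0.7071, 0.0000, -0.7071),   # x = 0, nonbonding
    (0.5000, -0.7071, 0.5000),   # x = +1.4142, mate of the bonding MO
]

q_one_each = densities((1, 1, 1), allyl_mos)   # every MO singly occupied
q_ground = densities((2, 1, 0), allyl_mos)     # neutral ground state
print(q_one_each)
print(q_ground)
```

Both occupation patterns give densities of unity (to the precision of the table), because moving the electron from the antibonding MO to its bonding mate does not change any squared coefficient.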


A5-3 A Simple Method for Generating Nonbonding MOs


An immediate consequence of Eq. A5-4 is that the coefficients for any nonbonding MO, 
for which x = 0 by definition, satisfy the following simple rule: The coefficients on all 
the atoms attached to any common atom sum to zero. Consider the nonbonding MOs 


below:

[Structure diagrams of several nonbonding MOs, with the coefficient marked at each atom.]

It is clear that, no matter which atom one chooses as reference, the sum of coefficients 
of attached atoms is zero. 
It is possible to use this observation to generate nonbonding MOs without the aid
of a computer or tabulation. For odd-alternant systems such an MO is guaranteed to 
exist, and the recipe is especially easy, so we start with these. The procedure is: 


1. Divide the centers into asterisked and unasterisked sets, as described in Chapter 8. 
One set will have fewer centers. Set all of the coefficients in this set equal to zero. 


2. Choose one of the nonzero sites and set its coefficient to be x. Then work around 
the molecule, setting other coefficients to values needed to satisfy the “sum to zero" 
rule. 


3. When finished, evaluate x by requiring that the sum of squares of coefficients equal 
unity. 


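For example, consider the naphthyl system:

[Structure diagram of the 11-center naphthyl system, with the asterisked centers marked.]

The asterisked centers are more numerous, so we set the others to zero.

[Structure diagram with zero coefficients on the unasterisked centers.]

Now we set one of the nonzero sites equal to x. Let us use the one marked with an
arrow. Then we move around the molecule, setting the other values.

[Sequence of structure diagrams in which the nonzero coefficients are filled in step by step; the completed set has magnitudes x on four centers, 2x on one center, and 3x on one center.]

Now we set the sum of squares to unity: 17x² = 1; x = 0.242. The final nonbonding
MO is

[Structure diagram of the final nonbonding MO: coefficients of magnitude 0.242 on four centers, 0.484 and −0.727 on the other two nonzero centers, and 0 on the remaining centers.]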


If an even alternant has a nonbonding MO, then it must have a pair of them. Each 
one corresponds to setting a different subset of coefficients equal to zero. 

This simple recipe would be little more than a parlor trick were it not for the fact 
that the nonbonding MO is often very important in determining a molecule's chemical 
or physical properties. For example, the neutral molecule used above as an example
is alternant, hence has π-electron densities of one at every carbon. However, the spin
density is controlled by the nonbonding MO, since that is where the unpaired electron 
is, so the ESR splitting pattern should correspond to a series of coupling constants 
proportional to the squares of these nonbonding MO coefficients (if we ignore negative 
spin density). 

If the neutral radical is ionized to the cation, the electron is lost from the nonbonding 
MO, and so the deficiency of electronic density (i.e., the positive charge) appears on 
carbons with nonzero coefficients, in proportion to the squares of the coefficients. In
the case of our previous example, the cation charge distribution is predicted to be

[Structure diagram of the cation: charges of +0.53 and +0.23 on the two centers with the largest nonbonding MO coefficients, +0.06 on each of the other four nonzero centers, and zero elsewhere.]


If we were to modify the molecule chemically by attaching a methyl group (which 
donates π electron charge via hyperconjugation) or by substituting for carbon a more
electronegative nitrogen atom, we could hope to influence the ease of ionization. But 
it is apparent that such modifications will have greatest effect if they occur at sites 
where the largest changes in electron density occur upon ionization, i.e., at positions 
11 and 1. Modifying the molecule at a position where a zero coefficient exists in 
the nonbonding MO (i.e., positions 4, 5, or 7) should have little effect on the ease of 
carbocation formation. 
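The recipe of this section, and the cation charges that follow from it, can be sketched in code. The version below is only an illustration under stated assumptions: the connectivity is taken to be that of 2-naphthylmethyl with ring positions 1 to 8, ring-fusion carbons 9 and 10, and the exocyclic carbon 11 attached to position 2, a labeling chosen here because it reproduces 17x² = 1 and the zero coefficients at positions 4, 5, and 7 quoted in the text. The function name is hypothetical. Instead of walking around the molecule by hand, the "sum to zero" conditions are solved by exact Gauss-Jordan elimination.

```python
from fractions import Fraction
from math import sqrt

def nonbonding_mo(bonds, n_atoms):
    """NBMO coefficients {atom: value} for an odd alternant with one NBMO."""
    nbrs = {a: set() for a in range(1, n_atoms + 1)}
    for a, b in bonds:
        nbrs[a].add(b)
        nbrs[b].add(a)
    # Step 1: divide the centers into two sets by breadth-first two-colouring.
    colour, queue = {1: 0}, [1]
    while queue:
        a = queue.pop(0)
        for b in nbrs[a]:
            if b not in colour:
                colour[b] = 1 - colour[a]
                queue.append(b)
            elif colour[b] == colour[a]:
                raise ValueError("system is not alternant")
    set0 = sorted(a for a in colour if colour[a] == 0)
    set1 = sorted(a for a in colour if colour[a] == 1)
    big, small = (set0, set1) if len(set0) > len(set1) else (set1, set0)
    # Step 2: one "sum to zero" equation per atom of the smaller set.
    rows = [[Fraction(1 if b in nbrs[a] else 0) for b in big] for a in small]
    pivots, r = [], 0
    for col in range(len(big)):
        pr = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pr is None:
            continue
        rows[r], rows[pr] = rows[pr], rows[r]
        piv = rows[r][col]
        rows[r] = [v / piv for v in rows[r]]
        for i in range(len(rows)):
            if i != r and rows[i][col] != 0:
                f = rows[i][col]
                rows[i] = [vi - f * vr for vi, vr in zip(rows[i], rows[r])]
        pivots.append(col)
        r += 1
    free = [c for c in range(len(big)) if c not in pivots]
    if len(free) != 1:
        raise ValueError("expected exactly one nonbonding MO")
    sol = [Fraction(0)] * len(big)
    sol[free[0]] = Fraction(1)              # plays the role of "x" in the text
    for row_index, col in enumerate(pivots):
        sol[col] = -rows[row_index][free[0]]
    # Step 3: normalize so that the sum of squares is unity.
    norm = sqrt(sum(float(v) ** 2 for v in sol))
    coeffs = {a: 0.0 for a in small}
    coeffs.update({a: float(v) / norm for a, v in zip(big, sol)})
    return coeffs

# Assumed 2-naphthylmethyl connectivity (see the lead-in for the labeling).
BONDS = [(1, 2), (2, 3), (3, 4), (4, 10), (10, 5), (5, 6), (6, 7),
         (7, 8), (8, 9), (9, 1), (9, 10), (2, 11)]
nbmo = nonbonding_mo(BONDS, 11)
cation_charges = {a: round(c * c, 2) for a, c in sorted(nbmo.items())}
print(cation_charges)
```

The overall sign of the MO is arbitrary, so only the squares (the predicted cation charges, or equivalently the unpaired spin densities in the radical) should be compared with the values in the text.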


Reference 


[1] C. A. Coulson and G. S. Rushbrooke, Proc. Cambridge Phil. Soc. 36, 193 (1940). 
Hückel Molecular Orbital Energies,
Coefficients, Electron Densities, 
and Bond Orders for Some Simple 
Molecules 


Each molecule is labeled as alternant or nonalternant. For alternants, only the occupied 
MO data are tabulated since the remainder may be generated by use of the pairing 
theorem (see Appendix 5). Bond orders are tabulated for only one bond from each 
symmetry-equivalent set in a molecule. 


x       HMO root = (α − E)/β
n       number of electrons in MO when molecule is in neutral ground state
c_i     LCAO-MO coefficient of AO at atom i
q_i     π-electron density on atom i
p_ij    π-bond order between atoms i and j
E_π     total π energy of the molecule = sum of π-electron energies

Molecules in this tabulation are grouped according to the number of centers in the
conjugated system. (In all cases, it is assumed that the system is planar and undistorted
which, in some cases, is not correct.)
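The tabulated quantities are related to one another in a simple way, which the following sketch illustrates using the butadiene entries from this appendix. The function names are hypothetical. The formulas used are the usual Hückel definitions, $q_i = \sum n c_i^2$ and $p_{ij} = \sum n c_i c_j$ summed over MOs, and, because $x = (\alpha - E)/\beta$, each electron contributes $\alpha - x\beta$ to the total π energy.

```python
# Occupied MOs of butadiene: (n, x, coefficients c1..c4), as tabulated.
occupied = [
    (2, -1.6180, (0.3718, 0.6015, 0.6015, 0.3718)),
    (2, -0.6180, (0.6015, 0.3718, -0.3718, -0.6015)),
]

def density(i):
    """pi-electron density on atom i (1-based)."""
    return sum(n * c[i - 1] ** 2 for n, _, c in occupied)

def bond_order(i, j):
    """pi-bond order between atoms i and j (1-based)."""
    return sum(n * c[i - 1] * c[j - 1] for n, _, c in occupied)

def pi_energy():
    """Return (a, b) such that E_pi = a*alpha + b*beta."""
    return (sum(n for n, _, _ in occupied),
            sum(-n * x for n, x, _ in occupied))

print(density(1), bond_order(1, 2), bond_order(2, 3), pi_energy())
```

The results agree with the tabulated q = 1.0, p12 = 0.8944, p23 = 0.4472, and Eπ = 4α + 4.472β to within the rounding of the four-figure coefficients.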


Index of Systems Tabulated

Hückel Molecular Orbitals

Ethylene
Allyl
Cyclopropenyl
Butadiene
Cyclobutadiene
2-Allylmethyl
Methylene cyclopropene
Pentadienyl
Cyclopentadienyl
Cyclobutadienyl methyl
Hexatriene
Butadiene-2,3-bimethyl
Benzene
Fulvene
Heptatrienyl
Benzyl
Cycloheptatrienyl
Octatetraene
Cyclooctatetraene
Benz-cyclopentadienyl
Azulene
Naphthalene
Two Centers


Ethylene (alternant) C2H4

Atoms numbered 1, 2. All q = 1.0, p12 = 1.0, Eπ = 2α + 2β

n    x          c1        c2
2    -1.000     0.7071    0.7071


Three Centers 


Allyl radical (alternant) C3H5

Atoms numbered 1, 2, 3 along the chain. All q = 1.0, p12 = 0.707, Eπ = 3α + 2.8284β

n    x          c1        c2        c3
2    -1.4142    0.5000    0.7071    0.5000
1     0.0000    0.7071    0.0000   -0.7071


Cyclopropenyl radical (nonalternant) C3H3

[Structure diagram with atom numbering 1 to 3.]

All q = 1.0, p12 = 0.5, Eπ = 3α + 3.0000β

n      x          c1        c2        c3
2      -2.0000    0.5774    0.5774    0.5774
1/2     1.0000   -0.8165    0.4082    0.4082
1/2     1.0000    0.0000    0.7071   -0.7071
Four Centers


Butadiene (alternant) C4H6

All q = 1.0, p12 = 0.8944, p23 = 0.4472, Eπ = 4α + 4.4721β

n    x          c1        c2        c3        c4
2    -1.6180    0.3718    0.6015    0.6015    0.3718
2    -0.6180    0.6015    0.3718   -0.3718   -0.6015


Cyclobutadiene (alternant) C4H4

[Structure diagram with atom numbering 1 to 4.]

All q = 1.0, p12 = 0.5, Eπ = 4α + 4.000β

n    x          c1        c2        c3        c4
2    -2.0000    0.5000    0.5000    0.5000    0.5000
1     0.0000    0.5000    0.5000   -0.5000   -0.5000
1     0.0000    0.5000   -0.5000   -0.5000    0.5000


2-Allylmethyl (alternant) C4H6

[Structure diagram with atom numbering 1 to 4.]

All q = 1.0, p12 = 0.5774, Eπ = 4α + 3.4641β

n    x          c1        c2        c3        c4
2    -1.7320    0.7071    0.4082    0.4082    0.4082
1     0.0       0.0000    0.7071   -0.7071    0.0000
1     0.0       0.0000    0.4082    0.4082   -0.8165


Methylene cyclopropene (nonalternant) C4H4

[Structure diagram with atom numbering 1 to 4.]

Eπ = 4α + 4.9624β, p12 = 0.4527, p23 = 0.8176, p14 = 0.7583

n    x          c1        c2        c3        c4
2    -2.1701    0.6116    0.5227    0.5227    0.2818
2    -0.3111    0.2536   -0.3682   -0.3682    0.8152
0     1.0000    0.0000    0.7071   -0.7071    0.0000
0     1.4812    0.7494   -0.3020   -0.3020   -0.5059
q_i =           0.8768    0.8176    0.8176    1.4881
Five Centers


Pentadienyl radical (alternant) C5H7

Atoms numbered 1 to 5 along the chain. All q = 1.0, p12 = 0.7887, p23 = 0.5774, Eπ = 5α + 5.4641β

n    x          c1        c2        c3        c4        c5
2    -1.7320    0.2887    0.5000    0.5774    0.5000    0.2887
2    -1.0000    0.5000    0.5000    0.0000   -0.5000   -0.5000
1     0.0000    0.5774    0.0000   -0.5774    0.0000    0.5774


Cyclopentadienyl radical (nonalternant) C5H5

[Structure diagram with atom numbering 1 to 5.]

All q = 1.0, p12 = 0.5854, Eπ = 5α + 5.8541β

n      x          c1        c2        c3        c4        c5
2      -2.0000    0.4472    0.4472    0.4472    0.4472    0.4472
3/2    -0.6180    0.6325    0.1954   -0.5117   -0.5117    0.1954
3/2    -0.6180    0.0000   -0.6015   -0.3718    0.3718    0.6015
0       1.6180    0.6325   -0.5117    0.1954    0.1954   -0.5117
0       1.6180    0.0000    0.3718   -0.6015    0.6015   -0.3718


Cyclobutadienylmethyl radical (alternant) C5H5

[Structure diagram with atom numbering 1 to 5.]

All q = 1.0, p12 = 0.3574, p23 = 0.6101, p15 = 0.8628, Eπ = 5α + 5.5959β

n    x          c1        c2        c3        c4        c5
2    -2.1358    0.5573    0.4647    0.4351    0.4647    0.2610
2    -0.6622   -0.4351    0.1845    0.5573    0.1845   -0.6572
1     0.0000    0.0000   -0.7071    0.0000    0.7071    0.0000
Six Centers


Hexatriene (alternant) C6H8

Atoms numbered 1 to 6 along the chain. All q = 1.0, p12 = 0.8711, p23 = 0.4834,
p34 = 0.7848, Eπ = 6α + 6.9879β

n    x          c1        c2        c3        c4        c5        c6
2    -1.8019    0.2319    0.4179    0.5211    0.5211    0.4179    0.2319
2    -1.2470    0.4179    0.5211    0.2319   -0.2319   -0.5211   -0.4179
2    -0.4450    0.5211    0.2319   -0.4179   -0.4179    0.2319    0.5211


Butadiene-2,3-bimethyl (alternant) C6H8

[Structure diagram with atom numbering 1 to 6.]

All q = 1.0, p12 = 0.6667, p23 = 0.3333, Eπ = 6α + 6β

n    x          c1        c2        c3        c4        c5        c6
2    -2.0000    0.2887    0.5774    0.5774    0.2887    0.2887    0.2887
2    -1.0000    0.4082    0.4082   -0.4082   -0.4082    0.4082   -0.4082
2     0.0000   -0.5000    0.0000    0.0000   -0.5000    0.5000    0.5000


Benzene (alternant) C6H6

[Structure diagram with atom numbering 1 to 6.]

All q = 1.0, p12 = 0.6667, Eπ = 6α + 8β

n    x          c1        c2        c3        c4        c5        c6
2    -2.0000    0.4082    0.4082    0.4082    0.4082    0.4082    0.4082
2    -1.0000    0.0000    0.5000    0.5000    0.0000   -0.5000   -0.5000
2    -1.0000    0.5774    0.2887   -0.2887   -0.5774   -0.2887    0.2887


Fulvene (nonalternant) C6H6

[Structure diagram with atom numbering 1 to 6.]

Eπ = 6α + 7.4659β, p12 = 0.7779, p23 = 0.5202, p45 = 0.4491, p56 = 0.7586

n    x          c1        c2        c3        c4        c5        c6
2    -2.1149    0.4294    0.3851    0.3851    0.4294    0.5230    0.2473
2    -1.0000    0.0000    0.5000    0.5000    0.0000   -0.5000   -0.5000
2    -0.6180    0.6015    0.3718   -0.3718   -0.6015    0.0000    0.0000
0     0.2541   -0.3505    0.2795    0.2795   -0.3505   -0.1904    0.7495
0     1.6180   -0.3718    0.6015   -0.6015    0.3718    0.0000    0.0000
0     1.8608   -0.4390    0.1535    0.1535   -0.4390    0.6635   -0.3566
q_i =           1.0923    1.0730    1.0730    1.0923    1.0470    0.6223
Seven Centers


Heptatrienyl radical (alternant) C7H9

All q = 1.0, Eπ = 7α + 8.0547β
p12 = 0.8155, p23 = 0.5449, p34 = 0.6533

n    x          c1        c2        c3        c4        c5        c6        c7
2    -1.8478    0.1913    0.3536    0.4619    0.5000    0.4619    0.3536    0.1913
2    -1.4142    0.3536    0.5000    0.3536    0.0000   -0.3536   -0.5000   -0.3536
2    -0.7654    0.4619    0.3536   -0.1913   -0.5000   -0.1913    0.3536    0.4619
1     0.0000   -0.5000    0.0000    0.5000    0.0000   -0.5000    0.0000    0.5000

Benzyl radical (alternant) C7H7

[Structure diagram with atom numbering 1 to 7.]

All q = 1.0, p12 = 0.5226, p23 = 0.7050,
p34 = 0.6350, p17 = 0.6350, Eπ = 7α + 8.7206β

n    x          c1        c2        c3        c4        c5        c6        c7
2    -2.1010    0.5000    0.4063    0.3536    0.3366    0.3536    0.4063    0.2380
2    -1.2593   -0.5000   -0.1163    0.3536    0.5615    0.3536   -0.1163   -0.3970
2    -1.0000    0.0000    0.5000    0.5000    0.0000   -0.5000   -0.5000    0.0000
1     0.0000    0.0000   -0.3780    0.0000    0.3780    0.0000   -0.3780    0.7560


Cycloheptatrienyl radical (nonalternant) C7H7

[Structure diagram with atom numbering 1 to 7.]

All q = 1.0, p12 = 0.6102, Eπ = 7α + 8.5429β

n      x          c1        c2        c3        c4        c5        c6        c7
2      -2.0000    0.3780    0.3780    0.3780    0.3780    0.3780    0.3780    0.3780
2      -1.2470   -0.5345   -0.3333    0.1189    0.4816    0.4816    0.1189   -0.3333
2      -1.2470    0.0000   -0.4179   -0.5211   -0.2319    0.2319    0.5211    0.4179
1/2     0.4450    0.5345   -0.1189   -0.4816    0.3333    0.3333   -0.4816   -0.1189
1/2     0.4450    0.0000    0.5211   -0.2319   -0.4180    0.4180    0.2319   -0.5211
0       1.8019   -0.5345    0.4816   -0.3333    0.1189    0.1189   -0.3333    0.4816
0       1.8019    0.0000    0.2319   -0.4179    0.5211   -0.5211    0.4179   -0.2319
Eight Centers


Octatetraene (alternant) C8H10

Atoms numbered 1 to 8 along the chain. All q = 1.0, p12 = 0.8621, p23 = 0.4948,
p34 = 0.7581, p45 = 0.5288, Eπ = 8α + 9.5175β

n    x          c1        c2        c3        c4        c5        c6        c7        c8
2    -1.8794    0.1612    0.3030    0.4082    0.4642    0.4642    0.4082    0.3030    0.1612
2    -1.5321    0.3030    0.4642    0.4082    0.1612   -0.1612   -0.4082   -0.4642   -0.3030
2    -1.0000   -0.4082   -0.4082    0.0000    0.4082    0.4082    0.0000   -0.4082   -0.4082
2    -0.3473    0.4642    0.1612   -0.4082   -0.3030    0.3030    0.4082   -0.1612   -0.4642


Cyclooctatetraene (alternant) C8H8

[Structure diagram with atom numbering 1 to 8.]

All q = 1.0, p12 = 0.6035, Eπ = 8α + 9.6568β

n    x          c1        c2        c3        c4        c5        c6        c7        c8
2    -2.0000    0.3536    0.3536    0.3536    0.3536    0.3536    0.3536    0.3536    0.3536
2    -1.4142    0.3536    0.0000   -0.3536   -0.5000   -0.3536    0.0000    0.3536    0.5000
2    -1.4142    0.3536    0.5000    0.3536    0.0000   -0.3536   -0.5000   -0.3536    0.0000
1     0.0000    0.3536    0.3536   -0.3536   -0.3536    0.3536    0.3536   -0.3536   -0.3536
1     0.0000    0.3536   -0.3536   -0.3536    0.3536    0.3536   -0.3536   -0.3536    0.3536


Nine Centers 


Benzcyclopentadienyl radical (nonalternant) C9H7

[Structure diagram with atom numbering 1 to 9.]

p12 = 0.6592, p49 = 0.6071, p56 = 0.6630,
p18 = 0.4790, p45 = 0.6363, p89 = 0.5118,
Eπ = 9α + 11.8757β

n    x          c1        c2        c3        c4        c5        c6        c7        c8        c9
2    -2.3226    0.3203    0.2758    0.3203    0.2988    0.2259    0.2259    0.2988    0.4681    0.4681
2    -1.5450   -0.3114   -0.4031   -0.3114    0.2689    0.4934    0.4934    0.2689   -0.0780   -0.0780
2    -1.1935    0.2992    0.0000   -0.2992   -0.4841   -0.2207    0.2207    0.4841    0.3571   -0.3571
2    -0.7293   -0.2054   -0.5634   -0.2054    0.0035   -0.3454   -0.3454    0.0035    0.4136    0.4136
1    -0.2950    0.5428    0.0000   -0.5428    0.3355    0.2591   -0.2591   -0.3355    0.1601   -0.1601
0     0.9016    0.1548   -0.3434    0.1548   -0.5424    0.2852    0.2852   -0.5424    0.2038    0.2038
0     1.2950   -0.2591    0.0000    0.2591   -0.1601    0.5428   -0.5428    0.1601    0.3355   -0.3355
0     1.6052    0.4840   -0.5711    0.4840    0.1884   -0.0699   -0.0699    0.1884   -0.2495   -0.2495
0     2.1935   -0.2207    0.0000    0.2207    0.3571   -0.2992    0.2992   -0.3571    0.4841   -0.4841
q_i =           0.9571    1.1119    0.9571    0.9218    0.9920    0.9920    0.9218    1.0730    1.0730
Ten Centers


Azulene (nonalternant) C10H8

p12 = 0.6560, p4,10 = 0.5858, p56 = 0.6389,
p19 = 0.5956, p45 = 0.6640, p9,10 = 0.4009,
Eπ = 10α + 13.3635β

n    x          c1        c2        c3        c4        c5        c6        c7        c8        c9        c10
2    -2.3103    0.3233    0.2799    0.3233    0.2886    0.1998    0.1730    0.1998    0.2886    0.4670    0.4670
2    -1.6516   -0.2678   -0.3243   -0.2678    0.1909    0.4333    0.5247    0.4333    0.1909   -0.1180   -0.1180
2    -1.3557    0.2207    0.0000   -0.2207   -0.4841   -0.3571    0.0000    0.3571    0.4841    0.2992   -0.2992
2    -0.8870   -0.2585   -0.5829   -0.2585    0.2186   -0.1598   -0.3603   -0.1598    0.2186    0.3536    0.3536
2    -0.4773    0.5428    0.0000   -0.5428    0.1601    0.3355    0.0000   -0.3355   -0.1601    0.2591   -0.2591
0     0.4004   -0.0632    0.3158   -0.0632    0.4699    0.1023   -0.5109    0.1023    0.4699   -0.2904   -0.2904
0     0.7376   -0.2992    0.0000    0.2992   -0.3571    0.4841    0.0000   -0.4841    0.3571    0.2207   -0.2207
0     1.5792    0.4364   -0.5527    0.4364   -0.0844    0.2697   -0.3416    0.2697   -0.0844   -0.1365   -0.1365
0     1.8692   -0.2500    0.2675   -0.2500   -0.3233    0.4045   -0.4328    0.4045   -0.3233    0.1998    0.1998
0     2.0953   -0.2591    0.0000    0.2591    0.3355   -0.1601    0.0000    0.1601   -0.3355    0.5428   -0.5428
q_i =           1.1729    1.0466    1.1729    0.8550    0.9864    0.8700    0.9864    0.8550    1.0274    1.0274


Naphthalene (alternant) C10H8

[Structure diagram with atom numbering 1 to 10.]

All q = 1.0, Eπ = 10α + 13.6832β
p12 = 0.7246, p23 = 0.6032, p19 = 0.5547, p9,10 = 0.5182

n    x          c1        c2        c3        c4        c5        c6        c7        c8        c9        c10
2    -2.3028    0.3006    0.2307    0.2307    0.3006    0.3006    0.2307    0.2307    0.3006    0.4614    0.4614
2    -1.6180    0.2629    0.4253    0.4253    0.2629   -0.2629   -0.4253   -0.4253   -0.2629    0.0000    0.0000
2    -1.3028    0.3996    0.1735   -0.1735   -0.3996   -0.3996   -0.1735    0.1735    0.3996    0.3470   -0.3470
2    -1.0000    0.0000   -0.4082   -0.4082    0.0000    0.0000   -0.4082   -0.4082    0.0000    0.4082    0.4082
2    -0.6180    0.4253    0.2629   -0.2629   -0.4253    0.4253    0.2629   -0.2629   -0.4253    0.0000    0.0000
Derivation of the Hartree-Fock
Equation


This appendix is divided into two parts. In the first section we develop the formula for
the expectation value $\bar{E} = \langle\psi|\hat{H}|\psi\rangle$ for the case in which $\psi$ is a single determinantal
wavefunction over MOs. In the second section we derive the Hartree-Fock equation
by requiring $\bar{E}$ to be stationary with respect to variations in $\psi$.


A7-1 The Expansion of $\bar{E}$ in Terms of Integrals over MOs


We limit discussion to the case in which $\psi$ is a single, closed-shell determinant. We
will develop our arguments by referring to a four-electron example:

$$\psi_4 = (4!)^{-1/2}\,|\phi_1(1)\bar{\phi}_1(2)\phi_2(3)\bar{\phi}_2(4)| \tag{A7-1}$$

Recall that this is the shorthand formula for a Slater determinant. Each $\phi$ is a normalized
MO, the MOs are assumed to be orthogonal, and a bar signifies that an electron possesses
$\beta$ spin. As we develop our arguments within the context of $\psi_4$, we will generalize them
to apply to the general 2n electron closed-shell wavefunction

$$\psi_{2n} = [(2n)!]^{-1/2}\,|\phi_1(1)\bar{\phi}_1(2)\phi_2(3)\bar{\phi}_2(4)\cdots\phi_n(2n-1)\bar{\phi}_n(2n)| \tag{A7-2}$$


When $\psi_4$ is expanded according to the rule for determinants (Appendix 2), we obtain
4! products. We note the following features of the expanded form.


1. There is one product, occurring with coefficient +1, which is identical to the product
appearing in the shorthand form of Eq. (A7-1). We refer to this as the “leading
term.”


2. An equivalent way of expressing a Slater determinant is via the expression (for $\psi_4$)

$$\psi_4 = (4!)^{-1/2}\sum_P (-1)^p P\left(\phi_1(1)\bar{\phi}_1(2)\phi_2(3)\bar{\phi}_2(4)\right) \tag{A7-3}$$

Here P stands for all the sequences of permutations of electron labels that lead
to different products (i.e., P is a permutation operator), and p is the number of
pairwise permutations in a given sequence. For $\psi_4$ there are 4! sequences P, the
simplest being “no permutations” (hence, p = 0), which produces the leading term.
Then there are single permutations, such as $P_{1,2}$ (with p = 1), which produces the
term $-\phi_1(2)\bar{\phi}_1(1)\phi_2(3)\bar{\phi}_2(4)$. There are also double permutations, etc. According
to Eq. (A7-3), any term differing from the leading term by an odd number of
permutations will appear with coefficient −1. We will be particularly concerned
with products that differ from the leading term by a single permutation.


3. A single permutation may be made to occur between electrons in MOs with the
same spins or different spins. In the latter case, two electrons in the singly permuted
product will disagree in spin with their counterparts in the leading term. In the
former case, no such spin disagreement will exist.


4. Terms also appear in $\psi_4$ corresponding to more than a single permutation of electron
indices.


Itis useful to pick a representative example of each type of product mentioned above. 
For y4, we have the following: 


Leading term: $1 (11 (2)62(3)$2(4) 

Singly permuted term: — $4 (1)61 (4)¢2(3)o2(2); Spin agreement: (P5 4) 
Singly permuted term: — $4 (2)01(0 02 (3) 95 (4); Spin disagreement: ( Pj ?) 
Doubly permuted term: $1 (2)1(4)¢2(3)¢2(1); (Pi 2 Pia) 
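
The classification of products described above can be checked by brute force. The following is a minimal sketch, not part of the original derivation: it enumerates the 4! products of $\psi_4$, attaches the sign $(-1)^p$, and flags the singly permuted products that preserve spin agreement. The names `ORBITALS`, `SPINS`, `expand_determinant`, and `as_product` are illustrative choices.

```python
from itertools import permutations

# Spin-orbitals of the leading term, in determinant order. The string labels
# and the SPINS list are illustrative names, not notation from the text.
ORBITALS = ["phi1", "phi1bar", "phi2", "phi2bar"]
SPINS = ["alpha", "beta", "alpha", "beta"]


def parity(perm):
    """Return +1 for an even permutation and -1 for an odd one."""
    sign = 1
    seen = [False] * len(perm)
    for start in range(len(perm)):
        length = 0
        j = start
        while not seen[j]:
            seen[j] = True
            j = perm[j]
            length += 1
        if length and length % 2 == 0:  # a cycle of even length is odd
            sign = -sign
    return sign


def expand_determinant():
    """Return one record per product in the expansion of psi_4."""
    terms = []
    for perm in permutations(range(4)):
        # perm[k] is the 0-based electron occupying spin-orbital k.
        electrons = tuple(e + 1 for e in perm)
        displaced = sum(1 for k in range(4) if perm[k] != k)
        # Spins agree with the leading term when every electron stays in a
        # spin-orbital of the spin it had there.
        spin_agreement = all(SPINS[perm[k]] == SPINS[k] for k in range(4))
        terms.append({
            "sign": parity(perm),
            "electrons": electrons,
            "displaced": displaced,
            "spin_agreement": spin_agreement,
        })
    return terms


def as_product(term):
    """Format a term, e.g. the leading term, as a signed product string."""
    body = " ".join(f"{orb}({e})" for orb, e in zip(ORBITALS, term["electrons"]))
    return ("+" if term["sign"] > 0 else "-") + body


terms = expand_determinant()
# A single pairwise permutation displaces exactly two electrons.
singles = [t for t in terms if t["displaced"] == 2]
like_spin_singles = [t for t in singles if t["spin_agreement"]]

for t in like_spin_singles:
    print(as_product(t))
```

Of the six single permutations, only the two that exchange electrons of like spin (1 with 3, and 2 with 4) survive the spin-agreement filter.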


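Upon expanding $\langle\psi_4|\hat{H}|\psi_4\rangle$, we obtain a set of 4! products on both the left- and right-
hand sides of $\hat{H}$:


$$\begin{aligned}
\bar{E} = (4!)^{-1}\int &\{\phi_1^*(1)\bar{\phi}_1^*(2)\phi_2^*(3)\bar{\phi}_2^*(4) - \phi_1^*(1)\bar{\phi}_1^*(4)\phi_2^*(3)\bar{\phi}_2^*(2) - \cdots\} \\
&\times \hat{H}(1,2,3,4)\,\{\phi_1(1)\bar{\phi}_1(2)\phi_2(3)\bar{\phi}_2(4) - \phi_1(1)\bar{\phi}_1(4)\phi_2(3)\bar{\phi}_2(2) - \cdots\}\,d\tau
\end{aligned} \tag{A7-4}$$


This can be expanded into a set of integrals, one for each term on the left:


$$\begin{aligned}
\bar{E} = (4!)^{-1}\Bigl[\;&\int \phi_1^*(1)\bar{\phi}_1^*(2)\phi_2^*(3)\bar{\phi}_2^*(4)\,\hat{H}(1,2,3,4)\,[\phi_1(1)\bar{\phi}_1(2)\phi_2(3)\bar{\phi}_2(4) \\
&\quad - \phi_1(1)\bar{\phi}_1(4)\phi_2(3)\bar{\phi}_2(2) - \phi_1(2)\bar{\phi}_1(1)\phi_2(3)\bar{\phi}_2(4) - \cdots]\,d\tau \\
&- \int \phi_1^*(1)\bar{\phi}_1^*(4)\phi_2^*(3)\bar{\phi}_2^*(2)\,\hat{H}(1,2,3,4)\,[\phi_1(1)\bar{\phi}_1(2)\phi_2(3)\bar{\phi}_2(4) \\
&\quad - \phi_1(1)\bar{\phi}_1(4)\phi_2(3)\bar{\phi}_2(2) - \phi_1(2)\bar{\phi}_1(1)\phi_2(3)\bar{\phi}_2(4) - \cdots]\,d\tau + \text{etc.}\Bigr]
\end{aligned} \tag{A7-5}$$


In Eq. (A7-5), $\bar{E}$ is a sum of 4! integrals, each containing one term from the set on
the left of $\hat{H}$ and all 4! from the set on the right.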

At first, it might seem that we must evaluate all of the 4! integrals in Eq. (A7-5). But
it can be shown that these integrals, times their +1 or -1 coefficients, are all equal to
each other, enabling us to write $\bar{E}$ as 4! times the first integral:


$$\bar{E} = \int \phi_1^*(1)\bar{\phi}_1^*(2)\phi_2^*(3)\bar{\phi}_2^*(4)\,\hat{H}(1,2,3,4)\sum_P (-1)^p\,P\bigl(\phi_1(1)\bar{\phi}_1(2)\phi_2(3)\bar{\phi}_2(4)\bigr)\,d\tau \tag{A7-6}$$

The demonstration that the various integrals in Eq. (A7-5), times their coefficients,
are equal to each other is as follows. Consider the second integral in Eq. (A7-5).
Note that, if we permute electrons 2 and 4 in that integral, we restore the term on the
left of $\hat{H}$ to its original "leading term" order, thereby making that product identical
to its counterpart in the first integral. Furthermore, if we carry out this permutation
throughout the whole of the integrand of integral number 2 (i.e., in $\hat{H}$, in all 4! products
to the right of $\hat{H}$, and in $d\tau$), we will not affect the value of the integral. (Recall that
relabeling integration variables leaves a definite integral unchanged; for example,

$$\int_0^1 x^2\,dx \int_1^2 y\,dy = \int_0^1 y^2\,dy \int_1^2 x\,dx.)$$


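The result of this permutation $P_{2,4}$ on the second integral in Eq. (A7-5) is


$$\int \phi_1^*(1)\bar{\phi}_1^*(2)\phi_2^*(3)\bar{\phi}_2^*(4)\,\hat{H}(1,4,3,2)\,[\phi_1(1)\bar{\phi}_1(4)\phi_2(3)\bar{\phi}_2(2) - \phi_1(1)\bar{\phi}_1(2)\phi_2(3)\bar{\phi}_2(4) - \cdots]\,d\tau \tag{A7-7}$$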


Now $\hat{H}$ is invariant under exchange of electron indices, and the set of products to the
right of $\hat{H}$ in Eq. (A7-7) is the same set we had before, but their order is changed, and
the whole set evidently differs by a factor of -1 from the set in the first integral.
Therefore, we can say that the first and second integrals of Eq. (A7-5) have the same
absolute value but different signs. However, the fact that these integrals contribute to
$\bar{E}$ with opposite signs cancels the sign disagreement. In this way, every integral in
Eq. (A7-5) can be compared to the leading integral and Eq. (A7-6) verified. This much
simplified expression for $\bar{E}$ is, for the 2n-electron case,


$$\bar{E} = \int \phi_1^*(1)\bar{\phi}_1^*(2)\cdots\phi_n^*(2n-1)\bar{\phi}_n^*(2n)\,\hat{H}(1,2,\ldots,2n)\sum_P (-1)^p\,P\bigl(\phi_1(1)\bar{\phi}_1(2)\cdots\phi_n(2n-1)\bar{\phi}_n(2n)\bigr)\,d\tau \tag{A7-8}$$


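At this point we write out $\hat{H}$ more explicitly. It is, in atomic units,


$$\hat{H}(1,2,\ldots,2n) = \sum_{i=1}^{2n}\Bigl(-\tfrac{1}{2}\nabla_i^2 - \sum_{\mu}^{\text{nuclei}} Z_\mu/r_{\mu i}\Bigr) + \sum_{i=1}^{2n-1}\sum_{j=i+1}^{2n} 1/r_{ij} \tag{A7-9}$$

$$= \sum_{i=1}^{2n} \hat{H}^{\text{core}}(i) + {\sum}' \, 1/r_{ij} \tag{A7-10}$$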
i=l 


Here, ${\sum}'$ is a shorthand symbol for the double sum in Eq. (A7-9). We see that $\hat{H}$ is
composed of one-electron operators, $\hat{H}^{\text{core}}(i)$, which deal with the kinetic and nuclear-
electron attraction energies for electron i, and two-electron operators for interelectronic
repulsion. The internuclear repulsion is omitted since, for a given nuclear configuration,
it is simply a constant that can be added to the electronic energy. Note for future
reference that $\hat{H}$ has no dependence on electron spin coordinates.

If we insert the expression (A7-10) for $\hat{H}$ into Eq. (A7-6) for $\bar{E}$, it is easy to see
that we can expand the result into separate sets of integrals over one- and two-electron
operators. We consider first the one-electron integrals. For $\psi_4$, these are


$$\int \phi_1^*(1)\bar{\phi}_1^*(2)\phi_2^*(3)\bar{\phi}_2^*(4)\sum_{i=1}^{4}\hat{H}^{\text{core}}(i)\,[\phi_1(1)\bar{\phi}_1(2)\phi_2(3)\bar{\phi}_2(4) - \phi_1(1)\bar{\phi}_1(4)\phi_2(3)\bar{\phi}_2(2) - \cdots]\,d\tau \tag{A7-11}$$


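Upon expanding this, the first set of integrals we obtain is


$$\int \phi_1^*(1)\bar{\phi}_1^*(2)\phi_2^*(3)\bar{\phi}_2^*(4)\,\bigl[\hat{H}^{\text{core}}(1) + \hat{H}^{\text{core}}(2) + \hat{H}^{\text{core}}(3) + \hat{H}^{\text{core}}(4)\bigr]\,\phi_1(1)\bar{\phi}_1(2)\phi_2(3)\bar{\phi}_2(4)\,d\tau(1)\,d\tau(2)\,d\tau(3)\,d\tau(4) \tag{A7-12}$$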


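This can be expanded again. The first integral contains the operator $\hat{H}^{\text{core}}(1)$, which does
not act on electrons 2-4. This allows a separation into a product of integrals as follows:


$$\int \phi_1^*(1)\bar{\phi}_1^*(2)\phi_2^*(3)\bar{\phi}_2^*(4)\,\hat{H}^{\text{core}}(1)\,\phi_1(1)\bar{\phi}_1(2)\phi_2(3)\bar{\phi}_2(4)\,d\tau(1)\,d\tau(2)\,d\tau(3)\,d\tau(4) \tag{A7-13}$$

$$= \int \phi_1^*(1)\hat{H}^{\text{core}}(1)\phi_1(1)\,d\tau(1)\int \bar{\phi}_1^*(2)\bar{\phi}_1(2)\,d\tau(2)\int \phi_2^*(3)\phi_2(3)\,d\tau(3)\int \bar{\phi}_2^*(4)\bar{\phi}_2(4)\,d\tau(4) \tag{A7-14}$$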


The last three integrals are overlap integrals and are all unity by virtue of normality of
the MOs. The first integral, a "core integral," is normally symbolized $H_{11}$. Here the
subscripts refer to the MO index, not the electron index:


$$H_{ii} = \int \phi_i^*(1)\,\hat{H}^{\text{core}}(1)\,\phi_i(1)\,d\tau(1) \tag{A7-15}$$


Therefore, Eq. (A7-13) equals $H_{11}$. By continued expansion, Eq. (A7-12) can be shown
to be equal to $H_{11} + H_{11} + H_{22} + H_{22} = 2(H_{11} + H_{22})$. In this case, we have been
dealing with identical MO products on the two sides of the operator. As we continue
evaluating the expansion of Eq. (A7-11), we next encounter an integral in which the
products differ by a permutation, namely,


$$-\int \phi_1^*(1)\bar{\phi}_1^*(2)\phi_2^*(3)\bar{\phi}_2^*(4)\,\bigl[\hat{H}^{\text{core}}(1) + \hat{H}^{\text{core}}(2) + \hat{H}^{\text{core}}(3) + \hat{H}^{\text{core}}(4)\bigr]\,\phi_1(1)\bar{\phi}_1(4)\phi_2(3)\bar{\phi}_2(2)\,d\tau \tag{A7-16}$$


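Again, for $\hat{H}^{\text{core}}(1)$, this may be written


$$-\int \phi_1^*(1)\hat{H}^{\text{core}}(1)\phi_1(1)\,d\tau(1)\int \bar{\phi}_1^*(2)\bar{\phi}_2(2)\,d\tau(2)\int \phi_2^*(3)\phi_2(3)\,d\tau(3)\int \bar{\phi}_2^*(4)\bar{\phi}_1(4)\,d\tau(4) \tag{A7-17}$$
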
Orbital orthogonality will cause the second and fourth integrals to vanish. In general,
if the two products differ by one or more permutations, they will have two or more sites
of disagreement. Upon expansion, at least one disagreement will occur in an overlap
integral, causing the integral to vanish. Thus, except for the integral involving identical
products [Eq. (A7-12)], all the integrals obtained by expansion of Eq. (A7-11) vanish.
Our result, generalized to the 2n-electron case, is (see Appendix 9 for bra-ket notation)


$$\Bigl\langle \psi_{2n} \Bigl| \sum_{i=1}^{2n}\hat{H}^{\text{core}}(i) \Bigr| \psi_{2n} \Bigr\rangle = 2\sum_{i=1}^{n} H_{ii} \tag{A7-18}$$


We now turn to integrals containing two-electron operators. Consider, for example,
an integral that has products differing in two places,


$$-\langle \phi_1(1)\bar{\phi}_1(2)\phi_2(3)\bar{\phi}_2(4)\,|\,1/r_{13}\,|\,\phi_1(3)\bar{\phi}_1(2)\phi_2(1)\bar{\phi}_2(4)\rangle \tag{A7-19}$$


This can be partly separated into a product of integrals over different electron
coordinates.


$$\text{(A7-19)} = -\langle \phi_1(1)\phi_2(3)\,|\,1/r_{13}\,|\,\phi_1(3)\phi_2(1)\rangle\,\langle \bar{\phi}_1(2)|\bar{\phi}_1(2)\rangle\,\langle \bar{\phi}_2(4)|\bar{\phi}_2(4)\rangle \tag{A7-20}$$


Observe that the two disagreements are inside the two-electron integral, and that the
overlap terms both show complete internal agreement and are therefore equal to unity.
It is clear that, if our two products differed in more than two places, at least one such
disagreement would appear in an overlap integral, causing the whole integral to vanish.
Therefore, two-electron integrals need be considered only if they involve products
differing by zero or one permutations. Let us consider these two possibilities separately.
If there are no disagreements, we have for $\psi_4$,


$$\begin{aligned}
\langle \phi_1(1)&\bar{\phi}_1(2)\phi_2(3)\bar{\phi}_2(4)\,|\,{\textstyle\sum}'\,1/r_{ij}\,|\,\phi_1(1)\bar{\phi}_1(2)\phi_2(3)\bar{\phi}_2(4)\rangle \\
&= \langle \phi_1(1)\bar{\phi}_1(2)|1/r_{12}|\phi_1(1)\bar{\phi}_1(2)\rangle + \langle \phi_1(1)\phi_2(3)|1/r_{13}|\phi_1(1)\phi_2(3)\rangle \\
&\quad + \langle \phi_1(1)\bar{\phi}_2(4)|1/r_{14}|\phi_1(1)\bar{\phi}_2(4)\rangle + \langle \bar{\phi}_1(2)\phi_2(3)|1/r_{23}|\bar{\phi}_1(2)\phi_2(3)\rangle \\
&\quad + \langle \bar{\phi}_1(2)\bar{\phi}_2(4)|1/r_{24}|\bar{\phi}_1(2)\bar{\phi}_2(4)\rangle + \langle \phi_2(3)\bar{\phi}_2(4)|1/r_{34}|\phi_2(3)\bar{\phi}_2(4)\rangle
\end{aligned} \tag{A7-21}$$


These integrals give the coulombic repulsion between electrons in MOs. They are
symbolized $J_{ij}$, where


$$J_{ij} = \langle \phi_i(1)\phi_j(2)\,|\,1/r_{12}\,|\,\phi_i(1)\phi_j(2)\rangle \equiv \langle ij|ij\rangle \tag{A7-22}$$


Here $\phi_i$ and $\phi_j$ may be associated with either spin. Because the operator and MOs
commute, the integrand can be rearranged to give


$$J_{ij} = \int \phi_i^*(1)\phi_i(1)\,(1/r_{12})\,\phi_j^*(2)\phi_j(2)\,d\tau_1\,d\tau_2 \equiv (ii|jj) \tag{A7-23}$$


Some people prefer this form because it places the two mutually repelling charge
clouds on the two sides of the operator. It is important to realize that the parenthetical
expression $(ii|jj)$ and the bra-ket shorthand $\langle ij|ij\rangle$ correspond to different conventions
for electron index order, and are really the same integral. Returning to Eq. (A7-21), we
see that it is equal to


$$J_{11} + J_{12} + J_{12} + J_{12} + J_{12} + J_{22} = J_{11} + 4J_{12} + J_{22} = \sum_{i=1}^{2}\Bigl(J_{ii} + \sum_{j\neq i} 2J_{ij}\Bigr) \tag{A7-24}$$

We must now consider the case where the products differ by a single permutation,
hence in two places. An example has been provided in Eq. (A7-19). We noted that, when
the operator $1/r_{ij}$ corresponds to electrons i and j in the positions of disagreement,
the overlap integrals are all unity. Otherwise, at least one overlap integral vanishes. An
integral like that in Eq. (A7-20) is called an exchange integral.

Exchange integrals can occur only when the product on the right of the operator 
differs from the leading term by a single permutation. Hence, exchange integrals always 
enter with a coefficient of —1. 

We noted earlier that two classes of singly permuted products exist. One class
involves permutations between electrons of like spin. In such a case, $\phi_i$ and $\phi_j$ appear
throughout the integral $K_{ij}$ with spin agreement. For cases where electrons of different
spin have been permuted, spin disagreement forces the exchange integral to vanish.
(Since $1/r_{ij}$ is not a spin operator, the integration over spin coordinates factors out and
produces a vanishing integral if spins disagree.)

The result of all this is that each singly permuted product can give $-K_{ij}$ if the
permutation is between electrons of like spin in MOs $\phi_i$ and $\phi_j$, and zero otherwise.
For $\psi_4$, the acceptable permutations can be seen to be electron 1 with 3 and electron 2
with 4, both of these occurring between $\phi_1$ and $\phi_2$ space MOs. Hence, the contribution
to $\bar{E}$ is $-2K_{12}$. Combining this with J terms gives


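$$\langle \psi_4\,|\,{\textstyle\sum}'\,1/r_{ij}\,|\,\psi_4\rangle = J_{11} + 4J_{12} - 2K_{12} + J_{22} \tag{A7-25}$$


From the definitions of J and K, it is apparent that


$$J_{ij} = J_{ji}, \qquad K_{ij} = K_{ji}, \qquad K_{ii} = J_{ii}$$


This allows us to rewrite Eq. (A7-25) as


$$2J_{11} - K_{11} + 2J_{12} - K_{12} + 2J_{21} - K_{21} + 2J_{22} - K_{22} = \sum_{i=1}^{2}\sum_{j=1}^{2}(2J_{ij} - K_{ij}) \tag{A7-26}$$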


Generalizing to the 2n-electron, closed-shell case and adding in our one-electron
contribution,


$$\bar{E} = \langle \psi_{2n}|\hat{H}|\psi_{2n}\rangle = 2\sum_{i=1}^{n} H_{ii} + \sum_{i=1}^{n}\sum_{j=1}^{n}(2J_{ij} - K_{ij}) \tag{A7-27}$$


This is the desired expression for $\bar{E}$ in terms of integrals over MOs $\phi_i$ for a single-
determinantal, closed-shell wavefunction.
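
As an illustration of Eq. (A7-27), the sketch below evaluates the closed-shell energy from tables of MO integrals. The function name `closed_shell_energy` and all numerical values are hypothetical, chosen only so that the two-MO case can be compared with the explicit form $2(H_{11} + H_{22}) + J_{11} + 4J_{12} - 2K_{12} + J_{22}$; they do not describe any real molecule.

```python
def closed_shell_energy(h_core, coulomb, exchange):
    """Evaluate E = 2*sum_i H_ii + sum_ij (2*J_ij - K_ij) for n doubly occupied MOs.

    h_core   -- list of the n core integrals H_ii
    coulomb  -- n x n nested list of J_ij
    exchange -- n x n nested list of K_ij (with K_ii = J_ii)
    """
    n = len(h_core)
    one_electron = 2.0 * sum(h_core)
    two_electron = sum(
        2.0 * coulomb[i][j] - exchange[i][j]
        for i in range(n)
        for j in range(n)
    )
    return one_electron + two_electron


# Made-up integrals (atomic units) for a two-MO, four-electron case.
H = [-1.0, -0.5]
J = [[0.6, 0.4],
     [0.4, 0.5]]
K = [[0.6, 0.1],   # diagonal elements equal J_ii, as required by K_ii = J_ii
     [0.1, 0.5]]

energy = closed_shell_energy(H, J, K)

# Explicit four-electron form obtained from Eqs. (A7-18) and (A7-25).
explicit = 2.0 * (H[0] + H[1]) + J[0][0] + 4.0 * J[0][1] - 2.0 * K[0][1] + J[1][1]
print(energy, explicit)
```

The double sum includes the i = j terms, where $2J_{ii} - K_{ii} = J_{ii}$, which is how the single repulsion between the two electrons sharing an MO is recovered.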


A7-2 Derivation of the Hartree-Fock Equations


To find the "best" MOs, we seek those that minimize $\bar{E}$, that is, those MOs $\phi$ for which
$\bar{E}$ is stationary to small variations $\delta\phi$. But there is a restriction in the variations $\delta\phi$.
The MOs can only be varied in ways that do not destroy their orthonormality since this
property was assumed in deriving Eq. (A7-27). This means that, for proper variations
$\delta\phi$ at the minimum $\bar{E}$, both $\bar{E}$ and all the MO overlap integrals $S_{ij} \equiv \langle\phi_i|\phi_j\rangle$ must
remain constant. ($S_{ij}$ must equal unity when i = j, zero otherwise.) If $\bar{E}$ and $S_{ij}$ are
constant, any linear combination of them is constant too. Thus, we may write that, at
the minimum $\bar{E}$,


$$c_0\bar{E} + \sum_i\sum_j c_{ij}S_{ij} = \text{constant} \tag{A7-28}$$

for our restricted type of $\delta\phi$. This equation will hold for any set of coefficients c as
long as $\delta\phi$ is of the proper restricted nature. However, it is possible to show that, for
a particular set of coefficients, Eq. (A7-28) is satisfied at minimum $\bar{E}$ for any small
variations $\delta\phi$. The particular coefficients are called Lagrangian multipliers. They are
of undetermined value thus far, but their values will become known in the course of
solving the problem. The technique, known as "Lagrange's method of undetermined
multipliers," is from the calculus of variations.¹ The Lagrangian multipliers will
ultimately turn out to be essentially the MO energies. For future convenience we write
Eq. (A7-28) in the form


$$\bar{E} - 2\sum_i\sum_j \epsilon_{ij}S_{ij} = \text{constant for } \delta\phi \tag{A7-29}$$


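where we now understand $\delta\phi$ to be unrestricted and $\epsilon_{ij}$ to be some unknown special
set of constants. The stability of the quantity on the left-hand side of Eq. (A7-29) may
be expressed as follows:


$$\delta\bar{E} - 2\,\delta\sum_i\sum_j \epsilon_{ij}S_{ij} = 0 \tag{A7-30}$$


or, expanding $\bar{E}$,


$$2\sum_{i=1}^{n}\delta H_{ii} + \sum_{i=1}^{n}\sum_{j=1}^{n}(2\,\delta J_{ij} - \delta K_{ij}) - 2\sum_{i=1}^{n}\sum_{j=1}^{n}\epsilon_{ij}\,\delta S_{ij} = 0 \tag{A7-31}$$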


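The variations occur in the MOs $\phi$, and so

$$\delta S_{ij} = \int \delta\phi_i^*(1)\,\phi_j(1)\,d\tau_1 + \int \phi_i^*(1)\,\delta\phi_j(1)\,d\tau_1 \tag{A7-32}$$

$$\delta H_{ii} = \int \delta\phi_i^*(1)\,\hat{H}^{\text{core}}(1)\,\phi_i(1)\,d\tau_1 + \int \phi_i^*(1)\,\hat{H}^{\text{core}}(1)\,\delta\phi_i(1)\,d\tau_1 \tag{A7-33}$$

$$\begin{aligned}
\delta J_{ij} = &\int \delta\phi_i^*(1)\,\phi_j^*(2)\,(1/r_{12})\,\phi_i(1)\,\phi_j(2)\,d\tau_1\,d\tau_2 \\
&+ \int \phi_i^*(1)\,\delta\phi_j^*(2)\,(1/r_{12})\,\phi_i(1)\,\phi_j(2)\,d\tau_1\,d\tau_2 + \text{complex conjugates}
\end{aligned} \tag{A7-34}$$

It is convenient to define a coulomb operator $\hat{J}_i(1)$ as


$$\hat{J}_i(1) = \int \phi_i^*(2)\,(1/r_{12})\,\phi_i(2)\,d\tau_2 \tag{A7-35}$$


¹ For an introduction to this topic, see Margenau and Murphy [1].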
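
Using this definition we can rewrite Eq. (A7-34) as


$$\delta J_{ij} = \int \delta\phi_i^*(1)\,\hat{J}_j(1)\,\phi_i(1)\,d\tau_1 + \int \delta\phi_j^*(1)\,\hat{J}_i(1)\,\phi_j(1)\,d\tau_1 + \text{complex conjugates} \tag{A7-36}$$


In the same spirit, we define an exchange operator $\hat{K}_i$, which, because it involves an
orbital exchange, must be written in the context of an orbital being operated on:


$$\hat{K}_i(1)\,\phi_j(1) = \int \phi_i^*(2)\,(1/r_{12})\,\phi_j(2)\,d\tau_2\;\phi_i(1) \tag{A7-37}$$


This enables us to write $\delta K_{ij}$ as


$$\delta K_{ij} = \int \delta\phi_i^*(1)\,\hat{K}_j(1)\,\phi_i(1)\,d\tau_1 + \int \delta\phi_j^*(1)\,\hat{K}_i(1)\,\phi_j(1)\,d\tau_1 + \text{complex conjugates}$$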

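Employing the operators $\hat{J}$ and $\hat{K}$, Eq. (A7-31) can be written as follows:

$$\begin{aligned}
2\sum_i \int \delta\phi_i^*(1)\Bigl[&\hat{H}^{\text{core}}(1)\phi_i(1) + \sum_j\bigl(2\hat{J}_j(1) - \hat{K}_j(1)\bigr)\phi_i(1) - \sum_j \epsilon_{ij}\phi_j(1)\Bigr]d\tau_1 \\
+\,2\sum_i \int \delta\phi_i(1)\Bigl[&\hat{H}^{\text{core}*}(1)\phi_i^*(1) + \sum_j\bigl(2\hat{J}_j^*(1) - \hat{K}_j^*(1)\bigr)\phi_i^*(1) - \sum_j \epsilon_{ji}\phi_j^*(1)\Bigr]d\tau_1 = 0
\end{aligned} \tag{A7-38}$$


Here we have made use of the hermitian properties of $\hat{H}^{\text{core}}$, $\hat{J}_i$, and $\hat{K}_i$, and also the
relation $\epsilon_{ij}\int \delta\phi_j(1)\phi_i^*(1)\,d\tau_1 = \epsilon_{ji}\int \delta\phi_i(1)\phi_j^*(1)\,d\tau_1$ (valid within the double sum over i and j), which is merely an index inter-
change.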

Since the variations $\delta\phi_i^*$ and $\delta\phi_i$ are independent, each half of Eq. (A7-38) must
independently equal zero. Hence, we can select either half for further development. We
will select the first half. This equation indicates that the sum of integrals equals zero.
Either the integrals are all individually equal to zero or else they are finite but cancel.
However, the latter possibility is ruled out because the variations $\delta\phi_i^*$ are arbitrary. By
appropriately picking $\delta\phi_i^*$, we could always spoil cancellation if the various integrals
for different i were nonzero. But the equation states that the sum vanishes for every
$\delta\phi_i^*$. Therefore, we are forced to conclude that each integral vanishes.

Continuing in the same spirit, we can conclude that the term in brackets in the
integrand is zero. For the integral to vanish requires the integrand either to be identically
zero or else to have equal positive and negative parts. If the latter were true for some
choice of $\delta\phi_i^*$, it would be possible to change $\delta\phi_i^*$ so as to unbalance the cancellation
and produce a nonzero integral. Since the integral is zero for all $\delta\phi_i^*$, it must be that
the bracketed term vanishes identically. Thus,


$$\Bigl[\hat{H}^{\text{core}}(1) + \sum_j\bigl(2\hat{J}_j(1) - \hat{K}_j(1)\bigr)\Bigr]\phi_i(1) = \sum_j \epsilon_{ij}\phi_j(1) \tag{A7-39}$$


for all i = 1 to n and for a certain set of constants $\epsilon_{ij}$.




The original development of SCF equations was performed by Hartree for simple
product wavefunctions. Fock later extended the approach to apply to antisymmetrized
wavefunctions. For this reason, the collection of operators in brackets in Eq. (A7-39)
is called the Fock operator, symbolized $\hat{F}$, and Eq. (A7-39) becomes


$$\hat{F}(1)\,\phi_i(1) = \sum_j \epsilon_{ij}\,\phi_j(1) \tag{A7-40}$$

Equation A7-40 is a differential equation for each MO $\phi_i$. But as it stands it is
not an eigenvalue equation because, instead of regenerating $\phi_i$, we obtain a sum of
functions $\phi_j$ times the various unknown constants $\epsilon_{ij}$. However, there remains a degree
of freedom in the problem that can be used to throw Eq. (A7-40) into eigenvalue form.
It is pointed out in Appendix 2 that the value of a determinant is unchanged if any row
or column, multiplied by a constant, is added to any other row or column. This means
that a Slater determinant of "best" MOs is unaffected by such internal rearrangements.
In other words, if we were to solve Eq. (A7-40) for a set of "best" MOs, $\phi^0$, we could
form various new orthonormal MOs (e.g., suitably normalized combinations of $\phi_i^0$ and $\phi_k^0$) by mixing them
together, and our wavefunction $\psi$, and all values of observables predicted from $\psi$,
including $\bar{E}$, would be precisely the same.

A transformation that mixes the MOs $\phi$ without affecting the property of orthonor-
mality is called a unitary transformation (see Chapter 9). Letting U stand for such a
transformation, we have that a transformed set of $\phi$'s, called $\phi'$, is given by


$$\phi_i' = \sum_j U_{ji}\,\phi_j, \qquad i = 1, \ldots, n \tag{A7-41}$$


In matrix notation, this is
$$\Phi' = \Phi U \tag{A7-42}$$
where $\Phi'$ and $\Phi$ are row vectors, viz.
$$\Phi' = (\phi_1'\;\phi_2'\;\cdots\;\phi_n') \tag{A7-43}$$
and U is an n x n matrix, with
$$UU^{\dagger} = U^{\dagger}U = 1 \tag{A7-44}$$
In terms of these matrices, Eq. (A7-40) is
$$\hat{F}\Phi = \Phi E \tag{A7-45}$$
where E is an n x n matrix. If we multiply this from the right by U, we obtain
$$\hat{F}\Phi U = \Phi E U \tag{A7-46}$$
Inserting 1 (in the form $UU^{\dagger}$) between $\Phi$ and E gives
$$\hat{F}\Phi U = \Phi U U^{\dagger} E U \tag{A7-47}$$
or
$$\hat{F}\Phi' = \Phi' U^{\dagger} E U \tag{A7-48}$$

We can now require that the matrix U be such that $U^{\dagger}EU$ is a diagonal matrix E'.
(This requires that E be a hermitian matrix, which can be shown to be the case.) This
requirement defines U, and we have


$$\hat{F}\Phi' = \Phi' E' \tag{A7-49}$$
which corresponds to
$$\hat{F}\phi_i' = \epsilon_i\,\phi_i', \qquad i = 1, 2, \ldots, n \tag{A7-50}$$


This equation has the desired eigenvalue form, and is commonly referred to as the 
Hartree-Fock equation. It is discussed at length in Chapter 11. 

It is important to bear in mind that our transformation U is for mathematical conve-
nience and has no physical effect. We may imagine that our original basis set spans a
certain function space, and that solution of Eq. (A7-50) produces a set of occupied MOs
$\phi_i'$ that span a "best" subspace. Transformations by unitary matrices produce new sets
of MOs $\phi''$, but these still span the same subspace as $\phi'$. However, they are generally
not eigenfunctions of $\hat{F}$, and satisfy the less convenient Eq. (A7-40). Nonetheless,
there are occasions when it is useful to use some set of MOs other than $\phi'$, and we can
always do this without having to worry about introducing physical changes as long as
our converted MOs are related to $\phi'$ by a unitary transformation.
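
The two properties used above (a unitary mixing of the occupied MOs leaves the determinant unchanged, and a suitable U diagonalizes the matrix of Lagrangian multipliers) can be illustrated with a small numerical sketch. It assumes a real two-orbital case, where a unitary matrix with determinant +1 reduces to a plane rotation; the orbital values `phi` and the matrix `E` are made-up numbers, and the helper names are illustrative.

```python
import math


def matmul(a, b):
    """Multiply two small matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(m):
    return [list(row) for row in zip(*m)]


def det2(m):
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]


def rotation(theta):
    """Real 2 x 2 unitary (orthogonal) matrix with determinant +1."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s], [s, c]]


# Hypothetical values of two MOs at two electron coordinates:
# row k holds (phi_1(x_k), phi_2(x_k)), so det2(phi) is an unnormalized
# 2 x 2 Slater determinant evaluated at (x_1, x_2).
phi = [[0.8, 0.3], [0.2, -0.7]]

# Hypothetical real symmetric (hence hermitian) matrix of multipliers.
E = [[-1.0, 0.2], [0.2, -0.4]]

# Rotation angle that removes the off-diagonal element of U^T E U.
theta = 0.5 * math.atan2(2.0 * E[0][1], E[0][0] - E[1][1])
U = rotation(theta)

phi_prime = matmul(phi, U)                     # Phi' = Phi U, as in Eq. (A7-42)
E_prime = matmul(transpose(U), matmul(E, U))   # E' = U^T E U (U is real here)

print(det2(phi), det2(phi_prime))
print(E_prime)
```

For a general complex unitary U the determinant can change by a phase factor of unit modulus, which likewise has no effect on observables.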

The Hartree-Fock equation is ordinarily used in quantum chemistry in connection
with a basis set of AOs, and it is possible to carry through a derivation of the
Hartree-Fock equation for this type of basis. Detailed treatments of this derivation may be
found in the paper by Roothaan [2] and in the book by Pople and Beveridge [3].
Appendix 8


The Virial Theorem for Atoms 
and Diatomic Molecules 


A8-1 Atoms


In Chapter 3 it was shown that, for the ground state of the quantum-mechanical harmonic 
oscillator, the average value of the kinetic energy is equal to the average value of the 
potential energy. We now consider how the average electronic kinetic and potential 
energies are related in an atom. We begin by deriving a rather general expression, 
and then we discuss how it applies to different levels of calculation. As our first step, 
we examine the effects of coordinate scaling on average values. In order to follow 
this discussion, it is useful to recall that one can manipulate variables and limits in an 
integral as follows: 


$$\int_a^b f(x)\,dx = \int_a^b f(y)\,dy = \int_{\eta x=a}^{\eta x=b} f(\eta x)\,d(\eta x) = \eta\int_{x=a/\eta}^{x=b/\eta} f(\eta x)\,dx \tag{A8-1}$$
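
The variable manipulation in Eq. (A8-1) can be verified numerically. The sketch below is an illustration only: the integrand `f`, the limits, the scale factor `eta`, and the simple midpoint rule are all arbitrary choices.

```python
import math


def midpoint_integral(f, a, b, steps=20000):
    """Approximate the integral of f from a to b with the midpoint rule."""
    h = (b - a) / steps
    return h * sum(f(a + (i + 0.5) * h) for i in range(steps))


def f(x):
    # Arbitrary smooth test integrand.
    return math.exp(-x) * x * x


a, b, eta = 0.5, 3.0, 1.7

# Left-hand side of Eq. (A8-1): integral of f(y) dy from a to b.
lhs = midpoint_integral(f, a, b)

# Right-hand side: eta times the integral of f(eta*x) dx from a/eta to b/eta.
rhs = eta * midpoint_integral(lambda x: f(eta * x), a / eta, b / eta)

print(lhs, rhs)
```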
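Let $\psi(\mathbf{r}_1, \mathbf{r}_2, \ldots, \mathbf{r}_n)$ be a normalized function of the space coordinates of n electrons.
Let
$$\bar{T} = \langle\psi|\hat{T}|\psi\rangle \tag{A8-2}$$
$$\bar{V} = \langle\psi|\hat{V}|\psi\rangle \tag{A8-3}$$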


where $\hat{T}$ and $\hat{V}$ are, respectively, the kinetic and potential energy operators for some
system, and are independent of spin.

We introduce a scale factor $\eta$ into $\psi$. This factor affects the lengths of the vectors
$\mathbf{r}_i$ but not their directions. That is,


$$\psi_\eta = \psi(\eta\mathbf{r}_1, \eta\mathbf{r}_2, \ldots, \eta\mathbf{r}_n) \tag{A8-4}$$


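If $\eta > 1$, $\psi_\eta$ is more contracted in 3n-dimensional space than $\psi$. For $\eta < 1$, $\psi_\eta$ is
more diffuse.
We must check to see if our scaled function $\psi_\eta$ is normalized. We know that


$$1 = \int \psi^*(\mathbf{r}_1, \ldots, \mathbf{r}_n)\,\psi(\mathbf{r}_1, \ldots, \mathbf{r}_n)\,dv = \int \psi^*(\eta\mathbf{r}_1, \ldots, \eta\mathbf{r}_n)\,\psi(\eta\mathbf{r}_1, \ldots, \eta\mathbf{r}_n)\,d(\eta v) \tag{A8-5}$$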


because we have simply relabeled all variables r by $\eta r$, including the volume element,
just as in Eq. A8-1. We now factor $\eta$ out of the volume element and divide the limits of
integration by $\eta$, just as in Eq. A8-1. However, the limits are zero and infinity, so they
are unaffected. The volume element $d(\eta v)$ is given by


$$d(\eta v) = (\eta r_1)^2\sin\theta_1\,d(\eta r_1)\,d\theta_1\,d\phi_1\,(\eta r_2)^2\sin\theta_2\,d(\eta r_2)\,d\theta_2\,d\phi_2\cdots \tag{A8-6}$$


and so $\eta^3$ appears for each electron. Thus, we are led to


$$1 = \eta^{3n}\int \psi_\eta^*\,\psi_\eta\,dv \tag{A8-7}$$


Therefore, our normalization constant for $\psi_\eta$ is $\eta^{3n/2}$, and our normalized, scaled
function is
$$\psi_\eta = \eta^{3n/2}\,\psi(\eta\mathbf{r}_1, \eta\mathbf{r}_2, \ldots, \eta\mathbf{r}_n) \tag{A8-8}$$


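[Compare this with the specific example encountered in Eq. (7-7).] We now inquire as
to the values of $\bar{V}_\eta$ and $\bar{T}_\eta$, where


$$\bar{T}_\eta = \langle\psi_\eta|\hat{T}|\psi_\eta\rangle \tag{A8-9}$$
$$\bar{V}_\eta = \langle\psi_\eta|\hat{V}|\psi_\eta\rangle \tag{A8-10}$$
For an n-electron atom,
$$\hat{T} = -\tfrac{1}{2}\sum_{i=1}^{n}\nabla_i^2 \tag{A8-11}$$

$$\hat{V} = -\sum_{i=1}^{n} Z/r_i + \sum_{i=1}^{n-1}\sum_{j=i+1}^{n} 1/r_{ij} \tag{A8-12}$$


Therefore


$$\bar{V}_\eta = \eta^{3n}\int \psi^*(\eta\mathbf{r}_1, \ldots)\Bigl[\sum_{i=1}^{n}\Bigl(-\frac{Z}{r_i}\Bigr) + \sum_{i=1}^{n-1}\sum_{j=i+1}^{n}\frac{1}{r_{ij}}\Bigr]\psi(\eta\mathbf{r}_1, \ldots)\,dv \tag{A8-13}$$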


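We could make the integral equal to $\bar{V}$ if we could get the scale factor into all the r terms
in the operator and also into dv. The volume element dv requires $\eta^{3n}$, which is already
present in Eq. A8-13 from the normalization constants. To get $\eta$ into the operator, we
need to multiply the operator by $\eta^{-1}$. Multiplying Eq. A8-13 by $\eta\eta^{-1}$ gives, then


$$\bar{V}_\eta = \eta\int \psi^*(\eta\mathbf{r}_1, \ldots)\Bigl[\sum_{i=1}^{n}\Bigl(-\frac{Z}{\eta r_i}\Bigr) + \sum_{i=1}^{n-1}\sum_{j=i+1}^{n}\frac{1}{\eta r_{ij}}\Bigr]\psi(\eta\mathbf{r}_1, \ldots)\,d(\eta v) \tag{A8-14}$$


or
$$\bar{V}_\eta = \eta\bar{V} \tag{A8-15}$$
The same approach to $\bar{T}_\eta$ gives


$$\bar{T}_\eta = \eta^2\,\bar{T} \tag{A8-16}$$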
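
This arises from the fact that


$$\nabla^2 = \frac{1}{r^2}\frac{\partial}{\partial r}\Bigl(r^2\frac{\partial}{\partial r}\Bigr) + \frac{1}{r^2\sin\theta}\frac{\partial}{\partial\theta}\Bigl(\sin\theta\frac{\partial}{\partial\theta}\Bigr) + \frac{1}{r^2\sin^2\theta}\frac{\partial^2}{\partial\phi^2} \tag{A8-17}$$


and scaling the r terms here requires multiplying by $\eta^{-2}$. Hence, the integral is multi-
plied by $\eta^2\eta^{-2}$ in the final step.
The general result is that, for any quantum-mechanical system where


$$\hat{V} = f(r^{-\nu}) \tag{A8-18}$$


scaling results in


$$\bar{T}_\eta = \eta^2\,\bar{T}, \qquad \bar{V}_\eta = \eta^{\nu}\,\bar{V} \tag{A8-19}$$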


For atoms, $\hat{V}$ contains r as $r^{-1}$. For all systems, $\hat{T}$ involves $\nabla^2$, which contains r to
the net power of -2. We can now write the expression for the total energy of the atom
as given by the scaled function


$$\bar{E}_\eta = \bar{T}_\eta + \bar{V}_\eta = \eta^2\,\bar{T} + \eta\bar{V} \tag{A8-20}$$


Now we can seek the best value of the scale factor. We do this by minimizing $\bar{E}_\eta$
with respect to variations in $\eta$:


$$\partial\bar{E}_\eta/\partial\eta = 2\eta\bar{T} + \bar{V} = 0 \tag{A8-21}$$


($\bar{T}$ and $\bar{V}$ are independent of $\eta$.)

We are now in a position to make some statements about the average values of $\hat{T}$
and $\hat{V}$ for certain wavefunctions. Let us consider first the exact values of $\bar{T}$ and $\bar{V}$. We
know that, if $\psi$ were an exact eigenfunction, no further energy lowering would result
from rescaling. That is, $\eta$ equals unity in Eq. A8-21. As a result,


$$2\bar{T} + \bar{V} = 0 \tag{A8-22}$$
or
$$\bar{V} = -2\bar{T} \tag{A8-23}$$
or, since $\bar{T} + \bar{V} = \bar{E}$,
$$\bar{E} = -\bar{T} = \tfrac{1}{2}\bar{V} \tag{A8-24}$$


Thus, for an atom, we know that the exact nonrelativistic energy is equal to minus
the exact average kinetic energy and is equal to one half the exact potential energy.
Knowing that the exact energy of the ground-state neon atom is -128.925 a.u. enables
us to say that $\bar{T}$ = +128.925 a.u. and $\bar{V}$ = -257.850 a.u. without actually knowing $\psi$.
Moreover, the same relation holds for any stable state of an atom.

This same argument holds, not only for exact solutions, but for any trial function that
has already been energy-optimized with respect to a scale factor, for then a new scaling
parameter $\eta$ gives no improvement, $\eta = 1$, and all is as above. Thus, any nonlinear
variation scheme consistent with uniform scaling should ultimately lead to the relations
[Eqs. A8-22 to A8-24]. Satisfying these relations is frequently referred to as satisfying
the virial relation. Completely optimized single-$\zeta$ and double-$\zeta$ functions satisfy the
virial relation.

It follows that Hartree-Fock atomic wavefunctions must satisfy the virial relation.
Such solutions are, by definition, the best (lowest energy) attainable in a single deter-
minantal form. "Best" includes all conceivable variation, linear or nonlinear, so all
improvements achievable by scale factor variation are already present at the
Hartree-Fock level, and $\eta = 1$.

In the event that $\bar{E}$ can be lowered by scaling, it is possible to evaluate the optimum $\eta$
from Eq. A8-21, which gives


$$\eta = -\bar{V}/2\bar{T} \tag{A8-25}$$
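
A short numerical sketch of Eqs. (A8-20), (A8-21), and (A8-25) follows. As an example it uses the helium atom with the unscaled product function exp(-r1)exp(-r2); the input values $\bar{T} = 1$ and $\bar{V} = -2Z + 5/8$ a.u. (with Z = 2) are standard results for 1s-type functions quoted here as assumptions rather than derived in this appendix, and the function names are illustrative.

```python
def optimum_scale(t_bar, v_bar):
    """Scale factor that minimizes the scaled energy, Eq. (A8-25)."""
    return -v_bar / (2.0 * t_bar)


def scaled_energy(t_bar, v_bar, eta):
    """Energy of the scaled function, Eq. (A8-20), for a coulombic potential."""
    return eta ** 2 * t_bar + eta * v_bar


# Assumed averages (a.u.) for helium with the unscaled function
# exp(-r1) * exp(-r2): kinetic energy 1/2 per electron, nuclear attraction
# -Z per electron, and electron repulsion 5/8.
Z = 2.0
t_bar = 2 * 0.5
v_bar = 2 * (-Z) + 5.0 / 8.0

eta = optimum_scale(t_bar, v_bar)
energy = scaled_energy(t_bar, v_bar, eta)

# Averages for the optimized function; these should satisfy the virial relation.
t_opt = eta ** 2 * t_bar
v_opt = eta * v_bar

print(eta, energy, v_opt / t_opt)
```

With these inputs the optimum scale factor is 27/16, and the scaled averages satisfy $\bar{V}_\eta/\bar{T}_\eta = -2$ even though the function is far from exact.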


One of the useful applications of the virial theorem is as an indicator of closeness of
approach to the Hartree-Fock solution for an atom. If the calculation involves nonlinear
variation (uniformly applied to all r coordinates), then the resulting wavefunction will
satisfy the virial relations no matter how deficient it is as an approximation to the
true eigenfunction. However, if the calculation involves only linear variation, as for
example, when a linear combination of gaussian functions is used to approximate an
AO, then there is no guarantee that the virial relation will be satisfied. If the basis set
is extensive enough, however, the Hartree-Fock limit will be approached, and $\bar{V}/\bar{T}$
will approach -2. Strictly speaking, a linear variation calculation on an atom that
gives $\bar{V}/\bar{T} = -2$ is simply one that cannot be improved by uniform scaling. Therefore,
approach to -2 is not a guarantee of approach to the Hartree-Fock limit. It is a necessary
but not a sufficient condition.


A8-2 Diatomic Molecules


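The treatment here is very similar to that for atoms. We make the Born-Oppenheimer
approximation by assuming that $\psi$ depends parametrically on the internuclear separa-
tion R:


$$\psi = \psi(\mathbf{r}_1, \mathbf{r}_2, \ldots, \mathbf{r}_n, R) \tag{A8-26}$$
When we scale $\mathbf{r}_i$, we scale R as well:
$$\psi_\eta = \eta^{3n/2}\,\psi(\eta\mathbf{r}_1, \eta\mathbf{r}_2, \ldots, \eta\mathbf{r}_n, \eta R) \tag{A8-27}$$


Henceforth, we let $\eta R \equiv \rho$. Performing the same variable manipulations as in
Section A8-1, we find


$$\bar{T}_\eta = \bar{T}(\eta, \rho) = \eta^2\,\bar{T}(1, \rho) \tag{A8-28}$$
$$\bar{V}_\eta = \bar{V}(\eta, \rho) = \eta\,\bar{V}(1, \rho) \tag{A8-29}$$


Here, $\hat{V}$ may or may not include the internuclear repulsion term. This gives, for the
total energy,


$$\bar{E}_\eta = \eta^2\,\bar{T}(1, \rho) + \eta\,\bar{V}(1, \rho) \tag{A8-30}$$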
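
Upon taking the derivative with respect to $\eta$, we obtain


$$\frac{\partial\bar{E}_\eta}{\partial\eta} = 2\eta\,\bar{T}(1, \rho) + \bar{V}(1, \rho) + \eta^2\,\frac{\partial\bar{T}(1, \rho)}{\partial\eta} + \eta\,\frac{\partial\bar{V}(1, \rho)}{\partial\eta} = 0 \tag{A8-31}$$


This differs from the atomic case in that $\bar{V}(1, \rho)$ and $\bar{T}(1, \rho)$ depend on $\eta$ through $\rho$. But


$$\frac{\partial}{\partial\eta} = \frac{\partial\rho}{\partial\eta}\frac{\partial}{\partial\rho} = R\,\frac{\partial}{\partial\rho} \tag{A8-32}$$


and so Eq. A8-31 becomes


$$\frac{\partial\bar{E}_\eta}{\partial\eta} = 2\eta\,\bar{T}(1, \rho) + \bar{V}(1, \rho) + \eta^2 R\,\frac{\partial\bar{T}(1, \rho)}{\partial\rho} + \eta R\,\frac{\partial\bar{V}(1, \rho)}{\partial\rho} = 0 \tag{A8-33}$$
If we assume that $\psi$ is the exact eigenfunction, then $\eta = 1$, and
$$2\bar{T} + \bar{V} + R\Bigl(\frac{\partial\bar{E}}{\partial R}\Bigr) = 0 \tag{A8-34}$$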


Indeed, this relation holds for any case in which all improvement in the nature of
a scale factor variation has been made, such as, for example, the Hartree-Fock limit.
Note that, if $\hat{V}$ contains internuclear repulsion, $\bar{E}$ is the total energy. If not, $\bar{E}$ is the
electronic energy.
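
Combining Eq. (A8-34) with $\bar{T} + \bar{V} = \bar{E}$ gives $\bar{T} = -\bar{E} - R(\partial\bar{E}/\partial R)$ and $\bar{V} = 2\bar{E} + R(\partial\bar{E}/\partial R)$, so a potential energy curve alone fixes both averages. The sketch below applies this to a hypothetical Morse-type curve; the curve, its parameters, and the function names are assumptions made only for illustration, with $\bar{E}$ taken to be the total energy including nuclear repulsion.

```python
import math


def numerical_derivative(f, x, h=1e-5):
    """Central-difference estimate of df/dx."""
    return (f(x + h) - f(x - h)) / (2.0 * h)


def virial_partition(energy, R):
    """Return (T, V) at separation R from E(R), using Eq. (A8-34) and T + V = E."""
    E = energy(R)
    slope = numerical_derivative(energy, R)
    T = -E - R * slope
    V = 2.0 * E + R * slope
    return T, V


def model_curve(R, E_sep=-1.0, D=0.17, a=1.0, R_e=1.4):
    """Hypothetical Morse-type total energy in a.u. (all parameters made up)."""
    return E_sep + D * ((1.0 - math.exp(-a * (R - R_e))) ** 2 - 1.0)


# At the equilibrium separation the slope vanishes, so T = -E and V = 2E.
T_eq, V_eq = virial_partition(model_curve, 1.4)

# Away from equilibrium the R dE/dR term redistributes T and V.
T_far, V_far = virial_partition(model_curve, 2.0)

print(T_eq, V_eq)
print(T_far, V_far)
```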


A8-2.A Problems


A8-1. Use the methods outlined in this appendix to show that $\bar{V} = \bar{T}$ for any stationary
state of the quantum mechanical harmonic oscillator.


A8-2. Evaluate $\bar{V}$ and $\bar{T}$ with $\psi = (1/\sqrt{\pi})\exp(-r)$ for the Li²⁺ ion. From these,
establish the optimum scale factor and write down the expression for the
normalized optimized $\psi_\eta$ and the optimized energy $\bar{E}_\eta$. Compare these results
with the eigenfunction for Li²⁺.


Appendix 9 


Bra-ket Notation 


Bra-ket, or Dirac, notation is frequently used in the literature because of its economical 
form. Perhaps the best way to learn how this notation is used is by studying its use in 
a few familiar relations and proofs. Accordingly, we have outlined a few of these uses. 
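The applications and subtleties of this notation go considerably beyond the treatment
summarized here.¹


"Usual" notation                                      Dirac notation

$\int \phi_m^*\phi_n\,d\tau$                          $\equiv \langle\phi_m|\phi_n\rangle \equiv \langle m|n\rangle$   (A9-1)
                                                      ($\langle m|$ is the bra, $|n\rangle$ is the ket)

$\int \phi_m^*\hat{A}\phi_n\,d\tau$                   $\equiv \langle\phi_m|\hat{A}|\phi_n\rangle \equiv \langle m|\hat{A}|n\rangle \equiv A_{mn}$   (A9-2)

$[\int \phi_m^*\phi_n\,d\tau]^* = \int \phi_n^*\phi_m\,d\tau$      $\langle\phi_m|\phi_n\rangle^* = \langle\phi_n|\phi_m\rangle$ or $\langle m|n\rangle^* = \langle n|m\rangle$   (A9-3)


For hermitian operator $\hat{A}$:

$\int \phi_m^*\hat{A}\phi_n\,d\tau = \int \phi_n(\hat{A}\phi_m)^*\,d\tau = [\int \phi_n^*\hat{A}\phi_m\,d\tau]^*$      $\langle m|\hat{A}|n\rangle = \langle n|\hat{A}|m\rangle^*$   (A9-4)

Any function $\psi$ can be written as a sum of a complete set of orthonormal functions $\phi$:

$\psi = \sum_m c_m\phi_m$                                                     $|\psi\rangle = \sum_m c_m|\phi_m\rangle = \sum_m c_m|m\rangle$
$\int \phi_n^*\psi\,d\tau = \sum_m c_m\int \phi_n^*\phi_m\,d\tau = c_n$      $\langle n|\psi\rangle = \sum_m c_m\langle n|m\rangle = c_n$
$c_m = \int \phi_m^*\psi\,d\tau$                                              $c_m = \langle m|\psi\rangle$
$\psi = \sum_m c_m\phi_m = \sum_m \int \phi_m^*\psi\,d\tau\;\phi_m$          $|\psi\rangle = \sum_m \langle m|\psi\rangle|m\rangle = \sum_m |m\rangle\langle m|\psi\rangle$


¹ Strictly speaking, for instance, $\langle\phi_m|\phi_n\rangle$ and $\langle m|n\rangle$ are not identical in meaning. The former refers to specific
functions, $\phi_m$ and $\phi_n$, which represent state vectors in a specific representation. The latter refers to the state vectors
in any representation and hence is a more general expression. Distinctions such as this will not be necessary at
the level of this text.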

Example of Use: Proof that eigenvalues of $\hat{A}$ (hermitian) are real.


$$\hat{A}|m\rangle = a_m|m\rangle \tag{A9-5}$$

$$\langle m|\hat{A}|m\rangle = a_m\langle m|m\rangle, \qquad \langle m|m\rangle \neq 0, \neq \infty \tag{A9-6}$$

$$\langle m|\hat{A}|m\rangle^* = a_m^*\langle m|m\rangle \tag{A9-7}$$


Combining Eqs. A9-4, A9-6, and A9-7, we have


$$(a_m - a_m^*) = 0$$
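
The relations tabulated above can be tried out on finite vectors. In the sketch below a ket is stored as a list of complex components in an orthonormal basis, which is a finite-dimensional stand-in for the functions in the text; the vectors `psi` and `chi`, the hermitian matrix `A`, and the helper names are all made up for illustration.

```python
def braket(m, n):
    """<m|n> for kets stored as lists of components in an orthonormal basis."""
    return sum(complex(a).conjugate() * b for a, b in zip(m, n))


def apply(A, ket):
    """Return the ket A|ket> for a matrix A stored as nested lists."""
    return [sum(A[i][j] * ket[j] for j in range(len(ket))) for i in range(len(A))]


# Orthonormal basis kets |m> and two arbitrary kets.
basis = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
psi = [0.6 + 0.0j, 0.8j, 0.0j]
chi = [0.5j, 0.5 + 0.0j, 0.1 - 0.2j]

# A hermitian matrix: A[i][j] equals the complex conjugate of A[j][i].
A = [[2.0, 1j, 0.0],
     [-1j, 3.0, 0.5],
     [0.0, 0.5, 1.0]]

# Expansion coefficients c_m = <m|psi>, then |psi> = sum_m |m><m|psi>.
c = [braket(m, psi) for m in basis]
rebuilt = [sum(c[m] * basis[m][k] for m in range(3)) for k in range(3)]

# Eq. (A9-3): <chi|psi>* = <psi|chi>.
lhs = braket(chi, psi).conjugate()
rhs = braket(psi, chi)

# For hermitian A, the diagonal element <psi|A|psi> is real.
expectation = braket(psi, apply(A, psi))

print(c, rebuilt)
print(lhs, rhs)
print(expectation)
```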
Values of Some Useful Constants
and Conversion Factors
Appendix 11


Group Theoretical Charts and Tables 


A11-1 Flow Scheme for Group Symbols


[Flow chart. The questions it poses, in order, are:]

1. Is the molecule linear?
   Yes: Is there a reflection plane perpendicular to the axis?
   No: continue.

2. Does the molecule possess six C5 axes?

3. Does the molecule possess four C3 axes?
   Yes: T, Th, Td, O, or Oh. (Examine character tables to choose among these.)

4. Are there one or more proper rotation axes?
   No: Is there a point of inversion?

5. Is there an axis having higher order than any other proper axis?
   Yes: Select this as reference axis Cn.
   No: Of the several axes having highest order, is there one which is geometrically unique?
       If so, select this as reference axis Cn; otherwise choose any of the equivalent axes as Cn.
   (Cn and n now defined)

6. Is there an S2n axis collinear with Cn?
   Yes: Are there any symmetry elements present other than Cn, S2n, or i?

7. Are there n twofold axes perpendicular to Cn?
   No: C branch. Yes: D branch.

8. C branch: Is there a reflection plane (σh) perpendicular to Cn?
   Is there a set of n reflection planes (σv) containing Cn?

9. D branch: Is there a reflection plane (σh) perpendicular to Cn?
   Is there a set of n reflection planes containing Cn and bisecting the n twofold axes?

A11-2 Meaning of Labels for Representations


Symbol      Interpretation

Main Symbol
A           One-dimensional representation symmetric for rotation by 2π/n
            about principal axis. (For C1, Cs, Ci, which have no principal axis,
            this symbol merely means a one-dimensional representation.)
B           One-dimensional representation but antisymmetric for rotation by
            2π/n about principal axis
E           Two-dimensional representation
T (or F)    Three-dimensional representation
G           Four-dimensional representation

Subscripts
1           Symmetric for perpendicular C2 rotations (or else σv or σd reflections)
2           Antisymmetric for perpendicular C2 rotations (or else σv or σd reflections)
g           Symmetric for inversion
u           Antisymmetric for inversion

Superscripts
′           Symmetric for σh reflection
″           Antisymmetric for σh reflection


A11-3 Character Tables


A11-3.A Special High-Symmetry Groups: C∞v, D∞h, I, Ih, T, Th, Td, O, Oh

C∞v      | E    2C∞(Φ)     ...   ∞σv |                     |
A1 ≡ Σ+  | 1    1          ...    1  | z                   | x²+y², z²
A2 ≡ Σ-  | 1    1          ...   -1  | Rz                  |
E1 ≡ Π   | 2    2cosΦ      ...    0  | (x, y); (Rx, Ry)    | (xz, yz)
E2 ≡ Δ   | 2    2cos2Φ     ...    0  |                     | (x²-y², xy)
E3 ≡ Φ   | 2    2cos3Φ     ...    0  |                     |
...

D∞h   | E    2C∞(Φ)    ...   ∞σv    i    2S∞(Φ)     ...   ∞C2 |            |
Σg+   | 1    1         ...    1     1    1          ...    1  |            | x²+y², z²
Σg-   | 1    1         ...   -1     1    1          ...   -1  | Rz         |
Πg    | 2    2cosΦ     ...    0     2   -2cosΦ      ...    0  | (Rx, Ry)   | (xz, yz)
Δg    | 2    2cos2Φ    ...    0     2    2cos2Φ     ...    0  |            | (x²-y², xy)
...
Σu+   | 1    1         ...    1    -1   -1          ...   -1  | z          |
Σu-   | 1    1         ...   -1    -1   -1          ...    1  |            |
Πu    | 2    2cosΦ     ...    0    -2    2cosΦ      ...    0  | (x, y)     |
Δu    | 2    2cos2Φ    ...    0    -2   -2cos2Φ     ...    0  |            |
...

I    | E    12C5         12C5²        20C3   15C2 |                          |
A    | 1    1            1            1      1    |                          | x²+y²+z²
T1   | 3    ½(1+√5)      ½(1-√5)      0     -1    | (x, y, z); (Rx, Ry, Rz)  |
T2   | 3    ½(1-√5)      ½(1+√5)      0     -1    |                          |
G    | 4   -1           -1            1      0    |                          |
H    | 5    0            0           -1      1    |                          | (2z²-x²-y², x²-y², xy, yz, zx)


Ih   | E    12C5       12C5²      20C3   15C2    i    12S10       12S10³      20S6   15σ |                |
Ag   | 1    1          1          1      1       1    1           1           1      1   |                | x²+y²+z²
T1g  | 3    ½(1+√5)    ½(1-√5)    0     -1       3    ½(1-√5)     ½(1+√5)     0     -1   | (Rx, Ry, Rz)   |
T2g  | 3    ½(1-√5)    ½(1+√5)    0     -1       3    ½(1+√5)     ½(1-√5)     0     -1   |                |
Gg   | 4   -1         -1          1      0       4   -1          -1           1      0   |                |
Hg   | 5    0          0         -1      1       5    0           0          -1      1   |                | (2z²-x²-y², x²-y², xy, yz, zx)
Au   | 1    1          1          1      1      -1   -1          -1          -1     -1   |                |
T1u  | 3    ½(1+√5)    ½(1-√5)    0     -1      -3   -½(1-√5)    -½(1+√5)     0      1   | (x, y, z)      |
T2u  | 3    ½(1-√5)    ½(1+√5)    0     -1      -3   -½(1+√5)    -½(1-√5)     0      1   |                |
Gu   | 4   -1         -1          1      0      -4    1           1          -1      0   |                |
Hu   | 5    0          0         -1      1      -5    0           0           1     -1   |                |

T    | E    4C3   4C3²   3C2 |                          | ε = exp(2πi/3)
A    | 1    1     1      1   |                          | x²+y²+z²
E    | 1    ε     ε*     1   |                          | (2z²-x²-y², x²-y²)
     | 1    ε*    ε      1   |                          |
T    | 3    0     0     -1   | (Rx, Ry, Rz); (x, y, z)  | (xy, xz, yz)


Th   | E    4C3   4C3²   3C2    i    4S6    4S6⁵   3σh |                | ε = exp(2πi/3)
Ag   | 1    1     1      1      1    1      1      1   |                | x²+y²+z²
Au   | 1    1     1      1     -1   -1     -1     -1   |                |
Eg   | 1    ε     ε*     1      1    ε      ε*     1   |                | (2z²-x²-y², x²-y²)
     | 1    ε*    ε      1      1    ε*     ε      1   |                |
Eu   | 1    ε     ε*     1     -1   -ε     -ε*    -1   |                |
     | 1    ε*    ε      1     -1   -ε*    -ε     -1   |                |
Tg   | 3    0     0     -1      3    0      0     -1   | (Rx, Ry, Rz)   | (xz, yz, xy)
Tu   | 3    0     0     -1     -3    0      0      1   | (x, y, z)      |
Td  | E   8C3  3C2  6S4  6σd |
A1  | 1   1    1    1    1   |              | x² + y² + z²
A2  | 1   1    1   −1   −1   |              |
E   | 2  −1    2    0    0   |              | (2z² − x² − y², x² − y²)
T1  | 3   0   −1    1   −1   | (Rx, Ry, Rz) |
T2  | 3   0   −1   −1    1   | (x, y, z)    | (xy, xz, yz)
O   | E   6C4  3C2(= C4²)  8C3  6C2 |
A1  | 1   1    1           1    1   |                         | x² + y² + z²
A2  | 1  −1    1           1   −1   |                         |
E   | 2   0    2          −1    0   |                         | (2z² − x² − y², x² − y²)
T1  | 3   1   −1           0   −1   | (Rx, Ry, Rz); (x, y, z) |
T2  | 3  −1   −1           0    1   |                         | (xy, xz, yz)
Oh   | E   8C3  6C2  6C4  3C2(= C4²)   i    6S4  8S6  3σh  6σd |
A1g  | 1   1    1    1    1            1    1    1    1    1   |              | x² + y² + z²
A2g  | 1   1   −1   −1    1            1   −1    1    1   −1   |              |
Eg   | 2  −1    0    0    2            2    0   −1    2    0   |              | (2z² − x² − y², x² − y²)
T1g  | 3   0   −1    1   −1            3    1    0   −1   −1   | (Rx, Ry, Rz) |
T2g  | 3   0    1   −1   −1            3   −1    0   −1    1   |              | (xz, yz, xy)
A1u  | 1   1    1    1    1           −1   −1   −1   −1   −1   |              |
A2u  | 1   1   −1   −1    1           −1    1   −1   −1    1   |              |
Eu   | 2  −1    0    0    2           −2    0    1   −2    0   |              |
T1u  | 3   0   −1    1   −1           −3   −1    0    1    1   | (x, y, z)    |
T2u  | 3   0    1   −1   −1           −3    1    0    1   −1   |              |
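A quick consistency check on a table of this kind is row orthogonality: weighted by class size, each irreducible character should have squared norm equal to the group order, and distinct rows should be orthogonal. The sketch below applies that check to the Oh characters as transcribed here; the dictionary layout and the helper name `inner` are illustrative choices, not notation from the text.

```python
from itertools import combinations

# Class sizes in the column order E, 8C3, 6C2, 6C4, 3C2(=C4^2), i, 6S4, 8S6, 3σh, 6σd.
class_sizes = [1, 8, 6, 6, 3, 1, 6, 8, 3, 6]

# Characters of the ten irreducible representations of Oh (same column order).
characters = {
    "A1g": [1, 1, 1, 1, 1, 1, 1, 1, 1, 1],
    "A2g": [1, 1, -1, -1, 1, 1, -1, 1, 1, -1],
    "Eg":  [2, -1, 0, 0, 2, 2, 0, -1, 2, 0],
    "T1g": [3, 0, -1, 1, -1, 3, 1, 0, -1, -1],
    "T2g": [3, 0, 1, -1, -1, 3, -1, 0, -1, 1],
    "A1u": [1, 1, 1, 1, 1, -1, -1, -1, -1, -1],
    "A2u": [1, 1, -1, -1, 1, -1, 1, -1, -1, 1],
    "Eu":  [2, -1, 0, 0, 2, -2, 0, 1, -2, 0],
    "T1u": [3, 0, -1, 1, -1, -3, -1, 0, 1, 1],
    "T2u": [3, 0, 1, -1, -1, -3, 1, 0, 1, -1],
}

group_order = sum(class_sizes)


def inner(row_a, row_b):
    """Class-size-weighted inner product of two (real) character rows."""
    return sum(g * a * b for g, a, b in zip(class_sizes, row_a, row_b))


# Each row should have squared norm equal to the group order ...
norms_ok = all(inner(row, row) == group_order for row in characters.values())

# ... and every pair of different rows should be orthogonal.
orthogonal_ok = all(
    inner(characters[a], characters[b]) == 0
    for a, b in combinations(characters, 2)
)

print(group_order, norms_ok, orthogonal_ok)
```

The same check works for any table in this appendix whose characters are real; for the cyclic groups with complex ε entries, one row of each product would need to be complex-conjugated.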
A11-3.B Groups with No Axis of Symmetry: C1, Ci, Cs

C1 | E
A  | 1

Cs  | E   σh |             |
A′  | 1   1  | x, y, Rz    | x², y², z², xy
A″  | 1  −1  | z, Rx, Ry   | yz, xz

Ci  | E   i  |             |
Ag  | 1   1  | Rx, Ry, Rz  | x², y², z², xy, xz, yz
Au  | 1  −1  | x, y, z     |
A11-3.C The S2n Groups

S4 | E   S4   C2   S4³ |
A  | 1   1    1    1   | Rz                | x² + y², z²
B  | 1  −1    1   −1   | z                 | x² − y², xy
E  | 1   i   −1   −i   | (x, y); (Rx, Ry)  | (xz, yz)
   | 1  −i   −1    i   |                   |

S6  | E   C3   C3²   i    S6⁵   S6  | ε = exp(2πi/3)
Ag  | 1   1    1     1    1     1   | Rz        | x² + y², z²
Eg  | 1   ε    ε*    1    ε     ε*  | (Rx, Ry)  | (x² − y², xy); (xz, yz)
    | 1   ε*   ε     1    ε*    ε   |           |
Au  | 1   1    1    −1   −1    −1   | z         |
Eu  | 1   ε    ε*   −1   −ε    −ε*  | (x, y)    |
    | 1   ε*   ε    −1   −ε*   −ε   |           |
S8  | E   S8    C4   S8³   C2   S8⁵   C4³   S8⁷ | ε = exp(2πi/8)
A   | 1   1     1    1     1    1     1     1   | Rz                | x² + y², z²
B   | 1  −1     1   −1     1   −1     1    −1   | z                 |
E1  | 1   ε     i   −ε*   −1   −ε    −i     ε*  | (x, y); (Rx, Ry)  |
    | 1   ε*   −i   −ε    −1   −ε*    i     ε   |                   |
E2  | 1   i    −1   −i     1    i    −1    −i   |                   | (x² − y², xy)
    | 1  −i    −1    i     1   −i    −1     i   |                   |
E3  | 1  −ε*   −i    ε    −1    ε*    i    −ε   |                   | (xz, yz)
    | 1  −ε     i    ε*   −1    ε    −i    −ε*  |                   |
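A11-3.D The Cn Groups

C2 | E   C2 |
A  | 1   1  | z, Rz          | x², y², z², xy
B  | 1  −1  | x, y, Rx, Ry   | yz, xz

C3 | E   C3   C3² | ε = exp(2πi/3)
A  | 1   1    1   | z, Rz              | x² + y², z²
E  | 1   ε    ε*  | (x, y); (Rx, Ry)   | (x² − y², xy); (yz, xz)
   | 1   ε*   ε   |                    |

C4 | E   C4   C2   C4³ |
A  | 1   1    1    1   | z, Rz              | x² + y², z²
B  | 1  −1    1   −1   |                    | x² − y², xy
E  | 1   i   −1   −i   | (x, y); (Rx, Ry)   | (yz, xz)
   | 1  −i   −1    i   |                    |

C5  | E   C5    C5²    C5³    C5⁴  | ε = exp(2πi/5)
A   | 1   1     1      1      1    | z, Rz              | x² + y², z²
E1  | 1   ε     ε²     ε²*    ε*   | (x, y); (Rx, Ry)   | (yz, xz)
    | 1   ε*    ε²*    ε²     ε    |                    |
E2  | 1   ε²    ε*     ε      ε²*  |                    | (x² − y², xy)
    | 1   ε²*   ε      ε*     ε²   |                    |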
C6  | E   C6    C3    C2   C3²   C6⁵ | ε = exp(2πi/6)
A   | 1   1     1     1    1     1   | z, Rz              | x² + y², z²
B   | 1  −1     1    −1    1    −1   |                    |
E1  | 1   ε    −ε*   −1   −ε     ε*  | (x, y); (Rx, Ry)   | (xz, yz)
    | 1   ε*   −ε    −1   −ε*    ε   |                    |
E2  | 1  −ε*   −ε     1   −ε*   −ε   |                    | (x² − y², xy)
    | 1  −ε    −ε*    1   −ε    −ε*  |                    |

C7  | E   C7    C7²    C7³    C7⁴    C7⁵    C7⁶  | ε = exp(2πi/7)
A   | 1   1     1      1      1      1      1    | z, Rz              | x² + y², z²
E1  | 1   ε     ε²     ε³     ε³*    ε²*    ε*   | (x, y); (Rx, Ry)   | (xz, yz)
    | 1   ε*    ε²*    ε³*    ε³     ε²     ε    |                    |
E2  | 1   ε²    ε³*    ε*     ε      ε³     ε²*  |                    | (x² − y², xy)
    | 1   ε²*   ε³     ε      ε*     ε³*    ε²   |                    |
E3  | 1   ε³    ε*     ε²     ε²*    ε      ε³*  |                    |
    | 1   ε³*   ε      ε²*    ε²     ε*     ε³   |                    |

C8  | E   C8    C4   C2   C4³   C8³   C8⁵   C8⁷ | ε = exp(2πi/8)
A   | 1   1     1    1    1     1     1     1   | z, Rz              | x² + y², z²
B   | 1  −1     1    1    1    −1    −1    −1   |                    |
E1  | 1   ε     i   −1   −i    −ε*   −ε     ε*  | (x, y); (Rx, Ry)   | (xz, yz)
    | 1   ε*   −i   −1    i    −ε    −ε*    ε   |                    |
E2  | 1   i    −1    1   −1    −i     i    −i   |                    | (x² − y², xy)
    | 1  −i    −1    1   −1     i    −i     i   |                    |
E3  | 1  −ε     i   −1   −i     ε*    ε    −ε*  |                    |
    | 1  −ε*   −i   −1    i     ε     ε*   −ε   |                    |
A11-3.E The Cnv Groups

C2v | E   C2   σv(xz)  σv′(yz) |
A1  | 1   1    1       1       | z       | x², y², z²
A2  | 1   1   −1      −1       | Rz      | xy
B1  | 1  −1    1      −1       | x, Ry   | xz
B2  | 1  −1   −1       1       | y, Rx   | yz

C3v | E   2C3  3σv |
A1  | 1   1    1   | z                  | x² + y², z²
A2  | 1   1   −1   | Rz                 |
E   | 2  −1    0   | (x, y); (Rx, Ry)   | (x² − y², xy); (xz, yz)
C4v | E   2C4  C2   2σv  2σd |
A1  | 1   1    1    1    1   | z                  | x² + y², z²
A2  | 1   1    1   −1   −1   | Rz                 |
B1  | 1  −1    1    1   −1   |                    | x² − y²
B2  | 1  −1    1   −1    1   |                    | xy
E   | 2   0   −2    0    0   | (x, y); (Rx, Ry)   | (xz, yz)

C5v | E   2C5         2C5²        5σv |
A1  | 1   1           1           1   | z                  | x² + y², z²
A2  | 1   1           1          −1   | Rz                 |
E1  | 2   2cos 72°    2cos 144°   0   | (x, y); (Rx, Ry)   | (xz, yz)
E2  | 2   2cos 144°   2cos 72°    0   |                    | (x² − y², xy)

C6v | E   2C6  2C3  C2   3σv  3σd |
A1  | 1   1    1    1    1    1   | z                  | x² + y², z²
A2  | 1   1    1    1   −1   −1   | Rz                 |
B1  | 1  −1    1   −1    1   −1   |                    |
B2  | 1  −1    1   −1   −1    1   |                    |
E1  | 2   1   −1   −2    0    0   | (x, y); (Rx, Ry)   | (xz, yz)
E2  | 2  −1   −1    2    0    0   |                    | (x² − y², xy)
A11-3.F The Cnh Groups

C2h | E   C2   i    σh |
Ag  | 1   1    1    1  | Rz       | x², y², z², xy
Bg  | 1  −1    1   −1  | Rx, Ry   | xz, yz
Au  | 1   1   −1   −1  | z        |
Bu  | 1  −1   −1    1  | x, y     |

C3h | E   C3   C3²   σh   S3    S3⁵  | ε = exp(2πi/3)
A′  | 1   1    1     1    1     1    | Rz        | x² + y², z²
E′  | 1   ε    ε*    1    ε     ε*   | (x, y)    | (x² − y², xy)
    | 1   ε*   ε     1    ε*    ε    |           |
A″  | 1   1    1    −1   −1    −1    | z         |
E″  | 1   ε    ε*   −1   −ε    −ε*   | (Rx, Ry)  | (xz, yz)
    | 1   ε*   ε    −1   −ε*   −ε    |           |
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Oh S4 


S4? 


Cah 


e =exp(27i/5) 


C3? Cs? C54 Oh Ss S57 S5? Ss? 


E Cs 


Csh 


= 
^h a 
M BN 
a | 
hu E 
A SS 
ba 
EN 
n 
e X 


— — — — 


-— 


A" 


(Rx, Ry) | Œz, yz) 


€ =exp(27i/6) 


S3 


S6 


S35 Sg on 


i 


E C6 C3 Co C3? Cg? 


Con 


x24 yz 


(x? -= y^ xy) 
A11-3.G The Dn Groups

D2  | E   C2(z)  C2(y)  C2(x) |
A   | 1   1      1      1     |         | x², y², z²
B1  | 1   1     −1     −1     | z, Rz   | xy
B2  | 1  −1      1     −1     | y, Ry   | xz
B3  | 1  −1     −1      1     | x, Rx   | yz

D3  | E   2C3  3C2 |
A1  | 1   1    1   |                    | x² + y², z²
A2  | 1   1   −1   | z, Rz              |
E   | 2  −1    0   | (x, y); (Rx, Ry)   | (x² − y², xy); (xz, yz)

D4  | E   2C4  C2(= C4²)  2C2′  2C2″ |
A1  | 1   1    1          1     1    |                    | x² + y², z²
A2  | 1   1    1         −1    −1    | z, Rz              |
B1  | 1  −1    1          1    −1    |                    | x² − y²
B2  | 1  −1    1         −1     1    |                    | xy
E   | 2   0   −2          0     0    | (x, y); (Rx, Ry)   | (xz, yz)

D5  | E   2C5         2C5²        5C2 |
A1  | 1   1           1           1   |                    | x² + y², z²
A2  | 1   1           1          −1   | z, Rz              |
E1  | 2   2cos 72°    2cos 144°   0   | (x, y); (Rx, Ry)   | (xz, yz)
E2  | 2   2cos 144°   2cos 72°    0   |                    | (x² − y², xy)

D6  | E   2C6  2C3  C2   3C2′  3C2″ |
A1  | 1   1    1    1    1     1    |                    | x² + y², z²
A2  | 1   1    1    1   −1    −1    | z, Rz              |
B1  | 1  −1    1   −1    1    −1    |                    |
B2  | 1  −1    1   −1   −1     1    |                    |
E1  | 2   1   −1   −2    0     0    | (x, y); (Rx, Ry)   | (xz, yz)
E2  | 2  −1   −1    2    0     0    |                    | (x² − y², xy)
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Ne Ree 


R; 


Z 


(x, y); (Rx, Ry) 


xc septa 


x? ME y 
xy 
(xz, yz) 


N- -Nem 


Rz 
(Rx, Ry) 


Corr ORF m. 


Z 
(x, y) 


x1 yg 


(x? — y*, xy), (xz, yz) 


NNN RRR eR 


E 


2S}, 


x2 X y, z2 


(x, y) 
(x? s y^, xy) 


(Ry, Ry) | (xz, yz) 


1 
1 
22 
Pape 
1 
1 
2 
2 


cos 72° 


2 cos 72? 
2 cos 144? 2 cos 72? 


1 1 
1 1 


2 cos 144° 
cos 144° 2 cos 72° 


2 cos 144° 


-1 


1 
1 
2 cos 72° 
2cos 144? 
—1 


—1 
2cos144° 0 
2cos72° 0 

—1 —1 
—1 —1 -1 1 
0 —2 -—2cos72? —2cosl44? 0 
0 —2 -—2cosl44? —2cos72? 0 


x? + y, 2 
Rz 
(Rx, Ry) | Œz, yz) 


Ge? -hxy) 


Z 


(x, y) 


by 


28? 


C2 
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ron 
Q 
B 
[e 
2 


NNNNNR RR Ke 


The D,4 Groups 


Pe Ds M aO 
Oo 


Co(2 Co(y Cx) 


a 


-. 


NNNNNRR Re 


x^ Ly 


Rz 


oo oocoOCorR Fe eS FE 


Z 


(x, y) 
2 


(x? — 2, xy) 


[«» Men» Me» em M r Fe FE 


(Rx, Ry) | (xz, yz) 


o(xy) o(xz) o(yz 


pt ee ee a ai 


x?, y^ z 


— Ry | xz 
Ry | yz 


Se ea i —o- 


x2 X y, z2 
R- 
(x, y) (x? — y^, xy) 


Ne RB NKR 


Z 
(Rx, Ry) (xz, yz) 


Corre ORF eK 
D4h  | E   2C4  C2   2C2′  2C2″   i    2S4  σh   2σv  2σd |
A1g  | 1   1    1    1     1      1    1    1    1    1   |           | x² + y², z²
A2g  | 1   1    1   −1    −1      1    1    1   −1   −1   | Rz        |
B1g  | 1  −1    1    1    −1      1   −1    1    1   −1   |           | x² − y²
B2g  | 1  −1    1   −1     1      1   −1    1   −1    1   |           | xy
Eg   | 2   0   −2    0     0      2    0   −2    0    0   | (Rx, Ry)  | (xz, yz)
A1u  | 1   1    1    1     1     −1   −1   −1   −1   −1   |           |
A2u  | 1   1    1   −1    −1     −1   −1   −1    1    1   | z         |
B1u  | 1  −1    1    1    −1     −1    1   −1   −1    1   |           |
B2u  | 1  −1    1   −1     1     −1    1   −1    1   −1   |           |
Eu   | 2   0   −2    0     0     −2    0    2    0    0   | (x, y)    |

D5h  | E   2C5         2C5²        5C2   σh   2S5          2S5³         5σv |
A1′  | 1   1           1           1     1    1            1            1   |           | x² + y², z²
A2′  | 1   1           1          −1     1    1            1           −1   | Rz        |
E1′  | 2   2cos 72°    2cos 144°   0     2    2cos 72°     2cos 144°    0   | (x, y)    |
E2′  | 2   2cos 144°   2cos 72°    0     2    2cos 144°    2cos 72°     0   |           | (x² − y², xy)
A1″  | 1   1           1           1    −1   −1           −1           −1   |           |
A2″  | 1   1           1          −1    −1   −1           −1            1   | z         |
E1″  | 2   2cos 72°    2cos 144°   0    −2   −2cos 72°    −2cos 144°    0   | (Rx, Ry)  | (xz, yz)
E2″  | 2   2cos 144°   2cos 72°    0    −2   −2cos 144°   −2cos 72°     0   |           |

D6h  | E   2C6  2C3  C2   3C2′  3C2″   i    2S3  2S6  σh   3σd  3σv |
A1g  | 1   1    1    1    1     1      1    1    1    1    1    1   |           | x² + y², z²
A2g  | 1   1    1    1   −1    −1      1    1    1    1   −1   −1   | Rz        |
B1g  | 1  −1    1   −1    1    −1      1   −1    1   −1    1   −1   |           |
B2g  | 1  −1    1   −1   −1     1      1   −1    1   −1   −1    1   |           |
E1g  | 2   1   −1   −2    0     0      2    1   −1   −2    0    0   | (Rx, Ry)  | (xz, yz)
E2g  | 2  −1   −1    2    0     0      2   −1   −1    2    0    0   |           | (x² − y², xy)
A1u  | 1   1    1    1    1     1     −1   −1   −1   −1   −1   −1   |           |
A2u  | 1   1    1    1   −1    −1     −1   −1   −1   −1    1    1   | z         |
B1u  | 1  −1    1   −1    1    −1     −1    1   −1    1   −1    1   |           |
B2u  | 1  −1    1   −1   −1     1     −1    1   −1    1    1   −1   |           |
E1u  | 2   1   −1   −2    0     0     −2   −1    1    2    0    0   | (x, y)    |
E2u  | 2  −1   −1    2    0     0     −2    1    1   −2    0    0   |           |
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—2 
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2 


cocoons 


—1 
-v2 
0 

2 
1 
1 


C =- = 


(x, y) 
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Hints for Solving Selected Problems 


Chapter 1 
1-1. Use Eq. (1-25). 
1-7. PE(t) = −∫₀^Ψ(x,t) m[∂²Ψ(x, t)/∂t²] dΨ(x, t) = ½mω²Ψ²(x, t).
Next integrate PE(t) over one complete cycle (0 → t′).
Chapter 2 


2-9. sin x sin y = ½[cos(x − y) − cos(x + y)].


2-10. What kind of function has λ_II = ∞? When could such a function join smoothly
onto a sine function in region I?


Chapter 3 


3-1. Imagine an auto runs from A to B at 30 mph and from B to C at 60 mph. Sketch 
the distribution function for the auto. Then reason how you arrived at this function 
and apply similar reasoning to the harmonic oscillator. 

3-6. a) Seek points where ψ₂(y) equals zero.
b) Seek places where dψ²/dy = 0; then evaluate ψ² at these points and compare.

3-15. ∫₁^∞ exp(−y²)dy ≈ 0.10[exp(−1.05²) + exp(−1.15²) + exp(−1.25²) + ···]
to convergence.


3-19. Consider how the solutions for the harmonic oscillator would meet the conditions 
imposed by this new potential. 
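
The sum suggested in the hint for Problem 3-15 is a midpoint rule with step 0.10 starting at y = 1. A minimal Python sketch of that sum follows; the function name, the stopping tolerance, and the comparison against the complementary error function are illustrative additions, not part of the hint itself.

```python
import math


def midpoint_tail_integral(lower=1.0, step=0.10, tol=1e-15):
    """Midpoint-rule estimate of the integral of exp(-y^2) from `lower` to infinity.

    Terms are added at y = lower + step/2, lower + 3*step/2, ... until a term
    falls below `tol`, which is what "to convergence" is taken to mean here.
    """
    total = 0.0
    y = lower + step / 2.0
    while True:
        term = math.exp(-y * y)
        total += step * term
        if term < tol:
            return total
        y += step


tail = midpoint_tail_integral()
# Probability of finding the oscillator beyond the classical turning points
# (both sides), using the normalization factor 2/sqrt(pi).
probability = 2.0 / math.sqrt(math.pi) * tail
# Reference value: the same quantity is erfc(1).
reference = math.erfc(1.0)
print(round(probability, 3), round(reference, 3))
```

The coarse step of 0.10 already agrees with erfc(1) to about three decimal places, which is why the hand summation proposed in the hint is practical.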
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Hints for Solving Selected Problems 


Chapter 4

4-3. The classical turning point occurs when the total energy equals the potential
energy.

4-11. E = V(r) when r equals classical turning point.

4-12. Ignore all but the θ and φ dependences in Eq. (4-30). Do not forget to square
these dependences, and do not forget to include the 3^(1/2) term of 3d_z².

4-17. x = r sin θ cos φ.

4-18. a) Do not forget to include θ dependence of dv.

4-26. Refer to Problem 2-11.

4-37. μ should be in units of kg/molecule, and you have masses in a.m.u. These can
be taken directly as g/mole and then converted.


Chapter 5 


5-4. Square ψ and integrate, using the fact that 1s, 2s are orthonormal.


Chapter 6 


6-9. b) ψ*ψ must be same at t = 0 and t = 1/ν.

6-11. Use the Schmidt orthogonalization method.

6-13. a) Means: is ψ an eigenfunction of the momentum operator?

6-18. Wavefunction is not normalized.

6-24. Use that exp(ikx) = cos(kx) + i sin(kx).


Chapter 7 


7-4. 


7-11. 
7-13. 


7-16. 


7-17. 


a) Note that, for symmetric wz, f y ódx =2 n ^y,ódx. For antisymmet- 


ric Yn, the integral can be evaluated by inspection. A useful integral is 
f x sinx dx =sinx — x cos x. 

c) Use the fact that E — X, aad The series can be estimated with a small 
calculator (tedious) or else by summing the first few terms and integrating 
over a function that envelopes the higher terms. 


Use the fact that @ = X;c;y; and E = Lich c; Ej. 
b) Take limit as F — 0 rather than simply evaluating at F — 0.1. Note that 
(14- x)" =1+mx+[m(m — D/21] x? +. 


Do not forget that overlap between ha and pp must enter normality condition: 
e + c? +2cacpS = 1 


f? (—1/ra)dv = 1+ (£ + D)exp(—2£) using the method of Appendix 3. 
7-19. Note that ψ₊ and ψ₋ are degenerate at R = ∞.


7-20. In the limit of R → 0, H_aa is indeterminate. Use l'Hôpital's rule (i.e., take
d/dR on the numerator and denominator and evaluate at R = 0). Alternatively,
you can expand exp(−2R) in powers of R and evaluate at R → 0.


7-27. Note that (f) and (g) have both AOs on center a. When H is present in the
integral, you are restricted to considering symmetry operations that do not affect H.


7-34. Use the Schmidt orthogonalization procedure (Chapter 6) to construct 2s’. 


Chapter 8 


8-15. b) Simply note where HOMO is bonding, antibonding, nonbonding, and rec- 
ognize that some of the effect of this MO will be lost upon ionization. 


Chapter 12 


12-2. Recall that perturbation should be greatest for states with ψ² largest in region
of perturbation.


12-7. To evaluate the first-order correction to the energy, you can recognize that
⟨1s|−1/r|1s⟩ is identical to the potential energy of the H atom. The virial
theorem tells you the value of this quantity at once.


12-16. Use the fact that E; = x MOsj; +28 ZMOs y peighbors p 


Chapter 13 


13-9. Notice which MOs are degenerate when assigning symmetries. For the final 
part of the question, notice that the molecular x axis corresponds to the group 
theoretic z axis. 


Chapter 14 


14-13. Do not forget that the cyclobutadiene molecule differs from two acetylenes in
both π and σ systems. There are a total of eight orbitals to be sketched for each
side of this reaction.


Chapter 15 


15-2. The curve in Fig. 15-2b is proportional to the reciprocal of the slope of the curve 
in Fig. 15-2a. 


15-3. It is helpful to use that exp(inπ) = 1 for even n, −1 for odd n, and that
exp[i(a + b)] = exp(ia) exp(ib).


15-4. Remember that E is degenerate due to states at negative j. 
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Answers to Problems! 


Chapter 1 


1-1. [See Hint.] (A + C + D) cos(kx) + (B + iC − iD) sin(kx).

1-2. Ψ(x) = same as Eq. (1-32) except cos instead of sin when n is odd.

1-3. a = B? ORAS

1-4. Work functions: Cs = 1.9 eV, Zn = 3.7 eV. h = 4.13 × 10⁻¹⁵ eV s.

1-5. a) 0.055 nm. b) 3.31 x 10-2 nm.

1-6. Argument of cosine must change by 2π. Cycle time = 2π/ω.

1-7. [See Hint.] Integrating PE(t) = ½mω²Ψ²(x, t) over a cycle and dividing by t′
to give average PE per unit time gives PE = mω²ψ²(x)/4. [m is really ρ dx.] An
identical result comes from integrating KE(t) = ½mω²ψ²(x) sin²(ωt) over the
same cycle.


. a) No. Becomes infinite at x = ±∞. b) Same as a). c) Yes. d) No. Becomes
infinite at x = −∞. e) Yes.


. ψ = sin x or cos x are examples.


. We need establish only one of the extreme profiles for the string. Then, as cos(ωt)
oscillates between +1 and −1, the string oscillates between the two extremes. In
other words, sin(x) and −sin(x) are the same solution at different times. (They
differ by a phase factor.)


. Only c), d) and f). [The latter is most easily seen after recognizing that the
function equals exp(4ix).] For c), the eigenvalue is zero.


Multiple Choice (MC): dcee b 
Chapter 2 


2-1. J² s² kg⁻¹ m⁻² = (kg m² s⁻²)² s² kg⁻¹ m⁻² = kg m² s⁻² = J.

2-2. A = [∫₀^L sin²(nπx/L)dx]^(−1/2) = [(L/nπ)∫₀^(nπ) sin² y dy]^(−1/2)
= [(L/nπ)(nπ/2)]^(−1/2) = √(2/L).


1 Hints for some of these are given in Appendix 12.


2-3. Recognize that, since sin² + cos² = 1, ∫₀^L (sin² + cos²)dx is the area of a rectangle
of height 1 and width L, i.e., has area L. Half of this area goes with sin². For
cases where n > 1, there are more wiggles, but the value of the integral over sin²
still equals L/2.

2-4. 0.6090, 0.1955, 1/3, 1/3.

2-5. a) Use height times width of this narrow rectangle in ψ². Height =
[√(2/L) sin(0.5πL/L)]² = (2/L) sin²(π/2) = 2/L. Width = 0.01L. Area =
0.02. This is 2% of the probability density in 1% of the box width. That is
twice the classical probability, which is uniform in the box.
b) (2/L) sin²(π/3) times 0.01L = 0.015. 1.5 times classical.

2-6. a) 1/5. (There are five equal hills in ψ₅², and 0 ≤ x ≤ L/5 contains one of them.)
b) Smaller. ψ₁² (Fig. 2-5) is small in this region (less than 1/L in value) and
integrates to less than 1/5.

2-7. a) S. b) −A. c) SS. d) AA. e) −AS. f) AASASSA. g) −AASASAA.
Rule: Product antisymmetric when odd number of antisymmetric functions is
present.

2-8. All are zero by symmetry except d), e), h). i) is integral of 2 sin² x cos x which is
SA. j) is integral of sin² x cos x which is SS from −π to π, but SA in each half
of the range (i.e., like i). If each half must give zero, so must the whole.

2-9. [See Hint.] (2/L)∫₀^L sin(nπx/L) sin(mπx/L)dx = (using Hint)
(1/π)∫₀^π {cos[(n − m)y] − cos[(n + m)y]}dy = 0, for n and m integers and n ≠ m.

2-10. [See Hint.] λ_II = ∞, and so ψ_II is a constant. A constant has a zero derivative,
and ψ_I must arrive at x = L with zero derivative if successful junction is to be
made. This requires that an odd number of quarter-waves fit between 0 and L
(so wave arrives with either a peak or a valley at x = L). This requires that
[(2n + 1)/4]λ_I = L; λ_I = 4L/(2n + 1) = h/√(2mU); U = (2n + 1)²h²/32mL²
is the relation between U and L that is required for a state to exist at E = U.
(Strictly speaking, one can only approach λ = ∞ as a limit, and this problem is
physically meaningless. However, it makes a good exercise.)

2-11. Given that Hψ₁ = E₁ψ₁, Hψ₂ = E₂ψ₂, E₁ = E₂ = E and φ = c₁ψ₁ + c₂ψ₂.
Then Hφ = c₁Hψ₁ + c₂Hψ₂ = c₁E₁ψ₁ + c₂E₂ψ₂ = E(c₁ψ₁ + c₂ψ₂) = Eφ.
Q.E.D.

2-12. a) ψ should oscillate on right with same λ as on left. b) ψ should be symmetric
or antisymmetric (and λ should be same in each side). c) ψ should be smooth
(i.e., have no cusp) at finite barrier. d) ψ should be a decaying exponential at
right. e) Same as d). ψ should not become infinite.

2-13. a) ψ₄ is two sine waves. [sketch]
b) E₄ is the same as E₂ in a half-box of width L: E₄ = 2²h²/8mL² = h²/2mL².
Or, one can also calculate it as the n = 4 solution with width = 2L.
2-14. For exponential eigenfunctions, ψ*ψ = [exp(ijφ)]*[exp(ijφ)] = exp(−ijφ)
exp(ijφ) = exp(0) = 1. This gives a "rectangle" of height 1 and width 2π, hence
area 2π and normalizing constant 1/√(2π). For sine or cosine functions, we get
a squared function that oscillates from 0 to 1, gives us half the area, namely π,
and yields normalizing constant 1/√π.

2-15. exp(i√2 φ) does not join onto itself when φ → φ + 2π.

2-16. a) H = −(h²/8π²I)d²/dφ². b) Hψ = (9h²/8π²I)ψ. c) [(h/2πi)d/dφ]ψ ≠
constant times ψ. Not a constant of motion.

2-17. The barrier forces solutions to vanish at φ = 0. Of our four choices sin(kφ),
cos(kφ), exp(±ikφ) only the sin(kφ) set has this property. Therefore a) E = 0 is
lost. b) All degenerate levels become nondegenerate. c) No, only sine solutions
exist. d) No, sine solutions are not eigenfunctions of the angular momentum
operator.

2-18. a) E(nx, ny) = (h²/8m)(nx²/Lx² + ny²/Ly²) = (nx² + 4ny²)h²/8mLx².
b) Zero point energy = E₁₁ = 5h²/8mLx².
c)
nx             1   2   3    1    4    2    3    5    4
ny             1   1   1    2    1    2    2    1    2
E/(h²/8mLx²)   5   8   13   17   20   20   25   29   32
degeneracy     1   1   1    1    2 (shared)  1    1    1
d) [sketch]
e) i) All eigenvalues increase by 10 J. ii) No effect on eigenfunctions.

2-19. Both 3h²/4mL². Accidental degeneracy results when nodes that are not equivalent
by symmetry nevertheless give the same energy.

2-20. a) Yes. The two eigenfunctions being mixed are degenerate (5² + 1² + 1² =
3² + 3² + 3²), so the linear combination remains an eigenfunction. (See Problem
2-11.) b) Instead of integrating, we can take the value of ψ²ΔV because
ΔV is small enough to make ψ² essentially constant in it. ψ² = (2/L)³
sin²(π/2) sin²(π/2) sin²(π/2) = 8/L³; ΔV = 0.001L³; ψ²ΔV = 0.008. The
classical value (uniform distribution) is 0.001, so the quantum-mechanical
probability for finding the particle at the center is 8 times greater than classical. (Not
coincidentally, this is the cube of the answer for the one-dimensional analog in
Problem 2-5.)
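
The level ordering and degeneracies tabulated for Problem 2-18 can be checked by brute-force enumeration. The sketch below assumes, as the formula in part a) implies, that energies in units of h²/8mLx² are nx² + 4ny²; the variable names and the quantum-number cutoffs are illustrative.

```python
from collections import defaultdict

# Energy in units of h^2 / (8 m Lx^2) for a rectangular box with Ly = Lx / 2.
levels = defaultdict(list)
for nx in range(1, 8):          # cutoffs are generous enough for the lowest levels
    for ny in range(1, 5):
        levels[nx * nx + 4 * ny * ny].append((nx, ny))

# Lowest eight distinct energies with the states that share them.
lowest = sorted(levels.items())[:8]
for energy, states in lowest:
    print(energy, len(states), states)
```

Only the level at 20 units is doubly degenerate among the lowest eight; it arises from the pair (4, 1) and (2, 2), which are not related by symmetry.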
2-21. ΔE = [(6 + n)² − (5 + n)²]h²/8mL²;
λ = (8mcl²/h)(2n + 10)²/(2n + 11) = 637(2n + 10)²/(2n + 11) Å
n:   0      1      2      3
λ:   5791   7056   8323   9592

2-22. Momentum is a constant of motion if [(h/2πi)d/dx]ψ = constant times ψ.
This applies for b) and d), and the momentum equals 3h/2π and −3h/2π,
respectively. Kinetic energy is equal to k²h²/8π²m, or 9h²/8π²m for all four
cases.

2-23. The z component of angular momentum is a constant of motion if
[(h/2πi)d/dφ]ψ = constant times ψ. This applies for b) and d), and the
angular momentum equals −3h/2π and 3h/2π, respectively. Kinetic energy
equals 9h²/8π²I in all four cases.

2-24. ψ and dψ/dx are real at x = a. (ψ = ψ*) at x = a; A exp(ika) + B exp(−ika) =
A* exp(−ika) + B* exp(ika); so A exp(ika) − A* exp(−ika) = B* exp(ika) −
B exp(−ika). [(dψ/dx) = (dψ/dx)*] at x = a; A exp(ika) + A* exp(−ika) =
B* exp(ika) + B exp(−ika); adding these two equations gives
A exp(ika) = B* exp(ika), so A = B*. Subtracting the two equations gives
A* exp(−ika) = B exp(−ika), so A* = B. (Or we could take the complex
conjugate of the previous equation.) Then, |A|²/|B|² = A*A/B*B = A*A/AA* = 1,
so |A| = |B|.

2-25. Matching values at x = 0 gives A + B = C. Matching slopes gives kA − kB =
k′C. Using the first of these to eliminate C from the second gives the first part
of (2-73). Using the first to eliminate B from the second gives the second part
of (2-73).

2-26. Now A is zero, B = C + D, −kB = k′C − k′D, C/D = (k′ − k)/(k + k′), B/D =
2k′/(k + k′).

2-27. a) |C|²/|A|² gives the relative spatial densities of transmitted to impinging
particles, but we need relative fluxes, which depend on both particle densities
and velocities. The relative velocities are the same as the relative momenta
(since masses are the same), and these are in turn the same as the relative
k values. So k′/k gives the relative velocities of impinging and transmitting
particles. For reflecting particles, the analogous ratio is k/k, so it does not
appear.
b) [(k′/k)|C|²/|A|²] + [|B|²/|A|²] = 4k′k/(k + k′)² + (k − k′)²/(k + k′)² =
(k + k′)²/(k + k′)² = 1.

2-28. 100% transmission occurs when an integral number of de Broglie half-waves fit in
Δx = d; nλ/2 = d; also, λ = h/p = h/√(2mT) = h/√(2m(E − U)). Equating these
expressions for λ: 4d²/n² = h²/[2m(E − U)]; (E/U) − 1 = n²h²/(8d²mU).
Substituting for U on the right: (E/U) − 1 = n²π²/16. When n = 1, E/U =
1.617. When n = 2, E/U = 3.467. (Compare with Fig. 2-18b.)

2-29. ΔE, 0.805 cm⁻¹; ΔE, 36 cm⁻¹.
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Answers to Problems 


MC:cdeaebdbda cddba 
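
The wavelengths tabulated in the answer to Problem 2-21 follow from the closed form quoted there. The short sketch below evaluates it for n = 0 to 3; the prefactor 637 Å is taken directly from that answer, and the function name is an illustrative choice.

```python
PREFACTOR_ANGSTROM = 637.0  # value of 8mcl^2/h quoted in the answer to Problem 2-21


def transition_wavelength(n):
    """Wavelength in angstroms for the (5 + n) -> (6 + n) transition in a box of length (2n + 10) l."""
    return PREFACTOR_ANGSTROM * (2 * n + 10) ** 2 / (2 * n + 11)


wavelength_table = {n: round(transition_wavelength(n)) for n in range(4)}
print(wavelength_table)
```

Because the box length grows as (2n + 10) while the level spacing factor grows only as (2n + 11), the absorption shifts steadily to longer wavelength as the chain lengthens.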
Chapter 3 


3-1. 


3-4. 


3-6. 
3-7. 
3-8. 


3-9. 
3-10. 


3-11. 


3-12. 


[See Hint.] P(x) is proportional to 1/v(x), which is [dx (t) /dt]-!, which is 
[-/k/mL sin /K/mt)] |. This is proportional to 


[s imo] ^ - L - cos! (/K/mp]| " "pest t 


The normalized probability distribution function is (x vL? — x2). 


. a) x(t) — 0.100 cos(A/2t), t in seconds, x in meters. b) —0.0453 m. c) 1.00 x 


10-?J. d) 2.00 x 10? J. (e, f) 5.00 x 107?J. g) 0.126ms-!. h) 40.100 m. 
i) 0.225s7! 


T +[(n 4 12h) Nem] 


a) [V P/n)1/(Q"n D] Le is the normalizing factor. It keeps the total probability 


density equal to one. H,(y) is a Hermite polynomial. It provides the nodes 
in the wavefunction. Exp(— y?/2) forces w to decay at large values of x. It 
gives the correct asymptotic behavior. 


b) po) = V/B/z exp(—y"/2) 
ViG) = JB/n (01/2)2 yexpC- y? /2) 
Va) = J/B/n (01/8) 4y? —2)expC- y?/2) 
Hyo = [- [£^ / 827m) d? /ax? + kx? /2] (8/1) * expC- Bx? /2) 
= [4° B/x^m) — B*h^x? (8x7m) + kx? /2] Wo 
= | (h/4r) km — kx? [2 + kx? /2] vo 
= (h/2)(1/2z) / k/ m)o = (hv/2)Wo. 


[See Hint.] a) y 2 41/2. b) y 45/2. 
The first function is asymmetric; the second becomes infinite in both limits of y. 


a) Approaches zero. Decaying exponential overwhelms polynomial. b) Anti- 
symmetric. Polynomial is antisymmetric and exponential is symmetric. 
c) Value = 0, slope = 120. 


a) co — 7. b) cy =0. 


a) Not, because missing y? term should cause the polynomial to terminate, not 
permit > term. 
b) Not, because of mixed symmetry. (Powers 5, 3, 1, 0.) 


c) OK. 
OO x 


a) Zero (integrand antisymmetric). b) ./7 /2 (symmetric). 
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3-13. Zero (integrand antisymmetric). 


1/2 

3-14. a) xy =+ [34/0 m] =+./3/B. b) V? is maximum at y= 1, or 
x = Xl//B = xh/(x4 mk). c) 1/2. d) 0.0144 [from (2/e)//z times 
0.024/3h/ (2t / km)! ^]. 

3-15. [See Hint.] 2∫ₐ^∞ √(β/π) exp(−βx²)dx = (2/√π)∫₁^∞ exp(−y²)dy ≈ 0.156,
where a = 1/√β.

3-16, 17. H₂(y) = 4y² − 2.

3-18. Zero point energy — 3hv; 10; 3. 

3-19. [See Hint.] The barrier requires y = 0 at x < 0 but does not affect Ê at 
x >0. Therefore, antisymmetric solutions of harmonic oscillator are still 
good. Results are V4.9 = ATO P EHE ues n=1,3,5,...,and E = (n + hv, n= 
1,3,5,...Wx<0 =0. 

3-20. a) 40. b) —1. 

3-21. V5 = Es/2— (141/4)hv 2 T5. 

3-22. For each unit of energy going into vibration, half goes into kinetic energy, which 


registers as a rise in temperature, and half is "hidden" as potential energy. None 
is hidden as potential energy in rotation or translation. 


3-23. Using nominal nuclear masses gives k values of 958, 512, 408, 311 N m⁻¹,
respectively. Decreasing bond stiffness implies decreasing bond strengths.
(Observed D₀ values are, in kJ mol⁻¹, 564, 428, 363, 295, respectively.)


MC:eed,cc 
Chapter 4 
4-1. E₂ − E₁ = 2.46731 × 10¹⁵ Hz using mₑ. Using μ gives 0.999455 times this
value, a difference of 545 ppm.


4-2. The radial distribution function vanishes at r = 0 because 4πr² is zero there and
again at r = ∞ because the decaying exponential of ψ² overwhelms the finite
power term 4πr².


4-3. [See Hint.] E₁s = −½ a.u. V(r) = −1/r a.u. Equal when r = 2 a.u. ∫ψ²dv =
0.238, and so 23.8%.


4-4. a) 1/Z a.u. b) 3/2Z a.u. c) 0 (by inspection of ψ²). d) −Z² a.u. Differs
from −Z times the reciprocal of b) because average of 1/r ≠ 1/(average of r).
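
The 23.8% quoted in the answer to Problem 4-3 is the probability of finding the hydrogen 1s electron beyond the classical turning point r = 2 a.u. A minimal sketch of that calculation follows, assuming the standard normalized 1s density, for which the radial distribution is 4r² exp(−2r) in atomic units; the closed form and the midpoint-rule cross-check are illustrative and are not taken from the text.

```python
import math


def prob_beyond(radius):
    """Closed form of the integral of 4 r^2 exp(-2 r) from `radius` to infinity (a.u.)."""
    return math.exp(-2.0 * radius) * (1.0 + 2.0 * radius + 2.0 * radius ** 2)


def prob_beyond_numeric(radius, upper=40.0, step=1e-3):
    """Midpoint-rule cross-check of the same integral, truncated at `upper`."""
    total = 0.0
    steps = int(round((upper - radius) / step))
    for k in range(steps):
        r = radius + (k + 0.5) * step
        total += 4.0 * r * r * math.exp(-2.0 * r) * step
    return total


analytic = prob_beyond(2.0)          # equals 13 * exp(-4)
numeric = prob_beyond_numeric(2.0)
print(round(analytic, 3), round(numeric, 3))
```

Both routes agree with the 0.238 quoted above, so nearly a quarter of the 1s probability lies in the classically forbidden region.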
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4-5. 
4-6. 
4-7. 


4-8. 


4-9. 
4-10. 
4-11. 
4-12. 


4-13. 
4-14. 


4-15. 


4-16. 


4-17. 


4-18. 


4-19. 
4-20. 


4-21. 


4-22. 


Answers to Problems 


d)is lower because 1/r blows up at small r, contributing large (negative) amounts 
of potential energy. 


(1/4/2) fo. expt r)2 —r)exp( r[2)r?dr —...0. (Do not forget r? from dv.) 


7? is the normalizing factor. 
x= ES wx dx. y? is symmetric, x is antisymmetric, and x — 0. 


a) I Va(x—L /2 y, dx. b) Expect it to increase as n increases, approach- 
ing a limiting value. This because y/? favors the box center for lower states 
and approaches classical (uniform) distribution as n increases. c) L?^(1 /12— 
1/2n?z?), n = 1 : 0.03267 L?, n 2 2:0.070668 L7. Approaching L*/12, which 
is the classical value achieved if Y? in a) is replaced by 1/L. 


For reflection in the x y plane, 2p; is antisymmetric, and 3d, , is symmetric. 
a) 4/Z a.u. b) 5/Z a.u. c) 2/Z a.u. d) — Z2/4 a.u. 
[See Hint.] rjj — 2n?/Z. 


[See Hint.] The sum of squares of angular dependencies equals A Use trigono- 
metric identities to remove angle dependence. 


a) —1/2 a.u. b) —2 a.u., —1/2 a.u. c) 25 d) 2, 1 e) —2 a.u. 


a) /=m=0. (No angular dependence: must be s.) 
b) 2 (quadratic in r). 

c) —1/18 a.u. (2 nodes means it is 3s, so n —3.) 
d) 18 a.u. 


a) Yes, at r = 6 a.u. b) 3p,(rsinO sing = y, so it is py, and there is one radial 
node). 


a) Looks OK. b) No. Blows up at large r. c) No. Lacks exponential decay 
function in r. 


[See Hint.] f 1s?r sin 8 cos dv = 0 (because cos $ is antisymmetric in each 
subrange 0—z , zt 22r). The average value of x should be zero because the electron 
is equally likely to be found at equal +x positions due to the spherical symmetry 
of y’. 


[See Hint] a) 0,5 = 35°15’, 144?45'. b) Angular nodes come where 
(3cos? 8 — 1) « 0. 6 =54.74°, 125.26°. 

xy— (rsinÓ cos $)(r sin 0 sin $) = r° sin? 0 cos $ sing = (r° sin? 0 sin 2) /2. 
Equation (4-45) predicts 2/3 and 0 for these integrals. Actual integration over 
x? and over 5x^/2 — 3x? /2 gives 2/3 and 0. Integral over all space for 3po3d.. 1 
involves integral from 0 to x of P? (cos 0) P; ? (cos 0) sin0 d0, which is same as 
integral from 1 to —1 of P? 5! (x) dx, so the integral vanishes. 


[15/2102 | (1 — cos? 6) cost exp(—2i$), f2. 


L.Yis (0.0) = (h/i)(d/db)®1,m(@) exp(imó) = mhOj,,(0)exp(imó) = mh 
Yi, (0, $). 
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4-23. 


4-24. 
4-25. 


4-26. 


4-27. 
4-28. 
4-29. 
4-30. 
4-31. 
4-32. 
4-33. 
4-34. 
4-35. 
4-36. 


4-37. 


4-38. 
4-39. 
4-40. 


4-41. 


L,.2po = L.RG) cos 0 = —i R(r) sin 6 sin o Æ constant x 2po 
i y2po = i R(r) sin 0 cos $ Æ constant x 2po 
Ê1s = 0= Î,Is = L,1s— Ê’Is---=0 x ls 


Since the vector has zero length, its x, y, z components must also have zero 
length. The question of vector orientation becomes meaningless. 


[—1/(sin 0) (d/d0) sind /d0]| R(r) cos0 —2R(r)cos0, I(14- 1) 22, 1— 1. 
P4 
3 p 
2 f- 
: —] 
-1 Sy 
-2 N 
-3 
Same for 6f 
[See Hint.] Will be eigenfunctions in cases b) and e). Only these correspond to 


mixing degenerate eigenfunctions. 
a) J6h or 4/6 a.u. b) d_o. 
a) 0. b) —3h or —3 a.u. 
a) 0. b) 2A? or 2 a.u. c) 0 (same as if all x’s were z's). d) 0 (by symmetry). e) 0. 
a) 65.2.1. b) 2Vop,. c) (71/18) Vp, d) — Vo. ,. 
a) Yes, —1/18. e) Yes, 0 (same as Liap,). f) Yes, 2. All others, No. 
1.44 x 107° a.u. 
2.80 x 10! Hz, 11.4 ppm. 
a) —1/32. b) 16. c) 0, V2, V6, 4/12. d) 7. e) 4. 
2 


pre =?=m r? + mri. Chug, chug, Bingo. 


Using Eq. (4-56), we write Eq. (4-68) as (22/52) = (-21 E /ħ y; (12/21) = 
Ewy; since there is no potential term, E is all kinetic, so this completes the 
demonstration. 


[See Hint.] First find u in a.m.u., then divide by 6.0221 x 107° to convert to 
kg/molecule. u = 1.6529 x 10777 kg. r= J/1/u=1.4144 x 10749 m. 


10.976 cm-!, 21.953 cm7 !, 32.929 cm! . 
1.129 x 107? m(u = 1.1385 x 1077 kg, I = 1.4504 x 10726 kg m?). 


J —3,so m ; 23,2,1,0, —1, —2, —3. The states split into seven equally spaced 
levels. 


J3/2, or 0.866 a.u. 
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Answers to Problems 


MC:ebbed ebcc 


Chapter 5 

5-1. -À(V2 + VÀ + V3) - 3/ri —3/r2 — 3/73 M ria + 1/73 + 1/r13. 

5-2. ris — 3/2Z = (for Het) a.u., 725 = 6/Z = (for He™) 3 a.u. 

5-3. E = 2.343 x 10? eV compared with an IE of 13.6eV. It shows that A small 
enough to locate the electron with useful precision involves photons with energy 
sufficient to excite the electron completely out of the system. 

5-4. [See Hint.] fwdv = 4 fffisa?2sQY? + 2(1s(1)2s(1)28(2)1s(2)) + 
2s(0?1sQ)?dv(Ddv2) = 5(1-142-0-041-D=1. 

5-5. V Qe D = (1//2)[18(2)2s(1) — 28(2)1s(1)] = — Wa. 

5-6. Upon substitution and expansion, complete cancellation occurs. 

5-7. (1//6)[1s2p1s(aBB — æ) + 1s1s2p(BaB — o BB) + 2plsIs(8Bo — BoB). 

5-8. f 1s*2sdv f 1s*1sdv f 2s*1sdv f a*ado f B*ado f a*Bdo =0-1-0-1-0- 
0-0. 

5-9. For ri = 1, m —2, r3 =0 get 
V(1,2,0) = (1/6) [1s(r = 1)2p.(r 22)1s(r 20) + 2pz(r = D)1s(r 22)1s(r = 0) 

2pz(r = D)Is(r 22)1s(r 20) — Is(r = D2pz(r 22)1s(r =0)] 
The other cases are the same except for factor of —1. Thus, y? is identical for 
all three cases, and no physical distinction exists. 
5-10. Replace each o in the lowest row with £. 
5-11. a) aa, BB, YY, ap + Ba, ay T ya, By E yp 
b) aB — Bo, ay — ya, By — yp 
5-12. a) Yes. Antisymmetric for exchange of any two electrons. 
b) —1/2(V2 + V5 + V3) — 3/ri —3/r2 — 3] rs + M rio t 1/123 dria. 
c) No. It is a product of one-electron orbitals, hence an independent-electron 
solution, but H is not separable. 
d) (—9/2)(1 + 1/4 + 1/9) = —6.125 a.u. e) —3/2 a.u. 
5-13. Equation (5-41) with U1 = 1s, U5 = 1s, U3 =2s, U4 = 25. 
5-14. F, 1s?2s*2p°, Z 29, £15 — 8.7, £x, = bop = 2.6. 
5-15. Let Âp —aQ and f ó*ódx —1. (A), = [¢* Aódx = f pap dv —a[ o*bdt =a. 
5-16. Li?^* isa hydrogenlike ion, and hence should have all states of same n degenerate. 


5-1 


- 


Li differs in that potential seen by electron is not of form — Z /r, due to screening 
of nucleus by other electrons. Hence, degeneracy is lost. The 2s AO of Li is 
lower than the 2p due to the fact that the 2s electron spends a larger fraction of 
time near nucleus where it experiences full nuclear charge. 
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5-18. 


5-19. 


5-20. 
5-21. 


5-22. 
5-23. 
5-24. 


5-25. 


5-26. 


5-27. 
5-28. 
5-29. 


5-30. 
5-31. 
5-32. 


5-33. 
5-34. 
5-35. 
5-36. 
5-37. 


n electrons of o spin give one state. Each time one o is changed to a f we get a 
different state. There are no spins available to change. 


Symmetric combination gives e? + 8e either way. Antisymmetric gives e? — 8e 
one way and — (e? — 8e) the other. 


a) 251/21 2 states). b) 73 /2,7P3/2,7S1/21 8 states). 


a) Not satisfactory since electron 1 is identified with the 1s AO, etc. 
[1s(1)3d2(2) — 3d2(1)1s(2)]a (1)a (2). 
b) ?Ds. 


120. 

a) ?P3/2,?pi2. b) 2. c) 4, 2. 

a) a(1)B(2) — B(1)e(2). b) —22.5 a.u. c) —22.5 a.u. + Jis», + Kis,2~,- 
d) Eigenvalue for S? is 0. 

All in a.u.: a) 6. b) 2. c) 12. d) 2, 1,0, 21, 2. e) 1,0, 10.  3,2,1,0,—1, 
—2, —3. 


a) In both cases, maximum net z components are: spin = 1, orbital = 1.

b) ³P0 below ³P1 below ³P2 for p² case, reverse for p⁴.

c) Each pairing reduces multiplicity to m + 1, where m is number of unpaired
electrons, which equals the number of holes. Each pair's contribution to M_L
due to their orbital's m_l value is either totally canceled by a pair at −m_l or
else half-canceled by an unpaired electron at −m_l. Therefore, the uncanceled
portions reflect the presence of unpaired electrons, hence holes. Therefore,
the amount of uncanceled m_l is equal to minus that which we could assign to
the holes.


a) 81/2. b) ^85. c) 1S9. d) 5E,. 
a) 12. b) ³P2, ³P1, ³P0, ¹P1.


a) ²P1/2 below ²P3/2. b) ⁴S3/2 below ²D3/2, ²D5/2 below ²P1/2, ²P3/2. (Rules
do not allow us to sort by J in this case since shell is exactly half filled.)


Striving for maximum multiplicity means avoiding pairing electrons. 
a) 15. b) 35. 


No. Impossible for doublet and singlet combinations to arise from same number 
of electrons. Each pairing reduces number of unpaired electrons by two, so 
allowed multiplicities are all even or all odd. 


a) 20. b) 12. c) 6. d) 60. e) 100. 
a) 28. b) ⁴F9/2, ⁴F7/2, ⁴F5/2, ⁴F3/2.
³D3. Also ³D2, ³D1.

2g? — g, 15, 45. 


Splittings equal 9.274 × 10⁻²⁴ J times g, with g equal to 4/3, 7/6, 1/2, 1, respectively.
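The g values quoted above are consistent with the Landé formula, g = 1 + [J(J+1) + S(S+1) − L(L+1)] / [2J(J+1)] (taking the electron spin g factor as exactly 2). The sketch below evaluates it; pairing the four g values with the terms ³D3, ³D2, ³D1, and ¹D2 is an assumption made here for illustration, as is a field of 1 T, since the problem statement is not reproduced in this appendix.

```python
from fractions import Fraction

BOHR_MAGNETON = 9.274e-24  # J/T

def lande_g(S, L, J):
    """Lande g factor for a Russell-Saunders level (spin g taken as 2)."""
    S, L, J = Fraction(S), Fraction(L), Fraction(J)
    return 1 + (J * (J + 1) + S * (S + 1) - L * (L + 1)) / (2 * J * (J + 1))

# Assumed term assignment (hypothetical): (S, L, J) for each level.
terms = {"3D3": (1, 2, 3), "3D2": (1, 2, 2), "3D1": (1, 2, 1), "1D2": (0, 2, 2)}
g_values = {name: lande_g(*slj) for name, slj in terms.items()}

# Splitting between adjacent M_J levels in an assumed field of 1 T.
splittings = {name: BOHR_MAGNETON * float(g) for name, g in g_values.items()}

for name in terms:
    print(name, g_values[name], splittings[name])
```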
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MC:deed 
Chapter 6 


6-1. ∫ψ*(d²/dx²)φ dv = ∫φ(d²/dx²)ψ* dv. Use ∫v du = uv|(limits) −
∫u dv. On the left, let v = ψ*, u = dφ/dx, du = d²φ/dx², dv = dψ*/dx. On
the right, let v = φ, u = dψ*/dx, etc. The uv term vanishes since ψ* and φ each
vanish at limits. The remaining integrals are identical.


6-2. Each equals SAS /27. 
6-3. ∫ψ*ψ dv = 1, ∫χ_i*χ_j dv = δ_ij, ψ = Σ_i c_i χ_i. Then

∫ψ*ψ dv = Σ_i Σ_j c_i* c_j ∫χ_i*χ_j dv = Σ_i Σ_j c_i* c_j δ_ij = Σ_i c_i* c_i = 1.

QED.
6-4. ψ = Σ_i c_i μ_i, ∫μ_i*μ_j dv = δ_ij, want c_k:

∫μ_k*ψ dv = ∫μ_k* Σ_i c_i μ_i dv = Σ_i c_i ∫μ_k*μ_i dv = Σ_i c_i δ_ki = c_k


6-5. a) (1/π) ∫₀^2π cos 2φ (ħ/i)(d/dφ) cos 2φ dφ = −(2ħ/iπ) ∫₀^2π cos 2φ sin 2φ dφ, and the
integral of sym · antisym over this range = 0.

ψ = (1/√2)[(1/√(2π)) exp(2iφ)] + (1/√2)[(1/√(2π)) exp(−2iφ)]

Terms in [] are normalized eigenfunctions of L_z with eigenvalues of +2ħ and
−2ħ. So ⟨L_z⟩av = (1/√2)²(2ħ) + (1/√2)²(−2ħ) = 0.
6-6. [x, p_x]f(x) = [x(ħ/i)(d/dx) − (ħ/i)(d/dx)x] f(x) = (ħ/i)(xf′ − f − xf′) = −(ħ/i)f

Δx · Δp_x ≥ (1/2)|∫ψ*(−ħ/i)ψ dv| = (1/2)|−ħ/i| = ħ/2


6-7. p must be identical to the eigenfunction yo. 


6-8. No. The existence of some real eigenvalues does not guarantee that the operator 
satisfies the definition of hermiticity: 


(d/dr) exp(−ar) = −a exp(−ar)

but

∫₀^∞ exp(−ar)(d/dr) exp(−br) r² dr ≠ ∫₀^∞ exp(−br)(d/dr) exp(−ar) r² dr
if a ≠ b


6-9. a) Each side equals —(1/24/2)yri exp(it/2) — (1/8 /2) Wo,9 expGt /8). 
b) [See Hint.] (Ψ*Ψ) at t = 0 is (1/2)(1s² + 2 · 1s2p₀ + 2p₀²); (Ψ*Ψ) at t = 1/ν is
(1/2)(1s² + 2cos(3/8ν)1s2p₀ + 2p₀²). Equal when 3/8ν = 2π; ν = 3/16π.
ΔE = (1/2)(1 − 1/4) = 3/8 a.u.; ΔE = hν = 2πħν = 2πν in
a.u.; ν = ΔE/2π = (3/8)/2π = 3/16π.
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6-10. 
6-11. 
6-12. 
6-13. 


6-14. 


6-15. 
6-16. 


6-17. 
6-18. 


6-19. 
6-20. 
6-21. 
6-22. 


6-23. 


6-24. 


a = NE DA 
[See Hint.] S = /3/2, and so 9 = (2//3z)(r — å) exp(—r). 
a) 8√8/27 = 0.838. b) 0 (by symmetry).


[See Hint.] a) p_x ψ₀ = −(βħx/i)ψ₀ ≠ constant times ψ₀. Momentum is hence
not a constant of motion.

b) ∫ψ₀ p_x ψ₀ dx = −(βħ/i) ∫ψ₀ x ψ₀ dx = ∫(sym)(anti)(sym) = 0. (Must be zero
since otherwise motion in one direction would involve greater momentum than
motion in the other.)


First pair: No. x?(d?/dx?) + x? (d? /dx?) z 2x(d/dx) + Ax?(d? /dx?) + 
x?(d?/dx?). Second pair: Yes. Both arrangements give 2x?(d^/dx?) + 
x? (d? /dx?). 


a) 6 a.u. (since l = 2). b) 0 (since real form of ψ involves equal mix of ±m).


a) Because these operators commute, yes, there must be a set of simultaneous 
eigenfunctions. 

b) Mixing degenerate-energy cases gives functions that remain eigenfunctions 
for the energy operator but not for the momentum operator. (These are the 
sine and cosine versions.) 

c) Exactly knowable because knowledge of momentum gives knowledge of an 
eigenfunction which in turn gives knowledge of energy. 


E = (1/3)(—1/2 — 1/8 — 1/18) = —0.2268 a.u. 


[See Hint]. Normalized y = 0.26726[1s + 2(2p1) + 3(3d2)]. (È) = 
0.267267[0 + 4(1) +9(2)] = 1.571 a.u. 


Proofs are in Sections a) 6-8, b) 6-9, c) 6-11. 
|Ψ|² = (1/2){ψ₁² + ψ₂² + 2|ψ₁ψ₂| cos[(E₂ − E₁)t/ħ]}.


Cycle time = mL²/h = mL²/2π a.u.

c_n = (2√2/L) ∫₀^(L/2) sin(2πx/L) sin(nπx/L) dx.

a) /2w and V are same function in range 0 < x < L/2, guaranteeing c? = V2. 
No other c can be larger or the resulting function will give a total probability 
density greater than 1. 

b) ψ₄ and Ψ have opposite symmetry for 0 ≤ x ≤ L/2.

c) As ψ_n becomes more oscillatory, the positive and negative portions of its
product with Ψ will cancel more effectively.


a) cı —0.838, c2 = 0.2048. 

b) Whereas only s-type AOs appear in Eq. (6-41), explicit account of changing 
potential would yield a nonspherically symmetric potential so that p-type AOs 
would enter too. 


[See Hint.] c; is small if there is effective cancellation between positive and 
negative portions of the product of exp(—ax?) and cos(kx). Larger k makes 
cos(kx) more oscillatory and makes cancellation more effective. The broader 
exp(—ax?) is, the more effective is this cancellation for a given value of k. 
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6-25. a) Real V means real Ê. AY (x, y, z,t)=—(h/i)(0/dt) W(x, y, z, t). Complex 
conjugate of equation gives ĤΨ*(x, y, z, t) = (ħ/i)(∂/∂t)Ψ*(x, y, z, t).
Transform t → −t throughout (does not affect equality), then recognize
∂/∂(−t) = −∂/∂t, so ĤΨ*(x, y, z, −t) = −(ħ/i)(∂/∂t)Ψ*(x, y, z, −t).
Q.E.D.

b) Ψ(x, y, z, t) becomes ψ(x, y, z)exp(−iEt/ħ). Ψ*(x, y, z, −t) becomes
ψ*(x, y, z)exp[iE(−t)/ħ] = ψ*(x, y, z)exp(−iEt/ħ). Carrying these
through Eqs. (6-3,4,5) shows that Ĥψ* = Eψ*.

c) If ψ and ψ* are independent, we have two solutions with the same energy,
i.e., degeneracy. If E is not degenerate, ψ and ψ* are not different, so ψ = ψ*
and is real.

d) 2p₋₁ becomes −2p₊₁. No change for 2p₀.

e) No. While all complex eigenfunctions must be degenerate, real eigenfunctions
can be degenerate too. Cases we have seen are (1) accidental degeneracies
such as ψ₃₃₃ and ψ₅₁₁ in the 3-dimensional cubic box and (2) real eigenfunctions
constructed as linear combinations of complex ones, as 2p_x and 2p_y
from 2p₊₁ and 2p₋₁.


MC:cad 
Chapter 7 
7-1. a) Yes. c₁ = √(2/L) ∫₀^L f(x) sin(πx/L) dx, c₂ = √(2/L) ∫₀^L f(x) sin(2πx/L) dx.
c₁ = 0 by symmetry. c₂ is positive.

b) No. Both a) and b) are continuous, smooth, single-valued functions, but b)
does not go to zero at x = 0, L as do all the box eigenfunctions.


7-2. a) 


b) 1.3h²/8mL².
7-3. c₁ = √0.4 = 0.632, c₃ = 0 by symmetry.
7-4. a) [See Hint.] For n = odd,

c_n = 2 ∫₀^(L/2) φ ψ_n dx = ±4√6/(n²π²), with + for n = 1, 5, 9, 13, ... and − for n = 3, 7, 11, 15, ...

For n = even, c_n = 0 (by symmetry) (i.e., φ is symmetric and so contains only
symmetric ψ_n).
b) φ_approx(x = L/2) = Σ (n = 1 to m) c_n ψ_n(x = L/2) = (4√6/π²)√(2/L) Σ (odd n = 1 to m) (1/n²)
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7-5. 


7-7. 


7-10. 
7-11. 


7-12. 


7-13. 


m:                         1        3        5        7        9       ...  135
√L φ_approx(x = L/2):      1.40395  1.55994  1.61609  1.64475  1.66208 ...  1.72689
√L φ(x = L/2) = √3 = 1.73205.
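The tabulated values can be reproduced by summing the odd-n terms directly. This is a minimal sketch that assumes the expansion coefficients have magnitude 4√6/(n²π²) with a sign that cancels against the sign of ψ_n(L/2) = ±√(2/L), so that every odd term contributes positively; the function name is illustrative.

```python
import math

def sqrt_l_phi_approx(m):
    """sqrt(L) * phi_approx(L/2), summing odd n from 1 through m.

    Each odd term contributes (4*sqrt(6)/(n^2 pi^2)) * sqrt(2) after the
    factor sqrt(1/L) is removed (assumed form of c_n * psi_n at x = L/2).
    """
    prefactor = 4 * math.sqrt(6) * math.sqrt(2) / math.pi ** 2
    return prefactor * sum(1.0 / n ** 2 for n in range(1, m + 1, 2))

for m in (1, 3, 5, 7, 9, 135):
    print(m, round(sqrt_l_phi_approx(m), 5))
```

As m grows the sum approaches π²/8, so the product approaches √3, the exact value of √L φ at the midpoint.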

c) [See Hint.]

Ē = Σ (odd n) c_n² E_n = Σ (odd n) (96/n⁴π⁴)(n²h²/8mL²) = (12h²/π⁴mL²) Σ (odd n) (1/n²)

< (12h²/π⁴mL²) [ Σ (odd n = 1 to m) (1/n²) + (1/2) ∫ (m+1 to ∞) (1/n²) dn ]

= [(h²/8mL²)(96/π⁴)(1.23386)] for m = 9
= [1.21602(h²/8mL²)] for m = 9; [1.21432(h²/8mL²)] for m = 135


Normalizing factor = (2a/π)^(3/4); Ē = (3a/2) − 2√(2a)/√π; a = 8/9π;
Ē(min) = −4/3π = −0.4244 a.u.; r̄ = 1.5 a.u.; r_mp = √(9π)/4 = 1.329 a.u. (For
ψ_exact, E = −0.5 a.u., r̄ = 1.5 a.u., r_mp = 1.0 a.u.)
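The numbers in this answer follow from minimizing Ē(a) = 3a/2 − 2√(2a/π) for a Gaussian trial function exp(−ar²) on the hydrogen atom. The sketch below evaluates the closed-form results and confirms the minimum by comparison with neighboring values of a; the expressions for r̄ and r_mp are the standard moments of the density exp(−2ar²), and the names are illustrative.

```python
import math

def energy(a):
    """Average energy (a.u.) of the Gaussian trial function exp(-a r^2) for H."""
    return 1.5 * a - 2.0 * math.sqrt(2.0 * a / math.pi)

a_opt = 8.0 / (9.0 * math.pi)                 # from dE/da = 0
e_min = energy(a_opt)                          # equals -4/(3 pi)
r_mean = 2.0 / math.sqrt(2.0 * math.pi * a_opt)  # <r> for density exp(-2 a r^2)
r_mp = 1.0 / math.sqrt(2.0 * a_opt)            # maximum of r^2 exp(-2 a r^2)

print(round(a_opt, 4), round(e_min, 4), round(r_mean, 3), round(r_mp, 3))
```

The trial energy lies above the exact −0.5 a.u., as the variation principle requires, even though r̄ happens to match the exact value.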


. a) and b) See text and Eqs. (7-16)-(7-20). c) a= i E = —0.370 a.u. 


a) 1/4/8. b) —0.292 a.u. 


. a) Zero, because the function is symmetric for xy reflection whereas 2p; is 


antisymmetric. b) —0.4215 a.u., (by assuming c3, is V0.05 and all higher terms 
vanish). 


. a) £ 25/3, E=—0.2777 a.u. 


b) cı — 0.897. 

c) x =2.26(¢ — 0.8975). 

d) No. x cannot have E lower than the n = 2 value, which is the lowest-energy 
case orthogonal to n = 1 and which has E = —0.125 a.u. 


—0.75 a.u. 


[See Hint.] The energy E = −0.375 a.u. = (0.9775)²(−1/2 a.u.) + higher-energy
contributions. But this leading term equals −0.478 a.u., and so the net value of
the higher energy terms must be positive. Therefore, at least one of them must
correspond to a state with positive energy, that is, a continuum state.


Slater's rules give ζ = 1.7, whereas the variation method gives ζ = 27/16 =
1.6875.
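A brief sketch of where the two exponents come from, assuming the problem concerns the helium ground state with both electrons in the same 1s-type orbital exp(−ζr). In that case the familiar average energy is Ē(ζ) = ζ² − 2Zζ + (5/8)ζ, which is minimized at ζ = Z − 5/16, while Slater's rule subtracts a screening of 0.30 for the other 1s electron. The function name is illustrative.

```python
from fractions import Fraction

def helium_energy(zeta, Z=2):
    """Average energy (a.u.) of 1s(1)1s(2) with orbital exponent zeta."""
    return zeta ** 2 - 2 * Z * zeta + Fraction(5, 8) * zeta

Z = 2
zeta_variation = Z - Fraction(5, 16)   # root of dE/dzeta = 0
zeta_slater = Z - Fraction(3, 10)      # Slater screening of 0.30

print(float(zeta_variation), float(helium_energy(zeta_variation)))
print(float(zeta_slater), float(helium_energy(zeta_slater)))
```

The two exponents differ by little, and the energy evaluated at the Slater value lies only slightly above the variational minimum.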


a) S₁₁ = 1, S₁₂ = 0, S₂₂ = 1, H₁₁ = −1/2, H₁₂ = F, H₂₂ = 0. E = −1/4 −
(1/4)√(1 + 16F²); for F = 0.1, E = −0.51926 a.u. This trial form is superior
because zψ₁ₛ is more contracted than ψ₂p_z, closer in size to ψ₁ₛ, hence
interferes constructively and destructively with ψ₁ₛ more effectively.

[See Hint.] lim(F → 0) of √(1 + 16F²) = 1 + 8F²; in this limit, E = −1/2 − 2F², so
−(1/2)αF² = −2F²; α = 4. E(e²/a₀) = αF² (α units) · (e/a₀²)², so α units = a₀³.
(See Appendix 10.)


b 


wm 


Since 2s is isoenergetic with 2p states, these should mix freely in response to
field. Hence, 2s is more polarizable.




7-15. 
7-16. 
7-17. 


7-18. 
7-19. 


7-20. 


7-21. 


7-22. 


7-23. 
7-24. 


7-25. 
7-26. 
7-21. 


7-28. 


7-29. 
7-30. 
7-31. 
7-32. 


7-33. 


7-34. 


7-35. 


Answers to Problems 


E(min) = —12/5 = —2.4 a.u. y = /2/5(ba + op). 

[See Hint.] E (lowest) = —2.030 a.u. y = 1.0459, — 0.17995. 

[See Hint] Eas = (67/2) — 2 + 2(6 + D) exp(—2); ¿vest = 0.9118, Eetec = 
−0.9668 a.u.; E_elec + 1/R = E_tot = −0.4668 a.u. Since this energy exceeds
that of H + H⁺ (−0.5 a.u. at R = ∞), this function does not demonstrate the
existence of a bound state.


1σ_u becomes a 2p AO of He⁺, so E = −0.5 a.u.


[See Hint.] Since they are degenerate, ψ₊ and ψ₋ may be mixed. The sum gives
1s_a, describing the case in which the electron is at a. The difference gives 1s_b.


[See Hint.] Both equal −3/2 a.u. This is higher than the lowest He⁺ eigenvalue
because these are hydrogen atom 1s functions instead of He⁺ functions.
k must be greater than S. 


a) Ĥ = −(1/2)∇² − (1/r_H) − 2/r_He.
b) Separated atoms: lowest energy for H⁺ + He⁺(1s) = −2 a.u. For the united
atom, Li²⁺(1s) = −4.5 a.u.


a) π_u bonding. b) σ_u antibonding. c) σ_g bonding. d) δ_g bonding. e) π_u bonding.
1sσ_g 2p_zσ_u 3p_yπ_u 3d_xyδ_g
2sσ_g 2p_xπ_u 3d_z²σ_g 3d_xzπ_g

a) Antibonding. b) Bonding. c) Bonding. 


2p_x, 2p_y, 3p_x, 3p_y, 3d_xz, 3d_yz.


[See Hint.] The integrals that vanish by symmetry are b), c), e), and f); g) does
not vanish. The AOs are orthogonal due to different symmetry for reflection in
the xy plane at a. But Ĥ is not invariant to this reflection. In f), the relevant
reflection is through the yz plane; Ĥ is invariant to this one.


Sketches show that several planes qualify, but symmetries are opposite no matter 
which is chosen. For instance, if the xz plane is selected, 5,2_,2 is symmetric, 
dxy is antisymmetric, Xyz is symmetric, Xyz antisymmetric. 


0. (It is a σ MO.)


y 


a) σ_u antibonding. b) π_u bonding. c) π_g antibonding. d) δ_g bonding.
a) 4. b) 3 (a triplet). c) (1) increase, (2) decrease. d) 2 (a doublet). 


a) 1071a:267204302 Im, Ix. b) 5. c) Of has larger Do. d) 7Mg. 
e) Allo MOs. 

For He₂, the second MO (σ_u1s) correlates with the third united atom AO (2pσ). For
LiH, the second MO (σ) correlates with the second united atom AO (2s). Thus, this
MO is less antibonding in the heteronuclear case.


[See Hint.] 2s’ = 1.0295 2s — 0.2447 1s; 20, = 0.0136 1s 4 — 0.6523 2s", — 
0.0854 2p,,4 and similarly for B. 


E_elec = E_sep atoms − V_nn − D_e; D_e = −E_elec − V_nn + E_sep atoms = (1.1026 −
0.500 − 0.500) a.u. = 0.1026 a.u.




MC: 


aadbd 


Chapter 8 


8-1. 


8-2. 


8-3. 


8-7. 


For ψ_prod, E = E₁ + E₂ + E₃. For ψ_det, E = (1/6)(E₁ + E₂ + E₃) six times. Energies
of products in ψ are E₁ + E₂ + E₃ and E₁ + E₂ + E₄. These can be factored out
to give Ĥψ = Eψ only if E₃ = E₄.


a) x 10000 

td tx 1010 

| | 0 1x 100 

4—5 00 1x10 

^6 0 10 1x I 

0000 1 x 

b) x 11000 

| 5 1x 1000 

Psa] |b ix too 

d 6 00 ! x I I 

000 I x I 

000 1 1 x 
x 1 

c) (only 2 unsaturated carbons, so the same as ethylene). 

x 


d) Same as c). Same as two ethylenes since the two zr systems are noninteracting 
due to spatial separation. 

e) Same as c). Same as two ethylenes since the two zr systems are orthogonal 
and noninteracting. 


—0, x*—3x?-0 See Appendix 6 for results. 


lE EE 
oO x — 
on OF 
= OOF 


: i since only x3 is common to both MOs. 


S = 1/√2, 1/√(1 − S²) = √2, φ₂′ (normalized) = √2(φ₂ − φ₁/√2). This results
in all |c| = 1/2, with negative values on same side. That is, the node for φ₂′ is
vertical, while that for φ₁ is horizontal.


. a) a — 26 (using octagon in circle). 


b) 


(since totally antibonding). 


a) E = α + 2β(√2/(3√3) + 0 + 0 + 0) = α + 0.544β.
b) p₁₂ = 0.272. All others zero.
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8-8. 


8-9. 


8-12. 


8-13. 
8-14. 


8-15. 


8-16. 
8-17. 


8-18. 




c) E = α + β. (It is an ethylene pi bond, distributed over three locations, or
three-thirds of a double bond. Or one can use a = 1/√6 and equations for p
and E.)


Highest two levels have same magnitudes as lowest two, with coefficients 
multiplied by —1 on alternating atoms (here taken to be second and fourth). 


: NONO 


a) No. This is an odd alternant, so expect paired energy levels with an MO at 
E = α. The unpaired electron is therefore not in a degenerate level. b) Yes.
Pentagon in circle forces degeneracies. Second level has three electrons. c) Yes. 
Square in circle shows that second level is degenerate. It contains one electron. 
d) Yes. The degenerate level of benzene now has an odd number of electrons. 


Bond orders: CH₂-CH, 0.8944 → 0.6708; CH-CH, 0.4472 → 0.5854. Bond
lengths in Å (Eq. 8-61): CH₂-CH, 1.354 → 1.392; CH-CH, 1.436 → 1.408;
ΔCH₂-CH = +0.038 Å, ΔCH-CH = −0.028 Å.


For benzene, c,; should be taken as 1/ /6 since all carbons are equivalent. 


Only fluoranthene deviates markedly because it is nonalternant. Hence, its
LUMO and HOMO energies are not symmetrically disposed about E = α.


a) Oxidation potential ≈ 0.97 V, reduction potential ≈ 1.41 V. b) [See Hint].
To shorten: 4-10, 9-10, 8—9; to lengthen: 3-10, 1—9, 4—5, 7-8; otherwise no 
change. 


E = 18α + 21.877β. Error = 0.0015β per π electron.


a) Naphthalene; E_π (from Table 8-2) = 10α + 13.128β, from HMO = 10α +
13.6832β. The difference = 0.055β per π electron, aromatic. Perylene;
E_π (Table 8-2) = 20α + 27.2796β, E_π (HMO) = 20α + 28.2453β; ΔE_π =
0.048β per π electron, aromatic.

b) RE for perylene is slightly less than double that for naphthalene. The central
ring does not appear to be contributing.

c) These two bonds are single in all formal (nonpolar) structures.

d) The calculated length = 1.433 Å. The HMO length is too short. The actual
length is more consistent with these being "truly" single bonds.
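The per-electron differences in part a) are simple arithmetic on the β coefficients, sketched below. The reading of the garbled figures as 13.128β, 13.6832β, 27.2796β, and 28.2453β is an assumption of this sketch, and the function name is illustrative.

```python
def difference_per_pi_electron(hmo_beta, reference_beta, n_pi_electrons):
    """(E_HMO - E_reference) / n, in units of beta (alpha terms cancel)."""
    return (hmo_beta - reference_beta) / n_pi_electrons

naphthalene = difference_per_pi_electron(13.6832, 13.128, 10)
perylene = difference_per_pi_electron(28.2453, 27.2796, 20)

# Total resonance energies in units of beta, for the comparison in part b).
naphthalene_total = naphthalene * 10
perylene_total = perylene * 20

print(round(naphthalene, 3), round(perylene, 3))
print(round(naphthalene_total, 4), round(perylene_total, 4))
```

The perylene total is somewhat less than twice the naphthalene total, which is the observation made in part b).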


The fourth molecule in Fig. 8-24 should strive for six electrons in each ring. This 
would make the left side (i.e., the seven-membered ring) net positive. The other 
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8-19. 
8-20. 


8-21. 
8-22. 


8-23. 


8-24. 


two molecules become net negative on the left. Electron densities corroborate
this. For the fourth molecule, charge densities exceed unity in the five-membered
ring and are less elsewhere. The fifth molecule has only one π electron density
less than unity, and this is for the methylene carbon.


a) 9. b) 10. c) 4. d) 6. e) 10. 


Since the formal structure always shows C-O single bonds, C₁=C₂ and C₃=C₄
double bonds, and C₂-C₃ as single, we can use the single-, double-bond distinctions
of Table 8-3. These give

x     1.1   0       0     0.8     0       0       0
1.1   x     0.9     0     0       0.3     0       0
0     0.9   x−0.1   1.1   0       0       0.8     0
0     0     1.1     x     0.8     0       0       0
0.8   0     0       0.8   x+2.0   0       0       0
0     0.3   0       0     0       x+1.5   0       0
0     0     0.8     0     0       0       x−0.1   3.0
0     0     0       0     0       0       3.0     x−0.5

Otherwise, the positions with 1.1 and 0.9 become 1.0.
a) Left. b) Left. c) Right. 


q_r = 1 at all centers, and so it does not predict some centers best for nucleophilic,
and hence worst for electrophilic substitution, so it doesn't apply to
this question. Since the HOMO and LUMO have identical absolute coefficients
(by the pairing theorem), the same site is most favored for both nucleophilic
and electrophilic substitution. L_r must be identical for nucleophilic, radical,
or electrophilic substitution because an interrupted even alternant produces an
odd alternant, for which cationic, neutral, and anionic π energies (β part) are
the same. Thus, HOMO and LUMO indices are consistent with coincidence of
active sites for nucleophilic and electrophilic substitutions, and L_r is consistent
with the coincidence of these with active sites for radical addition.


No. Both types should prefer the most polarizable site, since that is the site to 
which charge is most easily attracted or from which it is most easily repelled. 


a) F₁ = 0.0684, F₂ = 0.4618, F₄ = 0.9737.

b) Index    Values                                Preferred site
   q_r      q₂ = 0.818      q₄ = 1.488            4
   HOMO     c₂ = 0.1356     c₄ = 0.6646           4
   L_r⁺     L₂⁺ = 2.134β    L₄⁺ = 0.962β          4
   π_rr     π₂₂ = −0.4340   π₄₄ = −0.4019         2


c) Only protons on C» and C3 will produce ESR splitting in simplest theory, 
since the singly occupied MO of the radical anion is zero elsewhere. d) Net 
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8-25. 


8-26. 


8-27. 


8-28. 
8-29. 




bonding, because energy is below E = a and this happens only when bonding 
interactions dominate. 


The correlation is fairly good except for styrene, which is way off. But styrene 
is the only member of the set where addition is not occurring at a ring posi- 
tion. Because the geometric constraints are so different, the relation between 
free valence and transition-state energy is presumably rather different for 
styrene. 


The five-membered ring because, as it strives for six electrons in order to satisfy 
the 4n + 2 rule, it becomes negatively charged. (The seven-membered ring also 
strives for six electrons, becoming positive.) 


Fulvene a) should experience greater change. It is not an alternant hydrocarbon, 
whereas benzyl b) is. The MO into which the electron goes in benzyl is necessar- 
ily nonbonding, with zero coefficients at positions 1, 3, and 5. (See Appendix 5 
for discussion, Appendix 6 for coefficient values.) 


a) 11. b) 2 and 4. c) 3, 6, and 8 (1, 9, and 10 have no attached hydrogen). 


a) is easier to ionize because the HOMO is zero at the nitrogen position (in the 
all-carbon analog). This means ionization does not remove electronic charge 
from the more electronegative atom in case a) but does in case b). 


MC:cb 
Chapter 9 


9-1. 


9-2. 


9-4. 


9-5. 


9-6. 


3i+16 2i+28 
25 1 1 
aana s Poo] us e Pleat 9 sl ai 51 
Ta 7b Te i—7 18 0 1 12 21 


f) product not defined g) 1 


H is defined to be hermitian if H_ij = H_ji*. H_ji = ∫χ_j* Ĥ χ_i dτ, and so H_ji* =
∫χ_j Ĥ* χ_i* dτ. But if Ĥ is hermitian, this must equal ∫χ_i* Ĥ χ_j dτ = H_ij. Therefore,
H_ji* = H_ij and H is hermitian.


AC — CA. But CA = ÁC, so AC = AC. This must be true in this example 
because A and C are symmetric. That is, A =A, C=C. 


a) |B − λ_i 1| = |T⁻¹AT − λ_i 1| = |T⁻¹AT − λ_i T⁻¹T| = |T⁻¹(A − λ_i 1)T| =
|T⁻¹| |A − λ_i 1| |T| = |TT⁻¹| |A − λ_i 1| = |A − λ_i 1|

b) For diagonal B, value of |B − λ_i 1| is product of diagonal elements. For this
to vanish, at least one such element must vanish. This will occur whenever λ_i
equals a diagonal element of B. Therefore, the latent roots are the diagonal
values.


If a latent root is zero, then the product of latent roots is zero. But this product 
is the value of the determinant of the matrix. If the determinant of the matrix is 
zero, there is no inverse. 
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9-8. 


9-9, 


9-10. 


9-11. 


9-12. 


9-13. 


a) tr(ABC) = Σ_i (ABC)_ii = Σ_i Σ_j Σ_k a_ij b_jk c_ki
= Σ Σ Σ c_ki a_ij b_jk which is Σ_k (CAB)_kk = tr(CAB)
= Σ Σ Σ b_jk c_ki a_ij which is Σ_j (BCA)_jj = tr(BCA)
≠ Σ Σ Σ c_ik b_kj a_ji which is Σ_i (CBA)_ii, hence ≠ tr(CBA).
b) tr(T⁻¹AT) = tr(TT⁻¹A) = tr(A).


(norm TAT)? = Y AD, jT AD ji = M TAT); (TAT) ji 
i,j i,j 
= Y Or DUO 100001 
i,j,k,l 


=) | AUAA Dau) yj 
k,l | i . j 
1 
= 3 Du (A) = (norm A? 


k,l 


a) tr = 0, det = 2, norm = √6; therefore, a + b + c = 0, abc = 2, a² + b² + c² = 6;
solutions: 2, −1, −1.
b) Solutions 1, 1, −1. c) 0, 1 + √3, 1 − √3.
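Part a) recovers the latent roots from three invariants: their sum (trace), their product (determinant), and the sum of their squares (norm squared, for a real symmetric matrix). The sketch below does a brute-force search restricted to small integers, which is an assumption that happens to suit part a); it would not find irrational roots such as those in part c). The function name is illustrative.

```python
from itertools import combinations_with_replacement

def integer_roots_from_invariants(trace, det, norm_squared, candidates=range(-3, 4)):
    """Return sorted integer triples (a, b, c) matching the three invariants."""
    return [
        triple
        for triple in combinations_with_replacement(candidates, 3)
        if sum(triple) == trace
        and triple[0] * triple[1] * triple[2] == det
        and sum(x * x for x in triple) == norm_squared
    ]

roots_a = integer_roots_from_invariants(trace=0, det=2, norm_squared=6)
print(roots_a)
```

Only one unordered triple satisfies all three conditions for part a), namely the roots 2, −1, −1.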
3cos0 ; : 
Both vectors transform to 3snel Hence, reversal is not possible and trans- 
—3sin 
formation is singular. This is verified by fact that the determinant vanishes. 


The matrix is already diagonalized. This means the eigenvector matrix is the 
3 x 3 unit matrix. 


Let T⁻¹AT = D_A (diagonal) and T⁻¹BT = D_B (diagonal). Then D_A D_B = D_B D_A
(diagonal matrices commute); T⁻¹ATT⁻¹BT = T⁻¹BTT⁻¹AT; T⁻¹ABT =
T⁻¹BAT; TT⁻¹ABTT⁻¹ = TT⁻¹BATT⁻¹; AB = BA.


In the second case, C is not unitary, since C†SC = 1. The ordinary procedures
for diagonalizing H have built in the requirement that C†C = 1. The problem
would be to find a matrix C that simultaneously diagonalizes H and satisfies
C†SC = 1.


] $62 (10) ()-* IEEE 0) (') =1 




Chapter 10 
10-1. z 
UM 
1C4—— 0, —> x 
H 
10-2. AO no.  Atom  AO type    AO no.  Atom  AO type
      1       H₁    1s         6       C₃    2p_y
      2       H₂    1s         7       O₄    2s
      3       C₃    2s         8       O₄    2p_z
      4       C₃    2p_z       9       O₄    2p_x
      5       C₃    2p_x       10      O₄    2p_y


10-3. 


E = −0.756 a.u.; MO 9
φ₉ = −0.27 1s₁ − 0.27 1s₂ − 0.49 2s_C + 0.22 2p_xC + 0.31 2s_O + 0.33 2p_xO


A σ MO, mainly C-H₂ bonding and lone pair (nonbonding) on oxygen. Shows
some C-O antibonding character [see (I)].


E = −0.611 a.u.; MO 8


A σ MO, CH₂ and C-O bonding [see (II)].


E = −0.597 a.u.; MO no. 7
φ₇ = 0.24 2p_zC + 0.92 2p_zO
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10-4. 


10-5. 


10-6. 


10-7. 


10-8. 


10-9. 


10-10. 


A π MO, mostly on oxygen, but somewhat delocalized to give some C-O
bonding [see (III)].


The π MOs are 4 and 7. All others are σ.


C and O 2p_z AOs are 4 and 8. The 4, 8 overlap population is seen from the data
to be 0.1936. Since MO 7 is C-O bonding, loss of an electron should cause the
C-O bond to lengthen.


E₇ = (0.2456)²(−10.67 eV) + (0.9181)²(−15.85 eV)
+ 2(0.2456)(0.9181)(1.75)(0.2146)(−10.67 eV − 15.85 eV)/2
= −16.25 eV = −0.5972 a.u.
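The energy expression above has the form E = c_C²H_CC + c_O²H_OO + 2c_C c_O H_CO, with the off-diagonal element taken as K·S·(H_CC + H_OO)/2 and K = 1.75. A minimal numerical check follows; the variable names and the conversion factor of 27.2114 eV per hartree are choices made here rather than values quoted in the text.

```python
EV_PER_HARTREE = 27.2114

c_C, c_O = 0.2456, 0.9181      # MO 7 coefficients on the C and O 2p_z AOs
H_CC, H_OO = -10.67, -15.85    # diagonal elements, eV
K, S_CO = 1.75, 0.2146         # proportionality constant and overlap

# Off-diagonal element proportional to overlap times the average diagonal element.
H_CO = K * S_CO * (H_CC + H_OO) / 2.0

E7_eV = c_C ** 2 * H_CC + c_O ** 2 * H_OO + 2.0 * c_C * c_O * H_CO
E7_au = E7_eV / EV_PER_HARTREE

print(round(E7_eV, 2), round(E7_au, 4))
```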


The sum of the elements in the upper triangle = 12. (Use of all elements would 
count overlap populations twice.) 


If column 7 is the gross populations of MO no. 7, then it should turn out
that 0.2175 = 2[c₄₇² + (0.5)(2)c₄₇c₈₇S₄₈] = 2[(0.2456)² + (0.2456)(0.9181)
(0.2146)] = 0.2175.
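The gross population splits each overlap population equally between the two AOs involved. A short sketch for the two-AO MO 7, using the coefficients and overlap quoted above (the variable names are illustrative), also shows that the two gross populations add up to the two electrons in the MO.

```python
c_C, c_O = 0.2456, 0.9181   # coefficients of AO 4 (C 2p_z) and AO 8 (O 2p_z) in MO 7
S_48 = 0.2146               # overlap between AOs 4 and 8
occupation = 2              # electrons in MO 7

# Net AO population plus half of the overlap population 2 * c_C * c_O * S.
gross_C = occupation * (c_C ** 2 + c_C * c_O * S_48)
gross_O = occupation * (c_O ** 2 + c_C * c_O * S_48)

print(round(gross_C, 4), round(gross_O, 4), round(gross_C + gross_O, 4))
```

With coefficients given to four figures the carbon value comes out within rounding of the 0.2175 quoted.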


These must be AOs because MO charges must be 0, 1, or 2. AO 4, for example, 
gets its charge from MO 7. We have just seen (previous problem) that this is 
0.2175. (AO 4 also appears in MO 4, and the "charge matrix" gives a value of 
1.7825 for this. But this does not appear in the gross population because MO 
no. 4 is unoccupied in the ground state configuration.) 


The net charges are the AO charges plus the nuclear charges (after cancellation 
of some nuclear charge by inner-shell electrons). These results indicate high 
polarity, with oxygen being the negative end of the dipole. The predicted 
polarity is unrealistically high because EHMO neglects interelectronic repulsion 
which would tend to counteract such extreme charge imbalance. 


10-11. Number MOs = number AOs = 1 on each H and 4 (valence) on each C = 22. 
Chapter 11 
11-1. Koopmans ASCF Experiment 


11-3. 


11-4. 


2s → 2p    1.0800     1.0830     0.989
1s → 2p    31.9220    31.1921    31.19


        Koopmans − SCF (eV)      SCF − observed (eV)
        (electron relaxation)    (electron correlation)

²B₁     2.71                     −1.54
²A₁     2.52                     −1.40
²B₂     1.86                     −0.90


For electron affinities, these errors should reinforce, rather than cancel, because 
adding an electron should increase electron correlation. 


ad − cb + λ(af − be) = ad + λaf − bc − λbe.
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11-5. 




Neither ψ₁ nor ψ₂ is already the best function in our function space. Hence,
we cannot argue that mixing will bring no improvement.


11-6. This must be true to enable a; to be factored from the expanded form of Ay. 


11-7. 


For a given choice of basis functions, there are two integrals: 


Oa (Dx Dlx Axa) and (xa (xs Q)DExa (D xe Q2) 


There are five ways to choose a function for each position. Thus, the number
of integrals is 2 × 5⁴ = 1,250.


11-8. a) and c) would be prevented from contributing. 


, pees : 1 | 1 | 8 | 9 1 
11-9. Seco > (—4 pe 9 2-7 


10 


Cu CH im fio 


11-10. a) (1/√2)|σ_g(1)σ̄_g(2)|. b) E_elec = −1.804 a.u. c) E_tot = −1.090 a.u.
d) D_e = 0.090 a.u. e) IE (Koopmans) = 0.619 a.u. f) KE + V_ne = −1.185 a.u.


Chapter 12 
1 

12-1. Ey + Wy) = (| Holy) + (TW) = (LH) > Wo. 

12-2. [See Hint] EP > EP > ES”. 

12-3. a) δ. b) δ. c) −δ/2. d) δ/2. e) −δ/2. f) 0.

12-4. a) Ws" —38/4. 

(b-1) Expect ey to cause y to shift to right. Since v» is positive on left of 
box, negative on right, e should be negative. Since y» is above yi, it should 
cause energy to be depressed, and eS leads to a negative contribution to wo. 
(b-2) Because d = —c\), ei» must be positive, giving a gs” that causes charge 
to shift left and a contribution to WS? that is positive. 
TENE ; 
12-5, ie (Qa LH" | ra) - (2/L) fy singcx/L)(Ux/L)sin(4zx/L)dx E 64U L? 
E; — E4 —15z2/21? 15374 
1 
QD. 2UL? CE S $$ 
2E LU EEUU. EM 
21 

12-6. The effect is zero, to first order, because ψ² is symmetric for every state and
the perturbation is antisymmetric.

12-7. [SeeHint.] E = Eo + E1 = jau. + (1s| l/r|1s) 2 —3 a.u. Correction seeks 
to make v less diffuse, yet still spherical. Coe ış Should be positive (augments 
hydrogen 1s at small r and cancels at large r) and C2p, s 1s“ should be zero 
(wrong symmetry). 

12-8. W + Ww) 2 —2 a.u. +2 a.u. 2 0. Reasonable, since must be above —0.5 


a.u. (See Problem 12-1.) 
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12-9. 


12-10. 
12-11. 


12-12. 


12-13. 


12-14. 


12-15. 


12-16. 


12-17. 


a) Zero, because the perturbation is antisymmetric for x, y reflection, while 
the 1s function is symmetric. b) oe ış Vanishes. The relevant integral suffers 
the same symmetry disagreement as in part a). Or, in another view, the 2s AO 
will affect only the diffuseness of y, whereas the perturbation affects polarity. 
nee , should be positive. The perturbation lowers the potential for positive z, 
the wavefunction should skew that way, so 2p, should enter with its positive 
lobe reinforcing 1s at positive z. c) It is negative, reflecting the lower energy of 
the electron distribution as a result of first-order polarization. 


E_π = 4α + 4.9624β + (0.1)(q₄)β; q₄ = 1.4881; E_π = 4α + 5.1112β.


The effect is least at C₆ since q₆ is smallest. First-order result: E = 6α + 7.777β.
Computed result: E = 6α + 7.8546β.


Oye = 41 i y» cies Ej — Ej). This must be negative. dq must 
be negative if 0o is positive. This means that making atom X less attractive 
causes electron density to decrease there. This makes sense. 


Since the total charge must be conserved, the sum of all atom-atom polarizabil- 
ities, including 71,ı must be zero. Since the sum over the polarizabilities in 
Example 12-5 is 0.44277, 7x1 4 = —0.44277. 


a) W™® — hf (because all q = 1). 
b) The lowest-energy one, which is the MO where the central atoms find the 
largest |c|. 


a) Er —9a + 12.11188. 

b) Atom 1 seems likely to have the greater self-atom polarizability because, in 
the cation, the product of squares of HOMO, LUMO coefficients vanishes 
at atom 2. 


[See Hint.] For butadiene: ΔE_π = 4β[(0.3718)² − (0.6015)²] = −0.894β.
For hexatriene: ΔE_π = 4β[(0.2319)² − (0.4179)² + (0.5211)²] = 0.603β. The
energy of hexatriene is lowered, that of butadiene is raised, and so hexatriene
benefits.

For cyclobutadiene: E₀ + E⁽¹⁾ = 4α + 3.577β, E (Hückel) = 4α + 4.000β.
For benzene: E₀ + E⁽¹⁾ = 6α + 7.591β, E (Hückel) = 6α + 8.000β.


a) ME NS 
js by (c/2)8 
c=0 c= + 
H'|yi) (VaL H'|yri) 
b D (w2| | 
fi Ej— E» " Ei — £3 "i 


Since H' depends only on density at C2, it comes out of the integral. [See 


av).] 
pP =04 (—4cB/2V2B)W3 = —0.1768cy3, oP =0 
$i? = -0.1768cy, 
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12-18. 


12-19. 


12-20. 




EZ ^ 5 ET 2 
«v $ à g (since 6,9 = 0) 
pa ix ? di 9 2 


(IV) 


Thus, vy + oy” has more density at C2 than did yi, Y2 + gs” is identical 
to Wo, and y + gs has lost density at C2. 


71,2— —0.17688-! ; 13> —0.265p-!. Both atoms 2 and 3 lose charge, but 
atom 3 loses more than atom 2. 


a) Nondegenerate MO, $4 = (1/ V3) Gn + x2 + xa) is already correct. The 
correct zeroth-order degenerate MOs must give zero interaction element with 
H'. For this H', this means that the overlap between these MOs must be zero 


at Co. Thus, one MO must have a node at C5 [ø = (1/4/2) On — x3) | and the 


other must be orthogonal to it le: = (1//6)(2x2 — x1 — x)|. 

If one uses data from Appendix 6, one obtains, for degenerate MOs, y, = 
—0.8165 xy, + 0.4082 x2 + 0.4082 %3, and y» = 0.7071 x2 — 0.7071 x3. These 
give Hj, — 0.1666cB, H5, — 0.5cB, Hj, = 0.2886cB. Since Hj, #0, one 
knows yı and v» are not correct zeroth-order wavefunctions. Solving the 
determinantal equation gives E; = 0, E» = 0.6666c8. These are the first- 
order corrections to the energies of the two upper levels. Solving for coeffi- 
cients gives, for E = 0, cı = —0.866, c» = 0.500, and so the proper zeroth- 
order wavefunction having a zero first-order correction is 9 = —0.866y + 
0.500 y; = 0.7071 xı — 0.7071 x3. For E = 0.6666cB, c, = 0.500, c» = 0.866, 
andso pP = —0.4082 x + 0.8165 x2 — 0.40823. These are the same functions 
arrived at intuitively above. b) For the above zeroth-order MOs, the densities 
at C» are respectively 1, 0, Z. Therefore, the energy of the lowest level drops 
by cß/3, that for one of the originally degenerate levels drops by 2c6/3, and 
the other level is unaffected (to first order). 


α + 2.1β, α + 0.818β, α + 0.618β, α − 1.418β, α − 1.618β.


12-21. Answers to part a) and b) follow answers for parts c) and d). 


c) E = α ± 2β(0.5)²(2) = α ± β. These agree exactly with benzene orbital energies,
which is reasonable since the unperturbed fragment nonbonding orbitals
combine to give exactly the benzene MOs.

d) a + /2B + 28(0.353)?(2) = a + 1.93138. compared to o + 26 for 
benzene. 
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(Shading means 
negative phase) 


(a) 


The allyl coefficients are renormalized by dividing by √2.


allyl MO combinations benzene MOs


(b) 


12-22. a) Only 73 < 7272 is dipole allowed. 
b) It is polarized along the outer C-C bonds. If we pretend the molecule is 
linear, this is the line connecting the carbons. If we adopt a more realistic 
structure, it depends on whether we choose a cis or trans structure: 
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12-23. 


12-24. 


12-25. 


12-26. 


12-27. 


12-28. 


12-29. 


12-30. 




⟨φ|−z|φ⟩ = ±⟨2s|r cos θ|2p_z⟩ = ±3 a.u. For φ = cos(a)2s + sin(a)2p_z,
maximum dipole occurs when cos a = 1/√2, sin a = ±1/√2, that is, when


o= oP. This is reasonable since the mixing of degenerate states like these 
requires no energy “expense” in terms of the unperturbed hamiltonian. As soon 
as the slightest external field appears, the mixing will occur to the above extent 
to produce maximum dipoles. 


Since, for MO 4, c₃ = c₇, this MO is symmetric for reflection through the yz
plane. The operator x is antisymmetric for this reflection. Therefore, if integral
⟨φ₄|x|φ_n⟩ is to be nonzero, φ_n must be antisymmetric for this reflection. Of
the empty MOs shown, only φ₅ satisfies this requirement. There is no other
symmetry operation present that will independently cause the integral to vanish,
and φ₄ → φ₅ is likely to be the observed transition.


a) The HOMO is antisymmetric for reflection through the yz plane. (Assume
that the coordinate origin is in the center of the 5-10 bond.) The x operator
is also antisymmetric. Therefore, the allowed transition should be to an MO
that is symmetric for this reflection. There are three such MOs, at energies of
+1.000, +1.303, +2.303 in units of −β. But the middle of these disagrees
in symmetry with the HOMO for reflection through the xz plane. Therefore,
only the other two transitions are allowed.

b) Here, the integrals containing y will vanish by symmetry except for
transitions to +0.618 and +1.303, but the latter state disagrees in symmetry with
the HOMO for the yz plane reflection. Hence, only −0.618 → +0.618 is y
allowed.

c) No π–π transition is z allowed.

d) Transitions to 1.303 and 1.618 are not allowed for any polarization. 


The field polarizes the atom, which means that the 2s state acquires 2p character.
But the 2p → 1s transition is allowed, and so the atom now relaxes to the 1s
state.


4 ← 7 allowed, polarized along C–O axis. 3 ← 7 forbidden. 2 ← 7 and 1 ← 7
allowed, polarized perpendicular to molecular plane.


a) α, α → α, α + 0.5β.

b) α, α → α + 0.25β, α + 0.25β. If we view this as a sequence of substitutions,
then case a) is clearly a case of cooperative perturbations since the proper
zeroth-order orbital that is strongly lowered by the first substitution is lowered
again by the second. For case b), the second substitution affects the other
one of the proper zeroth-order orbital energies. To this order, a) should be
more stable since the lowering of energy due to the two electrons in this level
is β, whereas the most that can come from b) is β/2. [Analysis of character
tables (Chapter 13) enables us to see that the degeneracy for case b) must
become split at higher levels of perturbation because a C2v molecule has only
one-dimensional representations, hence cannot have degenerate orbitals.]


Lowest energy rises by 0.0667β. One of the next pair rises by 0.1β and the
other drops by 0.0333β.


—30 
x,y components = 0. z component = 2223 a.u. = ED E 2AB "Cm 
MC: cacb
Chapter 13 


13-1. 
13-2. 


13-3. 


13-4. 


13-5. 
13-6. 


13-7. 
13-8. 
13-9. 


13-10. 


No. “Come about 180°” is needed for closure.


It means that for each operation there is a right inverse as well as a left inverse. 
If AB = E, then A is the left inverse of B and B is the right inverse of A. 


a) C4. b) 4. c) Probably only three, since one tends to think of left face and right 
face as being in same class. However, in the C4 group they are not, because 
there is no operation in the group to interchange them (such as reflection through 
a plane containing the z axis). 


There are many possible arrangements counted in 4! that are not physically 
achievable through operations in the group. For example, 

1—2 1—4 

Il IAI | 

4—3 3—2 


For C3v, all possible arrangements are accessible.

a) Cy. b) Dap. c) Dag. d) Ta. 

a) UU = UU! — 1 (upon explicit multiplication). 
1 

b) Upon explicit multiplication, UAU = f l 


a) Den (yes). b) C», (no). c) Dan (no). d) D3a (yes). e) Cs, (yes). 
a) Dan. b) (1) a5; (2)e"; (3)a5; (Hal. 
[See Hint.] Dog, 


MO number: 1 2 3 4 5 67 89 10 11 12 13 14 15 16 
“Sa aid aii Mei ee an te! 


MO symmetry: b2 a, aj b2 e e e bo e a, bo a, 


The highest occupied MO is 9,10; the lowest empty MO is 8,7, and so the
transition is e → e.


(six or yids) = f ewe@e= f eo eee eco 
(iau) = [eonee= [ener ehr40 
Transition is allowed for (group theory) z polarized light. This means x
polarized for the coordinate system shown.


a) and b) can be checked against the C2v character table.

c) Hydrogens generate characters 2 0 2 0, which is a1 ⊕ b1.

d) The unnormalized symmetry combinations are: a1, 1sA + 1sB; b1, 1sA − 1sB.
13-11.


13-12. 


13-13. 


13-14. 
13-15. 
13-16. 




a) and b) can be checked against the C4v character table.

c) Hydrogens generate characters 4 0 0 2 0 or 4 0 0 0 2, depending on how σv and
σd are selected. Assuming that σv contains corner ammonia molecules gives the
former set. This resolves to a1 ⊕ b1 ⊕ e. (The other choice gives b2 instead of
b1, but reversal of choice of σv and σd has the effect of interchanging the symbols
b1 and b2, and there is no real difference involved in this choice.) For a situation
where the nitrogens are numbered as shown in the figure, the unnormalized
symmetry orbitals are: a1, 2s1 + 2s2 + 2s3 + 2s4; b1, 2s1 − 2s2 + 2s3 − 2s4;
e, 2s1 − 2s3 and 2s2 − 2s4; or 2s1 + 2s2 − 2s3 − 2s4 and 2s1 − 2s2 − 2s3 +
2s4. (Other combinations are also possible, but these two sets are the most
convenient.)
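
The resolution of these characters into irreducible representations can be reproduced with the standard reduction formula, n_i = (1/h) Σ g χ χ_i. The sketch below is an illustration only; it assumes the usual C4v class ordering E, 2C4, C2, 2σv, 2σd and the standard C4v characters, and the names are hypothetical.

```python
# Class sizes for C4v in the assumed order: E, 2C4, C2, 2sigma_v, 2sigma_d
C4V_CLASS_SIZES = [1, 2, 1, 2, 2]

# Standard C4v characters, listed in the same class order
C4V_CHARACTERS = {
    "a1": [1, 1, 1, 1, 1],
    "a2": [1, 1, 1, -1, -1],
    "b1": [1, -1, 1, 1, -1],
    "b2": [1, -1, 1, -1, 1],
    "e": [2, 0, -2, 0, 0],
}


def reduce_representation(chars):
    """Count how many times each C4v irreducible representation occurs."""
    order = sum(C4V_CLASS_SIZES)  # h = 8
    counts = {}
    for label, irrep in C4V_CHARACTERS.items():
        total = sum(g * x * xi
                    for g, x, xi in zip(C4V_CLASS_SIZES, chars, irrep))
        counts[label] = total // order
    return counts
```

Calling `reduce_representation([4, 0, 0, 2, 0])` gives one each of a1, b1, and e, while `[4, 0, 0, 0, 2]` gives a1, b2, and e, matching the interchange of b1 and b2 described above.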


a) Four operations means order 4. Four classes means four representations.
Hence, each representation must be one dimensional, therefore having
character +1 or −1 for every operation. One of these must be +1 everywhere
(a1). The others must all have +1 in the first column and −1 in two of the
other three columns. The result is as given in Appendix 13 for C2v.
Twelve operations gives order 12. Six classes means six representations. 
This must mean two 2 x 2 and four 1 x 1. There is but one unique set of 
orthonormal character vectors that fit this framework. Check against the 
Day character table. 
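
The dimension counting used here rests on two facts: the number of irreducible representations equals the number of classes, and the squares of their dimensions sum to the group order. A small enumeration, offered as a sketch with a hypothetical function name, shows that the dimension sets quoted above are the only ones possible.

```python
def dimension_sets(order, n_classes):
    """All non-decreasing lists of n_classes positive integers whose
    squares sum to the group order."""
    results = []

    def extend(partial, remaining, smallest):
        slots = n_classes - len(partial)
        if slots == 0:
            if remaining == 0:
                results.append(list(partial))
            return
        d = smallest
        # Every remaining dimension is at least d, so d*d*slots must fit.
        while d * d * slots <= remaining:
            extend(partial + [d], remaining - d * d, d)
            d += 1

    extend([], order, 1)
    return results
```

For order 4 with four classes the only solution is four one-dimensional representations, and for order 12 with six classes it is four one-dimensional plus two two-dimensional representations.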


b 


— 


Two two-dimensional representations and two one-dimensional representa- 
tions. 


Check against C2v.


Order = 12. There are six classes.


11 = Pfs 
1ji-11 1 

= Px = sp? set 
2}1-1-1-1] |p, 


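$$
\begin{pmatrix}
1/\sqrt{3} & 0 & \sqrt{2}/\sqrt{3} & 0 \\
1/\sqrt{3} & 1/\sqrt{2} & -1/\sqrt{6} & 0 \\
1/\sqrt{3} & -1/\sqrt{2} & -1/\sqrt{6} & 0 \\
0 & 0 & 0 & 1
\end{pmatrix}
\begin{pmatrix} s \\ p_x \\ p_y \\ p_z \end{pmatrix}
= sp^2 \text{ set}
$$


For each matrix T, T†T = 1.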
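
The unitarity condition can be verified directly for the sp² transformation. The sketch below assumes the matrix entries as read from the answer above (rows are the three hybrids plus the untouched pz, columns are s, px, py, pz); since the matrix is real, T†T reduces to the transpose product. Names are hypothetical.

```python
import math

S3, S2, S6 = math.sqrt(3), math.sqrt(2), math.sqrt(6)

# sp2 transformation as read from the answer above (an assumption of this sketch)
SP2 = [
    [1 / S3, 0.0, S2 / S3, 0.0],
    [1 / S3, 1 / S2, -1 / S6, 0.0],
    [1 / S3, -1 / S2, -1 / S6, 0.0],
    [0.0, 0.0, 0.0, 1.0],
]


def transpose_product(matrix):
    """Return T^T T for a real square matrix T."""
    n = len(matrix)
    return [[sum(matrix[k][i] * matrix[k][j] for k in range(n))
             for j in range(n)]
            for i in range(n)]


def is_orthogonal(matrix, tol=1e-12):
    """True if T^T T equals the unit matrix to within tol."""
    product = transpose_product(matrix)
    n = len(matrix)
    return all(abs(product[i][j] - (1.0 if i == j else 0.0)) < tol
               for i in range(n) for j in range(n))
```

`is_orthogonal(SP2)` evaluates to true, so the hybrid set is an orthonormal recombination of s, px, py, and pz.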


13-17. Yes. Since various hybridized sets are equivalent, the final result of the
calculation is independent of one's choice.


13-18.


13-19.


a) 42 2B, GE. Yes. 
b) Ag Q2 Ba,. No. 
a) e ⊗ b1 ⊗ e = a1 ⊕ a2 ⊕ b1 ⊕ b2. Need not vanish.


b) e ⊗ e ⊗ e = e ⊕ e ⊕ e ⊕ e. Must vanish.
c) a1 ⊗ b2 ⊗ b2 = a1. Need not vanish.
13-20.
13-21. 


13-22. 


d) a1 ⊗ b2 ⊗ a1 = b2. Must vanish.
e) a1 ⊗ e ⊗ b2 = e. Must vanish.


eg > Adu, Cy are x, y allowed. eg — ey is z allowed. 


The symmetry of the molecule is C2v. All π MOs are antisymmetric for C2 so
must be bases for b1 or b2 representations. The function y (for the molecule)
is z (in the table), and so it is a basis for the a1 representation. For the product
φ4 y φn to contain a1 symmetry, it is necessary, then, that φn have the same
symmetry as φ4. φ6 and φ7 do have the same symmetry; therefore, φ4 → φ6, φ7
are y allowed (where y is coincident with the symmetry axis).


a) A nonspecial point above the top face has seven equivalent positions above
that face (into which it can be sent by various symmetry operations), so there are
eight equivalent positions above that face. Because the two sides of the square
are identical, there are another eight equivalent positions below. Therefore, we
expect a group of order 16. (See D4h.) b) Like the square, each face of the cube
has eight equivalent points. The inside of a face is not equivalent to the outside,
so each face is limited to eight equivalent points, not 16. Six faces times eight
points per face yields a predicted group order of 48. (Oh)


MC:dcaedadab 
Chapter 14 


14-1. 


14-2. 


14-3. 


14-4. 


14-5. 


14-6. 


⟨1sA| −½∇² − 1/rA − 1/rB |1sA⟩ = −½ − (1/R) + [(1/R) + 1]exp(−2R), where R is
the distance between nuclei. For R = 2 a.u., Eelec = −0.9725 a.u., Etot = −0.4725
a.u. For R = 1 a.u., Eelec = −1.2293 a.u., Etot = −0.2293 a.u. For R = 3 a.u.,
Eelec = −0.8300 a.u., Etot = −0.4967 a.u.
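
The three pairs of numbers follow from the closed-form expression above, with the total energy obtained by adding the internuclear repulsion 1/R. A minimal sketch in atomic units (function names are hypothetical):

```python
import math


def electronic_energy(r):
    """-1/2 - 1/R + (1/R + 1) exp(-2R), in atomic units."""
    return -0.5 - 1.0 / r + (1.0 / r + 1.0) * math.exp(-2.0 * r)


def total_energy(r):
    """Electronic energy plus the internuclear repulsion 1/R."""
    return electronic_energy(r) + 1.0 / r


for r in (1.0, 2.0, 3.0):
    print(f"R = {r:.0f} a.u.: Eelec = {electronic_energy(r):.4f}, "
          f"Etot = {total_energy(r):.4f}")
```

Evaluating at R = 1, 2, and 3 a.u. reproduces the quoted values to four decimal places.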


Expected QMOT behavior is reversed for very low HAA. The “antibonding”
MO is lower than the “bonding” for HAA = −20. This happens because the
loss of energy involved in removing charge from the atoms is not compensated
by that gained by putting the charge into the bond region.


The molecular electron affinities tend to be greater than for the atoms when the 
extra molecular electron goes into a bonding MO, smaller if into an antibonding 
MO. QMOT leads us to expect the MO energy to be lowered (raised) from 
separated atom levels if the interaction is bonding (antibonding). Koopmans' 
theorem then leads to predictions in qualitative agreement with these data. 


The observed ground-state angles are more consistent with the triplet-state con- 
figuration. The first excited state should then be a singlet and have a smaller 
angle. (Accurate calculations give 103°.)


It should become more bent. (Also, the O-H bonds should lengthen, although 
this is not the kind of geometric change that the figure explicitly treats.) 


The computed results will largely agree with QMOT ideas. However, it is
possible that the 1σg–1a1 level will rise where Walsh's rules predict it will
fall. [Whether this occurs depends upon details of EHMO parameter choices.]
Upon analysis, this turns out to result from a situation similar to that examined
14-7.


14-8. 


14-9. 


14-10. 


14-11. 


14-12. 


14-13. 




in Problem 14-2. That is, the increase in H-H overlap population comes at the 
expense of population elsewhere. If the energy associated with this “population 
elsewhere" is low (i.e., if the original MO energy is low enough) the energy 
cost is greater than the gain due to increased H-H overlap population. This 
inversion of behavior for low levels is ignored in the Walsh approach. However, 
it does not occur for a given MO until it is fairly deeply buried under higher 
filled MOs. Also, since the higher MOs dominate the behavior of the molecule 
anyway, the inversion does not affect our prediction. 


Since the exact answer depends on details of your EHMO program, allow us to 
take this opportunity to toast your good health. 


If S = S0 cos θ, dS/dθ = −S0 sin θ. ΔS for θ = 0 to θ = 30° equals −(1 −
0.866)S0 = −0.134S0; for 90° → 60°, ΔS = −S0(0 − 0.5) = 0.5S0.
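
The two overlap changes can be evaluated directly from S = S0 cos θ. This is a minimal sketch with S0 set to 1, so the results are in units of S0; the function name is hypothetical.

```python
import math


def overlap_change(theta_start_deg, theta_end_deg, s0=1.0):
    """Change in S = S0*cos(theta) between two angles given in degrees."""
    start = math.radians(theta_start_deg)
    end = math.radians(theta_end_deg)
    return s0 * (math.cos(end) - math.cos(start))
```

A 30° rotation starting from 0° changes the overlap by only −0.134 S0, whereas the same rotation starting from 90° changes it by 0.5 S0, which reflects the sin θ dependence of the derivative.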

For a diagram and discussion, see Gimarc [1]. Those with six valence electrons 
(the first three in the list) are planar. Those with eight electrons (the last four) 
are pyramidal. 


Sketches and discussions of AB2 molecules may be found in the literature [2,3].
Molecules with 16 or fewer valence electrons (BeCl2, C3, CO2, N3−) should be
linear. Those with more than 16 should be bent (NO2, O3, F2O).


Bending should increase end-to-end antibonding. These MOs favor the linear 
form [see (V)]. 


* —AN O 


(degenerate) 
(V) 


Bending should increase end-to-end bonding. These MOs favor the bent form 


[see (VI)]. 


606—000 
(degenerate) 


(V) 


Therefore, zt, — my should make molecule more bent. 


For allyl, draw the three π MOs. For the cyclic molecule, draw a C-C bond
(lowest in energy), a single p-π AO on the negative carbon (intermediate energy)
and the C-C antibond (high energy). Conrotatory motion preserves a C2 axis,
disrotatory preserves a reflection plane. For the C2 axis, the allyl MO
symmetries are (in order of increasing energy) A, S, A. For the cyclopropenyl anion,
they are S, A, A. For the reflection plane, the same MOs have symmetries S,
A, S; and S, S, A. The resulting predictions for the four-electron anion are that
thermal closure goes conrotatory, photochemical goes disrotatory.


[See Hint.] The diagram is given in (VII). σ and π refer to symmetry (S or A)
for reflection through the molecular plane. The second symmetry symbol refers
to reflections through a plane between the two acetylenes. The third symbol
14-14.
14-15. 
14-16. 


refers to a reflection orthogonal to the first two (i.e., bisecting both acetylenes
at their bond midpoints). The reaction does not appear favorable for a thermal 
mode. Photochemically it is less unfavorable, but still corresponds to a rather 
highly excited product. A square planar intermediate is unlikely for either mode. 

2C,H, 


(VII) 
No states can differ in symmetry in this case, and no crossings should occur. 
See Woodward and Hoffmann [4, pp. 23, 24]. 
See Woodward and Hoffmann [4, pp. 70, 117]. 


Chapter 15 


15-1. 
15-2. 


15-3. 


15-4. 


(h/2π)k = p = √(2mE) = √(2mT) = h(1/λ).


[See Hint.] dk/dE = (dE/dk)⁻¹ = 4π²m/kh² ∝ 1/k ∝ 1/√E, so plot E vs
1/√E.


[See Hint. ] 

a) For example, φ2 = (1/√6)[pπ1 + exp(2πi/3)pπ2 − exp(πi/3)pπ3 + pπ4 +
exp(2πi/3)pπ5 − exp(πi/3)pπ6]. E2 = α − β.

b) j = 0 gives exp(0) = 1 at every atom; j = 6 gives exp(2(n − 1)πi) = 1 at
every atom.
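
Both parts can be checked numerically for a six-membered Hückel ring. The sketch below assumes Bloch coefficients of the form exp(2πij(n − 1)/6)/√6 and nearest-neighbor interactions only; it returns the quantity x in E = α + xβ, and the function names are hypothetical.

```python
import cmath
import math

N_ATOMS = 6  # six-membered ring


def bloch_coefficients(index, n_atoms=N_ATOMS):
    """Coefficients exp(2*pi*i*index*(n-1)/N)/sqrt(N) for atoms n = 1..N."""
    return [cmath.exp(2j * math.pi * index * n / n_atoms) / math.sqrt(n_atoms)
            for n in range(n_atoms)]


def huckel_x(coeffs):
    """For each atom, x such that (H c)_n = (alpha + x*beta) c_n on a ring.

    A Bloch sum is an eigenfunction when the same x is obtained at every atom.
    """
    n_atoms = len(coeffs)
    return [(coeffs[n - 1] + coeffs[(n + 1) % n_atoms]) / coeffs[n]
            for n in range(n_atoms)]
```

For index 2 every atom gives x = −1, that is, E = α − β, and the third coefficient equals −exp(πi/3)/√6. The coefficient lists for index 0 and index 6 coincide, as stated in part b).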


[See Hint.] 

a) DOS, “normalized” by dividing by N, and in units of |β|⁻¹, is 1/(π sin(hπ)),
where h = j/N and 0 ≤ h ≤ 1/2. E = α − 2|β|cos(hπ). Therefore, DOS =
1/(π sin(arccos[(E − α)/2|β|])).

b) 2N states spread uniformly over an energy range of 4β gives a “normalized”
DOS of 0.5|β|⁻¹.


15-5. For example, φ(j = −1) = (1/√6)[pπ1 + exp(−πi/3)pπ2 + exp(−2πi/3)pπ3 −
pπ4 − exp(−πi/3)pπ5 − exp(−2πi/3)pπ6]. Sum of j = ±1 functions divided
by √2 gives one real function, and difference divided by i√2 gives the other.


15-6. Requires that exp(−2πiqnN/n) equal 1. This requires that the argument equal
πi times an even number, which is the case if qN is an integer. Since q and N
are integers, so is qN.


(C;H;), at |k] = 7/3a 


(CH);, at |k| = m/7a 


15-7. (CH)2n has lower energy. Variation would not affect (CH)2n, but would mix π
and π* versions of (C2H2)n to produce COs like those for (CH)2n. (Note that,
for convenience, we have arranged phases so that the node is identically placed
in these diagrams.)


15-8. The HOMO and LUMO at |k| = π/3a respond oppositely to distortion that
lengthens every third bond and/or shortens the others. 


15-9. Symmetry for reflection through a plane perpendicular to the polymer axis is
not useful for this analysis. Symmetry for reflection through a plane containing
the atoms divides the COs into σ and π types. Only σ–π crossings are allowed.


15-10. The σ bands involve 2s and 2p AOs. One band pair includes 2s AOs with 2p
AOs that are oriented parallel to the screw axis. The other band includes no 2s 
AOs and only 2p AOs perpendicular to the screw axis. The former functions 
form a basis that is symmetric for the symmetry operation, whereas the latter 
form a basis that is antisymmetric: 


hd = . e 
° Foz d ° symmetric (s or p) 


e, 9 e 9, . e 
— antisymmetric 


15-11. At high pressures, intermolecular distance becomes the same as intramolecular 
distance. This yields a zero “gap” at the Fermi level (same as polyacetylene 
with uniform bond lengths). 


15-12. 


0 k ala 
15-13.


15-14. 


15-15. 


15-16. 


15-17. 


15-18. 


15-19. 


15-20. 


a) εHOMO = −8.19 eV. b) εLUMO = −4.09 eV. c) Egap = 4.10 eV. d) Gap
should open (both up and down) so εHOMO becomes lower and IE is higher.
εLUMO is higher and EA is less negative. Computed values: IE = 8.39 eV, EA =
−2.26 eV, Egap = 6.13 eV.


In Chapter 8 we show correlations between εHOMO and IE and between εLUMO
and EA, with different values for β in each case. The difference in interelectronic
repulsion and exchange for neutral and anionic species is handled implicitly in
this manner by simple Hückel theory.


Only π4 has the proper symmetry to combine with π1 to make c1 = c4 ≠ c2 = c3
= c5 = c6. Mixing causes the bands to split apart. π1 is pushed down by π4,
and π4 is pushed up by π1. The coefficients decrease on carbons 1 and 4 in π4,
consistent with a rise in energy due to less bonding.


No. Atoms 2 and 6 have zero coefficients in the appropriate COs (see nA of 
Fig. 15-26), but atom 5 has a nonzero coefficient. 


The band diagram is identical to the one in Fig. 15-23 except that the α + β
level at k = 0 has an intended correlation with α − β at k = π/a, and vice versa.
However, these have the same symmetry (anti for σ2), hence avoid crossing.
The resulting diagram is identical to that in Fig. 15-23 except that band π2 has
an intermediate hill and π3 an intermediate valley, with height and depth unknown
until a variational calculation is performed.


Edge energies without substitution at k = 0: α + 2.5β, α, α − β, α − 1.5β. At
k = π/a: α + 1.5β, α + β, α, α − 2.5β. (Proper zeroth-order MOs for the E = α
level of cyclobutadiene are the versions having nodes through atoms rather than
through bonds.) The lines at α, α − β for k = 0 cross. This is allowed by
symmetry disagreement for reflection through the plane containing the polymer
axis and perpendicular to the molecular plane. Edge energies are lowered by
substitution by the following amounts as we move up the diagram on the k = 0
side: β/8, β/4, 0, β/8. On the k = π/a side: β/8, 0, β/4, β/8. Symmetry
is maintained so the crossing is still allowed. For only one substitution, the
energy lowerings are half as great. Symmetry is lost, and the crossing becomes
avoided.


Like Fig. 15-28a with the middle horizontal row shaded. Higher in energy than 
T, but lower than X or M. 


la) (bl (c) (d) 
o © © O -O0 O OO 80 . © 


o 00 O.:.0 000 ©: © 


(7/2a,, mlaz) (lai, 7/2a2) (/2a,, 0] (n/2ai, 7/2a2) 
E> E, Eg» E, 
15-21.


T'(0, 0) X(0, 7/2) X' (1/2, 0) 

The energy level for X is lower than those for Γ and X′. For X, all interactions
are bonding. For Γ and X′, nearest-neighbor interactions are antibonding and
longer-range interactions of both bonding and antibonding types occur. Judging
the relative energies of Γ and X′ is complicated since we need to evaluate
relative magnitudes of interactions between p AOs on the same long edge of
the rectangle and also on opposite corners. Since p AO interactions depend on
distance and angle, this is not trivial, as it is for s-type AOs.


15-22. All are nondegenerate. 


15-23. The RFBZ is triangular. Referring to Fig. 15-27c, the RFBZ shown there
becomes square, but symmetry now makes points on opposite sides of the Γ–M
line equivalent (e.g., X and X′ become equivalent), so only the Γ–X–M–Γ line
is needed.
15-24. 
r M 
I is higher in energy 
Both are nonbonding and would have the same energy. 
15-25. (n/a, —mla) p, 
(«/a, 0) 
(0, /a) 
LE 


(0, w/a) 


COs at (0, π/a) and (π/a, 0) are the same except that the nodal planes are
rotated by 60°.
15-26.


15-27. 


The crystal is symmetrical for reflection through a hexagonal sheet. Plane waves 
experiencing the same potential and having the same |k| are degenerate. 


At the top of the RFBZ, unit cell functions are stacked with sign reversal. If the
upper and lower phases of the unit cell functions agree, this means that we must 
have a bonding interaction in the cell, but we will also get antibonding between 
cells, hence no splitting. On the other hand, if the upper and lower phases are 
opposite, we must have antibonding in the cell but we now get bonding between 
them, again for no splitting. The same is true for sigma bonds. 


15-28. 

m(M) + m*(M) z(M) - z*(M) 
Appendix 2 
A2-2. a) 2. b) 1. c) 0. d) x = ±2.


A2-4. 
A2-5. 
A2-6. 


Determinant of coefficients vanishes, and so nontrivial roots do exist.
c = 2, −1 ± √7 are the roots.


a) If two rows or columns differ by factor c, then multiplication of the smaller
row or column by c gives |M′| = c|M| (by rule 1), and M′ has two identical rows
or columns. Interchanging these gives M″, and |M′| = −|M″|. But M′ = M″
because the interchanged rows or columns are identical. Therefore, |M′| =
−|M′|, and |M′| = 0. Therefore, |M| = |M′|/c = 0.


Appendix 4 


A4-1. 


A4-2. 


A4-3. 


From Eq. (A4-47), LzL+ = L+(Lz + 1). Then LzL+Yl,m = L+(Lz + 1)Yl,m =
L+(km + 1)Yl,m = (km + 1)L+Yl,m.


L+L− = (Lx + iLy)(Lx − iLy) = Lx² + Ly² + iLyLx − iLxLy. Equation (A3-
31) tells us that iLyLx − iLxLy = −i²Lz = Lz, so L+L− = Lx² + Ly² + Lz =
Lx² + Ly² + Lz² − Lz² + Lz = L² − Lz² + Lz. Therefore, L² = L+L− + Lz² − Lz.


Equation (A4-55) indicates that the result of L± on Yl,m remains an
eigenfunction of L² with the same eigenvalue, i.e., that L²L±Yl,m = L²C±Yl,m±1 =
klC±Yl,m±1. But also, L±L²Yl,m = L±klYl,m = C±klYl,m±1, so Eq. (A4-55)
indicates that L± and L² commute. This is easily verified since L± is a linear
combination of Lx and Ly, both of which commute with L². We can
establish Eq. (A4-55) by evaluating the result of the reverse order of operations and
using the fact that L± and L² commute: L±L²Yl,m = L±klYl,m = klL±Yl,m =
klC±Yl,m±1. But L±L²Yl,m = L²L±Yl,m, so L²L±Yl,m = klC±Yl,m±1. QED.
A4-4. Lx = (1/2)(L+ + L−); (1/2)⟨Yl,m|L+ + L−|Yl,m⟩ = (1/2)[C+⟨Yl,m|Yl,m+1⟩ +
C−⟨Yl,m|Yl,m−1⟩] = 0 + 0.


A4-5. a) (10) (i) =0. 


0 
b) s= sise 2 


STE 
EOE 
ase tastesies= 1|(? Nee) Co) o^) 


Appendix 8 
A8-1. T̂ = −½ d²/dx², V = ½kx². Equation (A8-19) gives Eη = η²T + η⁻²V.
dE/dη = 0 = 2ηT − 2η⁻³V. For an exact solution, η = 1, and T = V.


A8-2. V = −3 a.u., T = ½ a.u., η = −V/2T = 3, ψη = √(η³/π) exp(−ηr) =
√(27/π) exp(−3r), Eη = η²T + ηV = 9/2 − 9 = −4.5 a.u.
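
The numbers in A8-2 follow from minimizing the scaled energy with respect to η. The sketch below assumes a Coulomb potential, for which the kinetic term scales as η² and the potential term as η; the function names are hypothetical.

```python
def scaled_energy(eta, kinetic, potential):
    """E(eta) = eta**2 * T + eta * V for a Coulomb potential."""
    return eta ** 2 * kinetic + eta * potential


def optimal_scale(kinetic, potential):
    """Scale factor from dE/d(eta) = 2*eta*T + V = 0."""
    return -potential / (2.0 * kinetic)


eta_best = optimal_scale(0.5, -3.0)
print(eta_best, scaled_energy(eta_best, 0.5, -3.0))
```

With T = ½ a.u. and V = −3 a.u. the optimum scale factor is 3 and the energy is −4.5 a.u., in agreement with the answer above.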
C-N length, 555
CNDO/1, 386t 
Commuting operators, 175-176
electron motion and, 378
size consistency of, 366 
truncated, 366 
Conjugate variables, 18 
Conrotatory closure, 508, 513
Index


Correlation diagrams 

configuration, 519f 

description of, 46 

orbital, 519 

state, 519f 

symmetry in, 514 

symmetry-forbidden, 517 
Correlation energy, 357—358 
Correspondence principle, 31 
Coulomb integral, 249, 413, 587—590 
Coulomb operator, 350—351, 620 
Coulomb terms, 369 
Coupled cluster theory, 366—367 
Covalent character, 363 
in linear homogeneous equations, 585
secular, 200-201, 211 
Slater, 137—139, 349, 369, 622 
topological, 250 
2 x 2, 584 
Determinantal equation, 248 
Diagonal matrix, 311 
Diatomic molecules 
description of, 84—85 
closed shells of, 226
electronic states of, 229 
Bloch's theorem, 533-534
description of, 20, 32
lowest-energy, 190
nondegenerate, 131, 218 
1s, 192 
symmetry of, 244
unperturbed, 395
description of, 20
extended Hückel method, 328—331 
formaldehyde, 343 
of Hermitian operators 
completeness of, 176-177 
degenerate, 174 
expressed as an orthonormal set, 
nondegenerate, 173-174
orthogonal set formed from, 
173-174 
proof of, 172-173 
351-352
396-403
π-electron energy, 279–284
159-160
Electron index order, 618
Electron motion, 378
π-electron repulsion energy, 277
Elliptical coordinates, 217




Energies 
description of, 30-32
experimental energies and,
340—342 
Mulliken populations and, 
338-340 
ground state 
description of, 368, 371t 
to first-order of heliumlike 
systems, 403—405 
Energy level splitting, 47 
Energy to first order, 392, 397—399 
Energy to zeroth order, 392 
Equation of motion, 69—70 
Equivalent electrons, 156—159 
Equivalent orbitals, 472 
Equivalent representations, 445, 458 
Ethane, 440 
Ethylene, 263-264, 606—607 
Ethylene molecular orbitals, 268 
Exchange integrals, 142, 619 
Exchange operator, 351 
Exclusion principle, 136, 146 
Experimental energies, 340—342 
Explicit integration, 397—398 
Exponentials, 39 
Extended Hückel energies
experimental energies and, 340—342 
Mulliken populations and, 338-340 
Extended Hückel method 
band calculations, 560 
basis set, 324—325 
description of, 324, 541 
eigenvalues, 328-331 
eigenvectors, 328-331 
hamiltonian matrix, 326—328 
K parameter, 333-335 
Mulliken populations, 335—340 
nuclear coordinates, 324 
overlap matrix, 325-326, 344 
for polyacetylene, 552-554 
total energy, 331—332 
External potential, 368 


F 
Fermi contact interaction, 193 
Fermi energy, 540, 550, 577 
Fermion, 136 
Filled molecular orbitals, 472 
Finite central barrier, particle in an infinite 
“box” with, 44-47 

First Brillouin zone 

description of, 536, 564 

reduced, 540, 564, 566, 577 
First-order corrections 

to y1, 399-401 

description of, 392-394 
First-order Stark effect, 411
First-order structure, 485 
Fock operator, 350-351 
Force constant, 69 
Formaldehyde eigenvalues, 343 
Formaldehyde orbital numbering, 343 
Free particle in one dimension 
description of, 47—50, 526—529 
particle in a ring problem and, 
similarities between, 52 
Free radical reactions, 292 
Free valence, 292 
Frequency factor, 4 
Frontier orbitals, 291, 505—508 
Frozen atomic orbitals, 486 
Fulvene, 606, 610 
Functions 
acceptable, 22 
angle-dependent, 97 
Bloch, 534 
continuous, 532 
Dirac delta, 170 
distribution, 70 
electron probability density, 93 
Gaussian, 355-356 
Kronecker delta, 38 
Legendre, 108, 589 
linear combination of, 7 
linearly independent, 77 
orthogonal, 385 
orthonormal, 38 
polarization, 355 
single-valued, 21 
square-integrable, 22 


G 
Gauss error function, 80 
Gaussian basis set, 353 
Gaussian functions, 183, 355—356 
Gaussian wave packets, 184 
Generalized gradient approximation, 370 
Gerade, 213, 215, 548 
Gimarc's diagram, 502 
Givens-Householder-Wilkinson method, 320 
Graphite, 565-576 
Gross atomic orbital population, 337 
Ground state energy 
description of, 368, 371t 
to first-order of heliumlike systems, 
403-405 
Group 
abelian, 430 
definition of, 430 
point, 441—443 
representations for 
from basis functions, 446—451 
cyclic groups, 453—456 
description of, 443-446 
irreducible inequivalent, 456-458 
labels, 636–637
labels for, 451—452 
molecular orbitals and, 452—453 
one-dimensional, 444, 454, 473 
two-dimensional, 444, 454 
symbols for, 635-636 
symmetry point, 431—434 
Group theory 
elementary example of, 429-430 
overview of, 429 


H 
H2, 488-490 
H', 401 
H2+ molecule-ion
antibonding states of, 487
bonding states of, 487
description of, 206-220, 485-488
H2 vs., 488-490
H3AAH3 system, 501 
HAB, 499, 500f 
HAH, 497, 499 
Hamiltonian, 20, 349, 404 
Hamiltonian matrix 
description of, 319, 326-328 
integrals of, 473 
partitioned, 469f 
Hamiltonian operators, 20, 53, 131—132, 
180, 593—595 
Harmonic electric-field wave, 9f 
Harmonic motion, 70 
Harmonic oscillation, 2 
Harmonic oscillator 
one-dimensional, 69—72 
quantum-mechanical, 72-74 
Schrödinger equation for
asymptotic behavior, 74—75, 
78-79 
description of, 72-73 
differential equation for q(x), 
75-76 
energy spectrum, 79-80 


as a power series, 76-77 
recursion relation for, 77—78 
simplifying of, 74 
wavefunctions, 80—81 
wavefunctions 
description of, 73—74, 80-81 
normalization of, 81—82 
orthogonality of, 81-82 
zero-point energy of, 74 
Harmonic wave, 2, 3f 
Hartree-Fock energy 
correlation energy, 357—358 
density functional theory and, 381—382 
estimating of, 371 
restricted, 357, 360 
unrestricted, 357 
Hartree–Fock energy curve, 381
Hartree—Fock equation 
atomic orbitals used with, 623 
definition of, 623 
derivation of, 614—623 
description of, 350 
Hartree—Fock limit, 357, 627 
Hartree—Fock wavefunctions, 375t, 627 
Heat of hydrogenation, 279 
Heisenberg uncertainty principle, 18-19, 31 
Helium atom 
description of, 134 
nonlinear variation for, 194—197 
1s2s configuration of, 138-144 
Heptatrienyl, 606 
Heptatrienyl radical, 611 
Hermite polynomials, 82 
Hermitian adjoint of a matrix, 310 
Hermitian matrix, 317 
Hermitian operators 
description of, 171—172 
eigenvalues of 
completeness of, 176-177 
degenerate, 174 
expressed as an orthonormal set, 
174-175 
nondegenerate, 173-174 
orthogonal set formed from, 
173-174 
proof of, 172-173 
Hermiticity, 174, 213 
Heteroatomic molecules, 284—287, 285t 
Hexatriene, 510—511, 606, 610 
Highest occupied molecular orbitals, 275, 
291, 493, 505, 507f, 519, 540 
Homogeneous magnetic field, 133 
Homonuclear diatomic molecules 
closed shells of, 226 
electronic states of, 229 
molecular orbitals of, 220—231 
properties of, 225t 
qualitative molecular orbital theory 
rules applied to, 490—494 
symmetry orbitals of, 227f 
Hooke's law, 69 
Hückel molecular orbital method 
assumptions, 601 
benzene, 260, 267—268 
bond length, 269—270 
bond order, 269-270 
butadiene, 264—265, 279 
charge distributions from, 256-259 
cyclobutadiene, 265—267 
determinant 
α quantity, 248–249
β quantity, 249
basis atomic orbitals, 247—248 
constructing of, 247—248 
generalizations, 259—263 
manipulation of, 249–250
overlap integrals, 249 
topological, 250 
determinantal equation 
for cyclopropenyl system, 
253-256 
description of, 250 
solving for molecular orbitals, 
251-253 
solving for orbital energies, 
250-251 
ethylene, 263—264 
heteroatomic molecules, 284—287, 285t 
for hydrocarbons, 268—269 
independent π-electron assumption,
246-247 
overview of, 244 
perturbation at an atom in, 406—409 
reaction indices, 289-295 
self-consistent variations of α and β,
287–289
σ-π separability, 244–246
summary of, 295-296 
Hund's rule, 148, 159—160 
Hybrid orbitals, 470-471 
Hydrocarbons 
Hückel molecular orbital method for,
268—269 
Möbius conjugated, 283
neutral alternant, 602 
Hydrogen atom 
nonlinear variation for, 191—194 
polarizability of, 197-206, 410-412 
s-type states of, 418 
Hydrogenlike orbitals, 146, 208 
Hyperfine splitting constant, 272 


I 
Identity operation, 459 
Improper axis, 436 
Improper rotation, 436 
Independent electron approximation, 
127-129 
Independent π-electron assumption, 246–247
INDO, 386t 
Induced dipole, 411—412 
Inductive model, 286 
Inhomogeneous magnetic field, 133 
Inner repulsion energy, 588 
Instantaneous dipole moment, 171 
Integrals 
bond, 249 
Coulomb, 249, 413, 587—590 
exchange, 142 
list of, 582-583 
overlap, 249 
resonance, 249 
Integrand, 472 
Intended correlations, 545 
Interaction element, 204
Intermediate neglect of differential overlap, 
386t 
Internal rotation, 485 
Internuclear repulsion energy, 209 
Inverse matrix, 311—312 
Inverse operation, 430 
Ionization energy 
description of, 278-279 
of neon, 372t 
valence state, 327 
Irreducible inequivalent representations, 445, 
456-458 


J 
Jacobi method, 319 
Jahn-Teller theorem, 258 


K 
K parameter, 333-335 
Kinetic energy 

description of, 43 

equation for, 11 

of photoelectrons, 11, 12f, 278 
Kinetic energy operators, 140 
Kohn-Sham orbitals, 369—370 
Koopmans' theorem, 358—360, 372, 472 
Kronecker delta, 82, 170 
Kronecker delta function, 38 


L 
L?, 595-600 
Lagrangian multipliers, 620 
LCAO-MO-SCF equation. See Linear 
combination of atomic 
orbitals-molecular 
orbitals-self-consistent field 
equation 
Legendre functions, 108, 589 
Legendre polynomials, 107 
Light 
as electromagnetic wave, 9-10 
electromagnetic field theory of, 12 
Light intensity, 17f 
Linear combination, 7 
Linear combination of atomic 
orbitals—molecular orbital 
calculation, 208 
Linear combination of atomic 
orbitals—molecular 
orbitals—self-consistent field 
equation 
density functional theory methods, 
368-370 
eigenvalues, 351—352 
ground state wavefunction, 361 
multideterminant methods, 367 
overview of, 350-351 
Linear harmonics, 110 
Linear homogeneous equations, 585
Linear momentum, 18 
Linear polyenes, 262 
Linear position, 18 
Linear variation method 
matrix 
formulation, 315-317 
hamiltonian, 319 
overview of, 308 
Linearly dependent functions, 7, 76 
Linearly independent functions, 77 
Lithium, 135 
Local density approximation, 370 
Localization energy, 294 
Localized orbitals, 472 
Lowest unfilled molecular orbitals, 275, 291, 
505, 509 
Lowest-energy eigenfunction, 190 
Lowest-energy molecular orbitals, 222, 252, 
262, 327, 520 
Lowest-energy wavefunction, 93-97 
L-S coupling, 152 
Lz, 595-600 


M 
Magnetic moment 
angular momentum and, 115-117 
definition of, 115 
Magnitude, 592 
Mass density, 95 
Matrix 
addition of, 310-311 
complex conjugate of, 309, 311-312 
definition of, 308 
diagonal, 311 
of eigenvalues, 316 
of eigenvectors, 316 
expression of, 308—309 
geometric model, 312-314 
hamiltonian, 319, 326-328 
hermitian, 317 
hermitian adjoint of, 310 
inverse, 311—312 
multiplication of, 310—311 
nonsingular, 314 
orthogonal, 314 
product of, 311—312 
rotation, 313 
similarity transformation, 314 
singular, 314 
square, 311 
symmetric, 310 
transformation, 314 
transpose of, 309-312 
unit, 311 
Matrix equation, 317—320 
Matter, wave nature of, 14—16 
Maxwell’s differential equations, 10 
McConnell relation, 272 
Metastable states, 501
Methane, 339 
2-Methoxyethanol, 277 
Methylene cyclopropene, 606, 608 
Microscopic systems, 18 
MINDO/3, 386t 
Mirror A universe, 434—435 
Møller-Plesset perturbation theory, 366
MNDO, 386t 
Mobile bond order, 259 
Möbius conjugated hydrocarbons, 283
Modified intermediate neglect of differential 
overlap, 386t 
Modified neglect of diatomic overlap, 386t 
Molecular orbitals 
π, 416, 418
antibonding, 217, 226, 258 
atomic orbitals in, 465—467 
benzene, 530—531, 535, 556 
bonding, 217 
for cyclopropenyl system, 255 
definition of, 208 
delocalized, 472 
ethylene, 268 
filled, 472 
frontier, 505—508 
hierarchy in, 484—485 
higher-energy, 490, 498 
highest occupied, 275, 291, 493, 505, 
507f, 519, 540 
of homonuclear diatomic molecules, 
220-231 
Hückel molecular orbital determinantal
equation solved for, 251—253 
for linear polyenes, 262 
lowest unfilled, 275, 291, 505, 509 
lowest-energy, 222, 252, 262, 327, 520 
nonbonding, 252, 602-604 
nondegenerate, 329, 467 
occupied, 362 
representation table and, 452—453 
self-consistent field—molecular orbitals, 
384—386 
subjacent, 506 
superjacent, 506 
symmetry of, 296, 463—465 
virtual, 362 
Molecular rotation, angular momentum in, 
117-118 
Molecules 
ab initio calculations for, 374—382 
homonuclear diatomic. See 
Homonuclear diatomic 
molecules 
point group of, 441—443 
Mulliken populations 
description of, 335 
extended Hückel energies and, 338-340 
net atomic orbital, 335
overlap, 335-336, 345, 541
Multiconfigurational self-consistent field
calculation, 367


N
Naphthalene, 606, 613
NDDO, 386t
Nearest-neighbor interaction, 484-485, 496
Negative eigenvalues, 93
Neglect of diatomic differential overlap, 386t
Neutral alternant hydrocarbons, 602
Newton's laws of motion, 4-5
Nodes, 3
Nonbonding molecular orbitals, 252,
602-604
Nonconcerted process, 509
Noncrossing rule, 230
Nondegenerate eigenfunctions, 131, 218
Nondegenerate state, 6, 31
Nondegenerate wavefunctions, 448
Nonequivalent electrons, 153-154
Nonsingular matrix, 314
Norbornadiene, 416-417
Nuclear motion, potential energy for, 207
Nuclear spin angular momentum, 599
Nucleophilic aromatic substitution, 289


O
Occupied molecular orbitals, 362 
Occupied orbitals, 353 
Octatetraene, 606, 612 
Off-diagonal determinantal element, 204 
One-dimensional harmonic oscillator, 69—72 
One-dimensional representations, 444, 
454, 473 
One-electron density function, 256 
One-electron energy, 128, 352 
Open-shell systems, 373 
Operators 
angular momentum, 52, 593—595, 599 
commuting, 175-176 
constructing of, 167—168 
coulomb, 350-351, 620 
definition of, 20 
exchange, 351 
Fock, 350-351 
hamiltonian, 20, 53, 180, 593-595 
permutation, 614 
postulate for constructing, 167—168 
quantum-mechanical, 592—593 
Orbital(s) 
atomic. See Atomic orbitals 
crystal, 539, 559f 
description of, 101—102 
equivalent, 472 
frontier, 291, 505—508 
hybrid, 470-471 
hydrogenlike, 146, 208 
interaction between, 414-417
Kohn-Sham, 369-370 
localized, 472 
molecular. See Molecular orbitals 
occupied, 353 
separated-atom, 228f 
Slater-type 
basis sets, 353-354 
contracted Gaussian functions for, 
356 
description of, 146-147, 324 
minimal valence, 471 
p-type, 354 
symmetry, 220—222, 227f 
Orbital approximation, 233-235 
Orbital energies 
definition of, 352 
description of, 128 
internuclear separation for He₂, 489f
ionization energies and, 278—279 
oxidation-reduction potentials and, 
275-277 
solving of Hückel molecular orbital
determinantal equation for, 
250-251 
Orbital orthogonality, 617 
Orbital symmetries, 515 
Oriented dipole, 133 
Orthogonal functions, 385 
Orthogonal matrix, 314 
Orthogonal transformation, 314 
Orthogonality 
in irreducible inequivalent 
representations, 456—458 
of vectors, 457 
of wavefunctions, 37-38, 81-82 
Orthonormal functions, 38 
Orthonormal set, 174—175 
Oscillating electric charge, 10f 
Oscillating particle distributions, 527 
Oscillation, harmonic, 2 
Outer repulsion energy, 588 
Overlap integrals, 249 
Overlap matrix, 325—326, 344 
Overlap population, 335—336, 345, 541 
Oxidation-reduction potentials, 275—277 


P 
p orbitals, 149 
Pairing of roots, 601—602 
Pairing theorem, 262 
Particle(s) 
in an infinite “box” with a finite central 
barrier, 44—47 
in a one-dimensional box 
definition of, 27 
energies, 30-32 
with one finite wall, 38—43 
overview of, 27—30 
wavefunctions, 32-38 
in a ring
of constant potential, 50-52 
description of, 529—530 
scattering of, in one dimension, 56-59 
in a square well, 27 
in a three-dimensional box, 53—56 
Partitioned hamiltonian matrix, 469f 
Pauli principle, 137—138, 148, 156, 178, 268 
Peierls distortion, 550, 570, 577 
Pentadienyl, 606 
Pentadienyl radical, 609 
Periodic potential, 526 
Periodic structures, 528 
Periodicity 
in three dimensions, 565-576 
two-dimensional, 562-565 
Permanent dipole, 411 
Permutation operator, 614 
Perturbation 
definition of, 391—392 
uniform electrostatic, 396—403 
Perturbation theory 
degenerate-level, 412-414 
Møller-Plesset, 366
overview of, 391—392 
Rayleigh-Schrödinger
V» effects on, 402—403 
additive constant in H’, 401 
at an atom in the simple Hückel 
molecular orbital 
method, 406—409 
for degenerate state, 409-410 
first-order corrections, 392—394, 
399-401 
formal development of, 391—396 
ground-state energy to first-order 
of heliumlike systems, 
403—405 
overview of, 391 
WO, 401-402 
spectroscopic selection rules and, 
417-420 
Phase factor, 3 
Φ equation, 106
Photoelectric effect, 10-14 
Photoelectrons, 11, 12f, 278 
Photons 
characteristics of, 14 
definition of, 12 
Einstein’s relation for, 14 
rest mass of, 14 
π bands, 554-555
Piecewise continuous, 28 
Planck, 10 
Planck's constant, 18, 631 
Point group of a molecule, 441—443 
Point of inversion, 436 
Polarizability of the hydrogen atom, 
197—206, 410-412 
Polarization functions, 355
Polyacetylene 
all-trans, 551—552 
with alternating bond lengths, 547—551 
energy calculations, 562 
extended Hückel molecular orbital 
method results for, 552-554 
self-consistent field results for, 552-554 
with uniform bond lengths, 537—546 
Polyatomic molecules, 495—505 
Poly-paraphenylene, 555-561 
Population density, 95 
Postulates 
for average values, 171 
for constructing operators, 167—168 
measured values related to eigenvalues, 
169-170 
Schrödinger equation, 168-169
wavefunction, 166—167 
Potential energy 
determination of, 383 
of electrons, 397 
for nuclear motion, 207 
quantum-mechanical average value of, 
83-84 
Probability density 
description of, 13 
volume-weighted, 95f 
Probability waves, 13 
ψ
as stationary state, 180 
conditions on, 21—22 
triple value of, 22f 
y», 402-403 
p-type atomic orbital, 219 
p-type Slater-type orbitals, 354 
2px orbital, 101—102 
2py orbital, 101-102
Pyrrole molecule, 286 
2pz orbital, 101—102 


Q 


Qualitative molecular orbital theory 
H₂⁺, 485-488
molecular orbitals, 484—485 
molecular structure, 484—485 
need for, 484 
of reactions, 508—521 
rules for 
description of, 490 
homonuclear diatomic molecule 
applications, 490—494 
Quantum numbers, 30, 97-98 
Quantum-mechanical average, 96-97 
Quantum-mechanical average value of 
potential energy, 83-84 
Quantum-mechanical harmonic oscillator,
72-74
Quantum-mechanical operators, 592-593
Quantum-mechanical tunneling, 41 
q(x), 75-76 


R 
R equation, 108-109 
Radial node, 98 
Radical addition, 289 
Rayleigh-Ritz variation principle 
description of, 178 
proof of, 190 
Rayleigh-Schrödinger perturbation theory
w2 effects on, 402—403 
additive constant in H’, 401 
at an atom in the simple Hückel 
molecular orbital method, 
406—409 
for degenerate state, 409—410 
first-order corrections, 392—394, 
399-401 
formal development of, 391—396 
ground-state energy to first-order of 
heliumlike systems, 403—405 
overview of, 391 
WO, 401-402 
Reaction indices, 289—295 
Reactions 
electrocyclic, 508 
qualitative molecular orbital theory of, 
508-521 
stereospecific, 509 
Reciprocal lattice, 564 
Reciprocal operation, 430 
Reciprocal space, 562—565 
Recursion relation, 77—78 
Reduced first Brillouin zone, 540, 564, 
566, 577 
Reducible representations, 445, 462—463 
Redundancies, 432 
Reflection plane, 436 
Reflection symmetry, 205 
Representations 
from basis functions, 446—451 
cyclic groups, 453-456 
description of, 443-446 
irreducible inequivalent, 456—458 
labels, 636—637 
labels for, 451—452 
molecular orbitals and, 452—453 
one-dimensional, 444, 454, 473 
two-dimensional, 444, 454 
Repulsive energy, 588 
Resonance energy, 281 
Resonance energy per electron, 281, 282f 
Resonance integrals, 249 
Rest mass 
description of, 14 
of electron, 631 
of neutron, 631 
of proton, 631 
Restricted Hartree-Fock energy, 357, 360
Rigid-rotor model, 117-118
Ring of constant potential, particle in, 50-52
Rotation axis, 436
Rotation matrix, 313
Row vector, 309
Russell-Saunders coupling, 152
Rydberg series, 232
Rydberg state, 232


S 
1s atomic orbitals, 498, 587-590
2s orbital, 101 
Scalar, 309—310, 312 
Scanning tunneling microscopy, 575 
Schmidt orthogonalization, 175, 468 
Schrödinger equation 
in atomic units, 110 
center-of-mass coordinates, 90 
for circular motion, 50 
description of, 19-20, 22-23, 89-91 
eigenvalues, 92-93 
in free particle in one dimension, 47-48 
harmonic oscillator 
asymptotic behavior, 74—75, 
78-79 
description of, 72-73 
differential equation for q(x),
75-76
energy spectrum, 79-80
as a power series, 76-77
recursion relation for, 77—78 
simplifying of, 74 
wavefunctions, 80-81 
higher-energy solutions for, 98—105 
lowest-energy wavefunction, 93-97 
for one-dimensional harmonic 
oscillator, 72—73 
for particle in the one-dimensional
square well, 54
for periodic structures, 528 
postulate for, 168—169 
quantum numbers, 97—98 
separation of variables, 105—106 
time-dependent, 168-169, 180 
Second-order Stark effect, 411 
Second-order structure, 485 
Secular determinant, 200-201, 211 
Secular equation, 200 
Self-consistent field 
acronyms, 386 
atomic orbitals, 147 
configuration interaction, 360—365 
definition of, 348 
description of, 146—147 
equations, 622 
hamiltonian, 349 
Hartree—Fock limit, 357 
Koopmans' theorem, 358-360
multiconfigurational, 367 
polyacetylene results, 552—554 
total electronic energy, 352—353 
wavefunction for, 349-350 
Self-consistent field—molecular orbitals, 
384—386 
Separated-atom basis, 208 
Separated-atom energy, 487—488 
Separated-atom limits, 208 
Separated-atom orbitals, 228f 
σ bond network, 295
σ-π separability, 244-246
Sigmatropic shift, 519 
Similarity transformations, 314, 459 
Singlet, 142 
Single-valued function, 21 
Singular matrix, 314 
Slater determinants, 137—139, 349, 369, 622 
Slater-type orbitals 
basis sets, 353-354 
contracted Gaussian functions for, 356 
description of, 146-147, 324 
minimal valence, 471 
p-type, 354 
Spectroscopic selection rules, 417—420 
Spectroscopists, 14 
Spherical harmonics 
angular momentum and, 110-115 
definition of, 111 
Spherical polar coordinates, 90, 217, 454 
Spherically symmetric potential, 91, 116 
Spin forbidden, 418 
Spin states, 136 
Spin-free density function, 166 
“Split shell” wavefunction, 195 
Splitting, 154 
Splitting constants, 271—275 
Square 
absolute, 21, 461 
of electromagnetic wave, 21 
rotation effects on, 431f 
Square matrix, 311 
Square symmetry, 266 
Square-integrable function, 22 
1s2s configuration of helium, 138-144 
Standing waves 
in clamped string, 7-9 
description of, 3-4 
Stark effect, 411 
State correlation diagram, 519f 
State energy, 501 
Slater-type orbitals, 208
Stereospecific reactions, 509 
Stilbene, 283 
Strain energy, 284 
Subjacent molecular orbitals, 506 
Sum of squares of dimensions, 457-458 
Superjacent molecular orbitals, 506 




Symmetric matrix, 310 
Symmetry 
in correlation diagrams, 514 
integration and, 472-476 
reflection, 205 
Symmetry elements, 436—441 
Symmetry operations, 431, 439 
Symmetry orbitals 
definition of, 468 
description of, 220—222, 515 
generating of, 467—470 
for homonuclear diatomic molecules, 
227f 
unnormalized, 468 
Symmetry point groups, 431—434 
Symmetry-forbidden reaction, 513, 517 


T
Term symbols, 152
Θ equation, 107-108
Third-order structure, 485
Three-dimensional Bernal graphite, 571f, 572
Three-dimensional box, particle in, 53-56
Time-dependent differential equation, 6
Time-dependent Schrödinger equation,
168-169
Time-dependent states, 180-185
Time-independent wave equation
description of, 6, 8f
Schrödinger. See Schrödinger equation
Topological determinant, 250
Torsional angle change, 485
Total electronic energy, self-consistent field,
352-353
Total extended Hückel energy, 331-332
Transformation matrix, 314
Transposing of a matrix, 309-312
Transverse waves, 5
Traveling waves, 1-3
Trimethylenemethane, 292
Triplet, 142
Tunneling, quantum-mechanical, 41
Two-dimensional graphite, 569, 570f
Two-dimensional periodicity, 562-565
Two-dimensional representations, 444, 454


U
Uncertainty principle of Heisenberg, 18-19,
31, 114, 179
Ungerade, 213, 215, 548
Uniform electrostatic perturbation, 396-403
Unit matrix, 311
Unitary transformation, 314, 445-446
United-atom atomic orbitals, 227f-228f
United-atom limits, 208
Unperturbed eigenfunctions, 395
Unperturbed energy, 392
Unrestricted Hartree-Fock energy, 357


V
Vacuum permittivity, 631 
Valence state ionization energy, 327 
Variables 
conjugate, 18 
separation of, 105-106 
Variation method 
linear, 197-206 
nonlinear 
calculations, 196—197 
for helium atom, 194—197 
for hydrogen atom, 191—194 
orbital approximation, 233-235 
spirit of, 190 
Variation principle, 178 
Variational wavefunction, 231—233 
Vectors 
addition of, 310-311 
column, 309 
matrix multiplication of, 311 
multiplication of, 310—311 
orthogonality of, 457 
in reciprocal space, 562-565 
representation, 457 
two-dimensional, 312 
Vertical planes, 438-439 
Vibrations of diatomic molecules, 84—85 
Virial relation, 373 
Virial theorem 
for atoms, 624—627 
for diatomic molecules, 627—628 
Virtual molecular orbitals, 362 
Volume-weighted probability density, 95f 


W 

Wi O,, 401-402 

Walden inversion, 520
Walsh diagrams, 495-505, 497f, 503f-504f
Wave(s)
amplitude of, 2 
classical equation for, 4-7 
compression, 5 
electromagnetic, 12-13 
frequency of, 3 
harmonic, 2, 3f 
light as, 10 
probability, 13 
standing, 3-4, 7-9 
transverse, 5 
traveling, 1-3 
wavelength of, 2 

Wave profile, 1
Wavefunctions
antisymmetric, 136, 213 
description of, 32-35 
determinantal, 349—350 
Dirac delta function as, 178 
gerade, 213, 215 
hamiltonian, 144 
of harmonic oscillator, 73-74
Hartree-Fock, 358, 375t, 627
lowest-energy, 93-97
nondegenerate, 448
normalization of, 81-82
orthogonality of, 37-38, 81-82
postulate, 166-167
Schrödinger equation for harmonic
oscillator, 80-81
self-consistent field, 349-350
Slater determinantal, 137, 139
“split shell,” 195
symmetry of, 35-37, 47, 136, 213
ungerade, 213, 215
variational, 231-233
zeroth-order, 393
Wavelength
de Broglie, 15
description of, 2
Wavenumber, 14, 528 
Wigner-Seitz unit cell, 564 
Wolfsberg-Helmholz relation, 327
Woodward-Hoffmann rule, 510-511
Work function, 12 


Z
Zeeman effect, 154-156
Zeeman splitting, 116
Zero average momentum, 184-185
Zero differential overlap, 385 
Zero-point energy, 31 
Zeroth-order wavefunction, 393 